OurBigBook
About
$
Donate
Sign in
Sign up
by
Ciro Santilli
(
@cirosantilli,
34
)
Reward modeling
...
Area of technology
Information technology
Computer
Machine learning
Artificial intelligence
AI alignment
Like
(0)
0 By others
on same topic
0 Discussions
Updated
2024-11-15
Created
1970-01-01
See my version
See e.g.:
Human Compatible
deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84
Ancestors
(8)
AI alignment
Artificial intelligence
Machine learning
Computer
Information technology
Area of technology
Technology
Home
Incoming links
(1)
Human Compatible
View article source
Discussion
(0)
Subscribe (1)
New discussion
There are no discussions about this article yet.
Articles by others on the same topic
(0)
There are currently no matching articles.
See all articles in the same topic
Create my own version