OurBigBook
About
$
Donate
Sign in
Sign up
by
Ciro Santilli
(
@cirosantilli,
32
)
Reward modeling
See e.g.:
Human Compatible
deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84
Ancestors
AI alignment
Artificial intelligence
Machine learning
Computer
Information technology
Area of technology
Technology
Index
Incoming links
Human Compatible
View article source
Discussion (0)
Subscribe (1)
New discussion
There are no discussions about this article yet.
Articles by others on the same topic (0)
There are currently no matching articles.
See all articles in the same topic
Create my own version