OurBigBook
.com (beta)
About
$ Donate
Sign in
Sign up
by
Ciro Santilli
(@cirosantilli,
32
)
Reward modeling
See e.g.:
Human Compatible
deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84
Ancestors
AI alignment
Artificial intelligence
Machine learning
Computer
Information technology
Area of technology
Technology
Index
Incoming links
Human Compatible
Discussion (0)
Subscribe (1)
Sign up
or
sign in
create discussions.
There are no discussions about this article yet.
View article source