OurBigBook
About
$
Donate
Sign in
Sign up
Reward modeling
New to
topics
?
Read the documentation here!
Top articles
Latest articles
New article in topic
Show body
Total articles:
1
0
Reward modeling
by
Ciro Santilli
34
Updated
2024-11-19
Created
1970-01-01
See e.g.:
Human Compatible
deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84
Total articles:
1