Reward modeling
by Ciro Santilli. Updated 2024-12-15. Created 1970-01-01.
See e.g.:
Human Compatible
deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84