Source: cirosantilli/reward-modeling
= Reward modeling
See e.g.: <Human Compatible>
* https://deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84
= Reward modeling
See e.g.: <Human Compatible>
* https://deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84