Source: /cirosantilli/reward-modeling

= Reward modeling

See e.g.:
* <Human Compatible>
* https://deepmindsafetyresearch.medium.com/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84