Redirect Notice
The previous page is sending you to
https://medium.com/@deepmindsafetyresearch/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84
.
If you do not want to visit that page, you can
return to the previous page
.