[Link] Why I’m excited about AI-assisted human feedback

janleike

17 [Link] Why I’m excited about AI-assisted human feedback

by janleike

6th Apr 2022

1 min read

0

17

This is a link post for https://aligned.substack.com/p/ai-assisted-human-feedback

I'm writing a sequence of posts on the approach to alignment I'm currently most excited about. This first post argues for recursive reward modeling and the problem it's meant to address (scaling RLHF to tasks that are hard to evaluate).

RLHFAI

Frontpage

New Comment

Moderation Log