algon33

Comments

My Understanding of Paul Christiano's Iterated Amplification AI Safety Research Agenda

This post deserves a strong upvote. Since you've done the review, would you mind answering a reference request? What papers/blog posts represent Paul's current views on corrigibility?