AI ALIGNMENT FORUM

Recommended Sequences

Embedded Agency
by Abram Demski
Cartesian Frames
by Scott Garrabrant
Iterated Amplification
by Paul Christiano

AI Alignment Posts

Introducing the AI Alignment Forum (FAQ)
Oliver Habryka, Ben Pace, Raymond Arnold, Jim Babcock
27 points, 2y, 0 comments

Why I'm excited about Debate
Richard Ngo
14 points, 16h, 0 comments

Thoughts on Iason Gabriel’s Artificial Intelligence, Values, and Alignment
Alex Flint
14 points, 2d, 12 comments

Some recent survey papers on (mostly near-term) AI safety, security, and assurance
Aryeh Englander
4 points, 3d, 0 comments

[AN #133]: Building machines that can cooperate (with humans, institutions, or other machines)
Rohin Shah
11 points, 3d, 0 comments

Review of 'Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More'
Alex Turner
15 points, 4d, 1 comment

Transparency and AGI safety
jylin04
27 points, 5d, 5 comments

Prediction can be Outer Aligned at Optimum
Lukas Finnveden
7 points, 6d, 6 comments

Review of Soft Takeoff Can Still Lead to DSA
Daniel Kokotajlo
32 points, 6d, 1 comment

Imitative Generalisation (AKA 'Learning the Prior')
Beth Barnes
32 points, 7d, 5 comments
