AI ALIGNMENT FORUM
Recommended Sequences
AGI safety from first principles, by Richard Ngo
Embedded Agency, by Abram Demski
2022 MIRI Alignment Discussion, by Rob Bensinger
AI Alignment Posts
Welcome & FAQ! by Ruben Bloom, Oliver Habryka (2y, 50 karma, 8 comments)
Updating Drexler's CAIS model, by Matthew Barnett (1d, 15 karma, 9 comments)
[Replication] Conjecture's Sparse Coding in Small Transformers, by Hoagy, Logan Riggs Smith (1d, 16 karma, 0 comments)
LLMs Sometimes Generate Purely Negatively-Reinforced Text, by Fabien Roger (2d, 57 karma, 2 comments)
AXRP Episode 22 - Shard Theory with Quintin Pope, by DanielFilan (2d, 26 karma, 2 comments)
Instrumental Convergence? [Draft], by J. Dmitri Gallow (3d, 12 karma, 0 comments)
MetaAI: less is less for alignment., by Cleo Nardo (5d, 23 karma, 3 comments)
[Event] Virtual AI Safety Unconference (VAISU) (Jul 31st, 0 comments)
TASRA: A Taxonomy and Analysis of Societal-Scale Risks from AI, by Andrew Critch (5d, 27 karma, 0 comments)
Contingency: A Conceptual Tool from Evolutionary Biology for Alignment, by clem_acs (5d, 16 karma, 0 comments)