Curated Sequences

Embedded Agency
Iterated Amplification
Value Learning
Risks from Learned Optimization

Community Sequences

Understanding Goal-Directedness
Cartesian Frames
Infra-Bayesianism
Thoughts on Goal-Directedness
Consequences of Logical Induction
Subagents and impact measures
If I were a well-intentioned AI...
Partial Agency
AI Alignment Writing Day 2019
Logical Counterfactuals and Proposition graphs
AI Alignment Writing Day 2018
Reframing Impact
Load More (12/16)