This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
CAST: Corrigibility As Singular Target
54
0. CAST: Corrigibility as Singular Target
Max Harms
1y
4
18
1. The CAST Strategy
Max Harms
1y
17
24
2. Corrigibility Intuition
Max Harms
1y
9
12
3a. Towards Formal Corrigibility
Max Harms
1y
2
13
3b. Formal (Faux) Corrigibility
Max Harms
1y
12
21
4. Existing Writing on Corrigibility
Max Harms
1y
10
10
5. Open Corrigibility Questions
Max Harms
1y
0