CAST: Corrigibility As Singular Target — AI Alignment Forum
0. CAST: Corrigibility as Singular Target
Max Harms
2y
1. The CAST Strategy
Max Harms
2y
2. Corrigibility Intuition
Max Harms
2y
3a. Towards Formal Corrigibility
Max Harms
2y
3b. Formal (Faux) Corrigibility
Max Harms
2y
4. Existing Writing on Corrigibility
Max Harms
2y
5. Open Corrigibility Questions
Max Harms
2y
Serious Flaws in CAST
Max Harms
2mo