x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
Considerations in diffuse control — AI Alignment Forum
Considerations in diffuse control
6
Methodological considerations in making malign initializations for control research
Alek Westover
2mo
0
2
Three visions for diffuse control
Alek Westover
12d
0
17
Four Downsides of Training Policies Online
Alek Westover
2mo
0
4
Theoretical predictions on the sample efficiency of training policies and activation monitors
Alek Westover
1mo
0
6
How will we do SFT on models with opaque reasoning?
Alek Westover
,
Vivek Hebbar
9h
0