Thought Crime: Backdoors & Emergent Misalignment in Reasoning Models — AI Alignment Forum