x
Exploring unfaithful/deceptive CoT in reasoning models — AI Alignment Forum