A toy model of the treacherous turn — AI Alignment Forum