Aligning a toy model of optimization — AI Alignment Forum