Trying to Make a Treacherous Mesa-Optimizer — AI Alignment Forum