x
Mesa-optimization for goals defined only within a training environment is dangerous — AI Alignment Forum