Box inversion hypothesis — AI Alignment Forum