Delegative Reinforcement Learning with a Merely Sane Advisor — AI Alignment Forum