200 COP in MI: Interpreting Reinforcement Learning — AI Alignment Forum