Inverse reinforcement learning on self, pre-ontology-change — AI Alignment Forum