Reward function learning: the learning process — AI Alignment Forum