How to get value learning and reference wrong — AI Alignment Forum