Seriously, what goes wrong with "reward the agent when it makes you smile"? — AI Alignment Forum