Realistic Reward Hacking Induces Different and Deeper Misalignment — AI Alignment Forum