A Toy Environment For Exploring Reasoning About Reward — AI Alignment Forum