x
Counterfactual control incentives — AI Alignment Forum