Human in the counterfactual loop — AI Alignment Forum