Less exploitable value-updating agent — AI Alignment Forum