Computing an exact quantilal policy — AI Alignment Forum