[AN #89]: A unifying formalism for preference learning algorithms — AI Alignment Forum