x
Full toy model for preference learning — AI Alignment Forum