x
Selective regularization for alignment-focused representation engineering — AI Alignment Forum