ActAdd: Steering Language Models without Optimization — AI Alignment Forum