x
Sleeper agents appear resilient to activation steering — AI Alignment Forum