x
Backdoors have universal representations across large language models — AI Alignment Forum