x
Weird Generalization & Inductive Backdoors — AI Alignment Forum