x
Rational Animations' intro to mechanistic interpretability — AI Alignment Forum