x
Why Not Just Train For Interpretability? — AI Alignment Forum