Charbel-Raphaël and Lucius discuss interpretability — AI Alignment Forum