x
Can we interpret latent reasoning using current mechanistic interpretability tools? — AI Alignment Forum