x
Weight-Sparse Circuits May Be Interpretable Yet Unfaithful — AI Alignment Forum