Mechanistic Interpretability Many of you will be familiar with the following section. Please skip through to the next. The field of mechanistic interpretability (MI) is not a single, monolithic research program but rather a rapidly evolving collection of methods, tools, and research programs. These are united by the shared ambition...