On Developing a Mathematical Theory of Interpretability — AI Alignment Forum