Mechanistic Anomaly Detection Research Update — AI Alignment Forum