x
Empirical mechanistic anomaly detection — AI Alignment Forum