[AN #73]: Detecting catastrophic failures by learning how agents tend to break — AI Alignment Forum