The alignment stability problem — AI Alignment Forum