Beware of black boxes in AI alignment research — AI Alignment Forum