Twitter thread on AI safety evals — AI Alignment Forum