Gradient descent might see the direction of the optimum from far away — AI Alignment Forum