What is a training "step" vs. "episode" in machine learning? — AI Alignment Forum