What happens to variance as neural network training is scaled? What does it imply about "lottery tickets"? — AI Alignment Forum