AI ALIGNMENT FORUM
AF

222
Wikitags

Lottery Ticket Hypothesis

Edited by Multicore last updated 31st May 2021

The Lottery Ticket Hypothesis claims that neural networks used in machine learning get most of their performance from sub-networks that are already present at initialization that approximate the final policy ("winning tickets"). The training process would, under this model, work by increasing weight on the lottery ticket sub-network and reducing weight on the rest of the network.

The hypothesis was proposed in a paper by Jonathan Frankle and Micheal Carbin of MIT CSAIL.

Subscribe
Discussion
Subscribe
Discussion
Posts tagged Lottery Ticket Hypothesis
106A Mechanistic Interpretability Analysis of Grokking
Neel Nanda, Tom Lieberum
3y
18
53Understanding “Deep Double Descent”
evhub
6y
35
42Gradations of Inner Alignment Obstacles
abramdemski
4y
19
40Updating the Lottery Ticket Hypothesis
johnswentworth
4y
24
21Understanding the Lottery Ticket Hypothesis
Alex Flint
4y
5
13What happens to variance as neural network training is scaled? What does it imply about "lottery tickets"?
Q
abramdemski, evhub
5y
Q
3
7Does the lottery ticket hypothesis suggest the scaling hypothesis?
Q
Daniel Kokotajlo
5y
Q
17
35Why Neural Networks Generalise, and Why They Are (Kind of) Bayesian
Joar Skalse
5y
53
Add Posts