Joseph Miller

Gradient Routing: Masking Gradients to Localize Computation in Neural Networks

by cloud, Jacob G-W, Evzen, Joseph Miller, and TurnTrout

We present gradient routing, a way of controlling where learning happens in neural networks. Gradient routing applies masks to limit the flow of gradients during backpropagation. By supplying different masks for different data points, the user can induce specialized subcomponents within a model. We think gradient routing has the potential...

Dec 6, 2024 • 179

Even Superhuman Go AIs Have Surprising Failure Modes

by AdamGleave, EuanMcLean, Tony Wang, Kellin Pelrine, Tom Tseng, Yawen Duan, Joseph Miller, and MichaelDennis

In March 2016, AlphaGo defeated the Go world champion Lee Sedol, winning four games to one. Machines had finally become superhuman at Go. Since then, Go-playing AI has only grown stronger. The supremacy of AI over humans seemed assured, with Lee Sedol commenting they are an "entity that cannot be...

Jul 20, 2023 • 131

We Found An Neuron in GPT-2

We started out with the question: How does GPT-2 know when to use the word "an" over "a"? The choice depends on whether the word that comes after starts with a vowel or not, but GPT-2 can only output one word at a time. We still don’t have a full...

Feb 11, 2023 • 143