AI ALIGNMENT FORUM
916
Joseph Miller
Ω53100
Posts

Comments

2 Joseph Miller's Shortform
1y
0
Fact Finding: Simplifying the Circuit (Post 2)
Joseph Miller 2y 10

What's up with the <pad> token (<pad>==<bos>==<eos> in Pythia) in the attention diagram? I assume that doesn't need to be there?
64 Gradient Routing: Masking Gradients to Localize Computation in Neural Networks
10mo
4
34 Even Superhuman Go AIs Have Surprising Failure Modes
2y
9
53 We Found An Neuron in GPT-2
3y
0