AI ALIGNMENT FORUM

Transformers

This page is a stub.
Posts tagged Transformers
61 How LLMs are and are not myopic
janus
2y
7
69 Modern Transformers are AGI, and Human-Level
Abram Demski
1y
30
33 Residual stream norms grow exponentially over the forward pass
Stefan Heimersheim, Alex Turner
2y
6
27 Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind
Cinera Verinia
2y
9
17 Concrete Steps to Get Started in Transformer Mechanistic Interpretability
Neel Nanda
3y
5
8 AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them
Roman Leventov
2y
0
143 Transformers Represent Belief State Geometry in their Residual Stream
Adam Shai
1y
4
28 Attention SAEs Scale to GPT-2 Small
Connor Kissane, Robert Krzyzanowski, Arthur Conmy, Neel Nanda
1y
0
20 Brief Notes on Transformers
Adam Jermyn
3y
2
21 Understanding mesa-optimization using toy models
tilmanr, rusheb, Guillaume Corlouer, Dan Valentine, Alex Spies, Michael Ivanitskiy, Can
2y
0
17 Building a transformer from scratch - AI safety up-skilling challenge
Marius Hobbhahn
3y
0
13 Deconfusing In-Context Learning
Arjun Panickssery
1y
0
16 New Tool: the Residual Stream Viewer
Adam Yedidia
2y
1
9 The positional embedding matrix and previous-token heads: how do they actually work?
Adam Yedidia
2y
1