AI ALIGNMENT FORUM

Transformers — AI Alignment Forum

Transformers

This page is a stub.

Posts tagged Transformers

62 How LLMs are and are not myopic

3y

7

1

74 Modern Transformers are AGI, and Human-Level

2y

31

1

60 [Linkpost] Interpreting Language Model Parameters

Lucius Bushnaq, Dan Braun, Oliver Clive-Griffin, Bart Bussmann, Nathan Hu, mivanitskiy, Linda Linsefors, Lee Sharkey

3mo

1

2

33 Residual stream norms grow exponentially over the forward pass

StefanHex, TurnTrout

3y

6

1

27 Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind

4y

9

0

17 Concrete Steps to Get Started in Transformer Mechanistic Interpretability

4y

5

1

8 AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them

3y

0

1

145 Transformers Represent Belief State Geometry in their Residual Stream

2y

4

1

28 Attention SAEs Scale to GPT-2 Small

Connor Kissane, robertzk, Arthur Conmy, Neel Nanda

2y

0

1

20 Brief Notes on Transformers

4y

2

1

21 Understanding mesa-optimization using toy models

tilmanr, rusheb, Guillaume Corlouer, Dan Valentine, afspies, mivanitskiy, Can

3y

0

1

17 Building a transformer from scratch - AI safety up-skilling challenge

Marius Hobbhahn

4y

0

0

13 Deconfusing In-Context Learning

Arjun Panickssery

2y

0

1

16 New Tool: the Residual Stream Viewer

3y

1

1

9 The positional embedding matrix and previous-token heads: how do they actually work?

3y

1
