This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
Transformer Circuits
•
Applied to
An Analogy for Understanding Transformers
by
TheMcDouglas
19d
ago
•
Applied to
Finding Neurons in a Haystack: Case Studies with Sparse Probing
by
Wes Gurnee
1mo
ago
•
Applied to
Explaining the Transformer Circuits Framework by Example
by
RobertM
1mo
ago
•
Applied to
Addendum: More Efficient FFNs via Attention
by
Robert_AIZI
4mo
ago
•
Applied to
No Really, Attention is ALL You Need - Attention can do feedforward networks
by
Robert_AIZI
4mo
ago
•
Applied to
Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind
by
Cinera Verinia
5mo
ago
•
Applied to
200 Concrete Open Problems in Mechanistic Interpretability: Introduction
by
Neel Nanda
5mo
ago
•
Applied to
200 COP in MI: Techniques, Tooling and Automation
by
Neel Nanda
5mo
ago
•
Applied to
200 COP in MI: Analysing Training Dynamics
by
Neel Nanda
5mo
ago
•
Applied to
200 COP in MI: Exploring Polysemanticity and Superposition
by
Neel Nanda
5mo
ago
•
Applied to
200 COP in MI: Interpreting Algorithmic Problems
by
Neel Nanda
5mo
ago
•
Applied to
200 COP in MI: Looking for Circuits in the Wild
by
Neel Nanda
5mo
ago
•
Applied to
A Walkthrough of In-Context Learning and Induction Heads (w/ Charles Frye) Part 1 of 2
by
Neel Nanda
6mo
ago
•
Applied to
A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)
by
Neel Nanda
7mo
ago
•
Applied to
Anthropic's SoLU (Softmax Linear Unit)
by
Joel Burget
1y
ago
•
Applied to
Understanding the tensor product formulation in Transformer Circuits
by
Ruben Bloom
1y
ago
•
Created by
Ruben Bloom
at
1y