This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
Jett Janiak
Jett Janiak
Posts
Sorted by New
29
Polysemantic Attention Head in a 4-Layer Transformer
6mo
0
8
An adversarial example for Direct Logit Attribution: memory management in gelu-4l
8mo
0
34
A circuit for Python docstrings in a 4-layer attention-only transformer
1y
2
Wiki Contributions
Comments