Jett Janiak — AI Alignment Forum
Jett Janiak
Posts
29 | Polysemantic Attention Head in a 4-Layer Transformer | 2y | 0
8 | An adversarial example for Direct Logit Attribution: memory management in gelu-4l | 2y | 0
36 | A circuit for Python docstrings in a 4-layer attention-only transformer | 3y | 3