AI ALIGNMENT FORUM
AF

332
Jett Janiak
Ω27000
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No Comments Found
No wikitag contributions to display.
29Polysemantic Attention Head in a 4-Layer Transformer
2y
0
8An adversarial example for Direct Logit Attribution: memory management in gelu-4l
2y
0
36A circuit for Python docstrings in a 4-layer attention-only transformer
3y
3