AI ALIGNMENT FORUM
AF

Jett Janiak
Ω27000
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
No Comments Found
29Polysemantic Attention Head in a 4-Layer Transformer
2y
0
8An adversarial example for Direct Logit Attribution: memory management in gelu-4l
2y
0
36A circuit for Python docstrings in a 4-layer attention-only transformer
3y
3