AI ALIGNMENT FORUM

295
Jett Janiak
Ω27000

Posts


Wikitag Contributions

Comments

No Comments Found
29 Polysemantic Attention Head in a 4-Layer Transformer (2y, 0 comments)
8 An adversarial example for Direct Logit Attribution: memory management in gelu-4l (2y, 0 comments)
36 A circuit for Python docstrings in a 4-layer attention-only transformer (3y, 3 comments)
No wikitag contributions to display.