x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
Mechanistic Interpretability Puzzles — AI Alignment Forum
Mechanistic Interpretability Puzzles
27
Mech Interp Puzzle 1: Suspiciously Similar Embeddings in GPT-Neo
Neel Nanda
2y
2
17
Mech Interp Puzzle 2: Word2Vec Style Embeddings
Neel Nanda
2y
2