AI ALIGNMENT FORUM
AF

Sid Black
Ω44000
Message
Subscribe to posts

Posts

Sorted by New
69The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable
10mo
11
33Conjecture Second Hiring Round
10mo
0
65Conjecture: a retrospective after 8 months of work
10mo
5
38Current themes in mechanistic interpretability research
1y
2
44Interpreting Neural Networks through the Polytope Lens
1y
11
54Conjecture: Internal Infohazard Policy
1y
2

Wiki Contributions

No wiki contributions to display.

Comments

No Comments Found