This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
Machine Learning (ML)
•
Applied to
Basic Mathematics of Predictive Coding
by
Adam Shai
2d
ago
•
Applied to
Discursive Competence in ChatGPT, Part 2: Memory for Texts
by
Bill Benzon
3d
ago
•
Applied to
Influence functions - why, what and how
by
Nina Rimsky
16d
ago
•
Applied to
Mech Interp Challenge: September - Deciphering the Addition Model
by
TheMcDouglas
18d
ago
•
Applied to
Expanding the Scope of Superposition
by
Derek Larson
19d
ago
•
Applied to
Explaining grokking through circuit efficiency
by
Vikrant Varma
23d
ago
•
Applied to
Report on Analyzing Connotation Frames in Evolving Wikipedia Biographies
by
Maira Elahi
1mo
ago
•
Applied to
Apply to a small iteration of MLAB to be run in Oxford
by
RP
1mo
ago
•
Applied to
Is this the beginning of the end for LLMS [as the royal road to AGI, whatever that is]?
by
Bill Benzon
1mo
ago
•
Applied to
Causality and a Cost Semantics for Neural Networks
by
scottviteri
1mo
ago
•
Applied to
Against Almost Every Theory of Impact of Interpretability
by
Charbel-Raphael Segerie
1mo
ago
•
Applied to
Google DeepMind's RT-2
by
SandXbox
2mo
ago
•
Applied to
The positional embedding matrix and previous-token heads: how do they actually work?
by
Adam Yedidia
2mo
ago
•
Applied to
Mech Interp Challenge: August - Deciphering the First Unique Character Model
by
TheMcDouglas
2mo
ago