AI Alignment Forum: Tags

Machine Learning (ML)

• Applied to Basic Mathematics of Predictive Coding by Adam Shai 2d ago
• Applied to Discursive Competence in ChatGPT, Part 2: Memory for Texts by Bill Benzon 3d ago
• Applied to Influence functions - why, what and how by Nina Rimsky 16d ago
• Applied to Mech Interp Challenge: September - Deciphering the Addition Model by TheMcDouglas 18d ago
• Applied to Expanding the Scope of Superposition by Derek Larson 19d ago
• Applied to Explaining grokking through circuit efficiency by Vikrant Varma 23d ago
• Applied to Report on Analyzing Connotation Frames in Evolving Wikipedia Biographies by Maira Elahi 1mo ago
• Applied to Apply to a small iteration of MLAB to be run in Oxford by RP 1mo ago
• Applied to Is this the beginning of the end for LLMS [as the royal road to AGI, whatever that is]? by Bill Benzon 1mo ago
• Applied to Causality and a Cost Semantics for Neural Networks by scottviteri 1mo ago
• Applied to Against Almost Every Theory of Impact of Interpretability by Charbel-Raphael Segerie 1mo ago
• Applied to Google DeepMind's RT-2 by SandXbox 2mo ago
• Applied to The positional embedding matrix and previous-token heads: how do they actually work? by Adam Yedidia 2mo ago
• Applied to Mech Interp Challenge: August - Deciphering the First Unique Character Model by TheMcDouglas 2mo ago