This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
3195
Matthew Rahtz — AI Alignment Forum
Matthew Rahtz
Posts
Sorted by New
Wikitag Contributions
Comments
Sorted by
Newest
24
Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla
2y
0
20
Specification gaming: the flip side of AI ingenuity
5y
9
Comments