Alex Sanchez-Stern — AI Alignment Forum

How Language Models Understand Nullability

by Anish Tondwalkar and Alex Sanchez-Stern

TL;DR: Large language models have demonstrated an emergent ability to write code, but this ability requires an internal representation of program semantics that is poorly understood. Recent interpretability work has demonstrated that it is possible to extract internal representations of natural language concepts, raising the possibility that similar techniques could...

Mar 11, 2025