x

AI ALIGNMENT FORUM

AF

Baram Sosis — AI Alignment Forum

Baram Sosis

Top postsTop post

Baram Sosis

Message

313

Ω

5

4

18

1y

Baram Sosis

313

Ω

5

1y

Measuring Beliefs of Language Models During Chain-of-Thought Reasoning

Based on research performed as a PIBBSS Fellow with Tomáš Gavenčiak as well as work supported by EA Funds and Open Philanthropy. tl;dr: I'm investigating whether LLMs track and update beliefs during chain-of-thought reasoning. Preliminary experiments with older models (without reasoning training) have not been able to measure this; I...

Apr 18, 2025•12