Michaël Trazzi

Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs

Owain Evans is an AI Alignment researcher, research associate at the Center of Human Compatible AI at UC Berkeley, and now leading a new AI safety research group. In this episode we discuss two of his recent papers, “Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs” (LW)...

Aug 24, 202456

Eric Michaud on the Quantization Model of Neural Scaling, Interpretability and Grokking

Eric is a PhD student in the Department of Physics at MIT working with Max Tegmark on improving our scientific/theoretical understanding of deep learning -- understanding what deep neural networks do internally and why they work so well. We mostly talk about Eric's paper, The Quantization Model of Neural Scaling,...

Jul 12, 202310

Jesse Hoogland on Developmental Interpretability and Singular Learning Theory

Jesse Hoogland is a research assistant at David Krueger's lab in Cambridge studying AI Safety who has recently been publishing on LessWrong about how to apply Singular Learning Theory to Alignment, and even organized some workshop in Berkeley last week around this. I thought it made sense to interview him...

Jul 6, 202342

Ethan Perez on the Inverse Scaling Prize, Language Feedback and Red Teaming

I talked to Ethan Perez about the Inverse Scaling Prize (deadline August 27!), Training Language Models with Language Feedback and Red-teaming Language Models with Language models. Below are some highlighted quotes from our conversation (available on Youtube, Spotify, Google Podcast, Apple Podcast). For the full context for each of these...

Aug 24, 202226

Why Copilot Accelerates Timelines

> "Say we have intelligences that are narrowly human / superhuman on every task you can think of (which, for what it’s worth, I think will happen within 5-10 years). How long before we have self-replicating factories? Until foom? Until things are dangerously out of our control? Until GDP doubles...

Apr 26, 202235

OpenAI Solves (Some) Formal Math Olympiad Problems

Epistemic status: I have just skimmed through OpenAI's blogpost and paper, I do not fully understand the details. From the blogpost > We built a neural theorem prover for Lean that learned to solve a variety of challenging high-school olympiad problems, including problems from the AMC12 and AIME competitions, as...

Feb 2, 202278

Phil Trammell on Economic Growth Under Transformative AI

This is a transcript with slides for the latest episode (audio, youtube) of "The Inside View", a podcast I host about the future of AI progress. I interview Phil Trammell, an Oxford PhD student in economics and research associate at the Global Priorities Institute. Phil was my roommate and last...

Oct 24, 202114

Michaël Trazzi

Michaël Trazzi

OpenAI Solves (Some) Formal Math Olympiad Problems

A Gym Gridworld Environment for the Treacherous Turn

An Increasingly Manipulative Newsfeed

Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs

Michaël Trazzi

OpenAI Solves (Some) Formal Math Olympiad Problems

A Gym Gridworld Environment for the Treacherous Turn

An Increasingly Manipulative Newsfeed

Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs

Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs

Eric Michaud on the Quantization Model of Neural Scaling, Interpretability and Grokking

Jesse Hoogland on Developmental Interpretability and Singular Learning Theory

Ethan Perez on the Inverse Scaling Prize, Language Feedback and Red Teaming

Why Copilot Accelerates Timelines

OpenAI Solves (Some) Formal Math Olympiad Problems

Phil Trammell on Economic Growth Under Transformative AI