Geoffrey Irving

Announcing our $160M grant from Coefficient Giving

We are excited to announce that Resolution (fka Sequent) has a $160M grant from Coefficient Giving (cG) to put rigorous alignment research on a (closer to) even footing with the frontier labs. We will use it to accelerate progress towards higher-confidence alignment, or to find evidence and obstacles showing why...

Jul 949

Geoffrey Irving's Shortform

Jun 306

Resolution (fka Sequent): scale and automation for higher confidence in alignment

EDIT: We originally launched under the name Sequent. Read why we renamed to Resolution. Alignment is not on track Artificial superintelligence (ASI) may be developed in the next few years. It is unclear whether alignment is on track to be ready on the same timeframe. At a minimum, the empirical...

Jun 10297

Research Areas in Cognitive Science (The Alignment Project by UK AISI)

The Alignment Project is a global fund of over £15 million, dedicated to accelerating progress in AI control and alignment research. It is backed by an international coalition of governments, industry, venture capital and philanthropic funders. This post is part of a sequence on research areas that we are excited...

Aug 1, 202512

The Alignment Project by UK AISI

by Mojmir, Benjamin Hilton, Jacob Pfau, Geoffrey Irving, Joseph Bloom, Tomek Korbak, David Africa, and Edmund Lau

The Alignment Project is a global fund of over £15 million, dedicated to accelerating progress in AI control and alignment research. It is backed by an international coalition of governments, industry, venture capital and philanthropic funders. This sequence sets out the research areas we are excited to fund – we...

Aug 1, 202529

The need to relativise in debate

Summary: This post highlights the need for results in AI safety, such as debate or scalable oversight, to 'relativise', i.e. for the result to hold even when all parties are given access to a black box 'oracle' (the oracle might be a powerful problem solver, a random function, or a...

Jun 26, 202531

Prover-Estimator Debate: A New Scalable Oversight Protocol

by Jonah Brown-Cohen and Geoffrey Irving

Linkpost to arXiv: https://arxiv.org/abs/2506.13609. Summary: We present a scalable oversight protocol where honesty is incentivized at equilibrium. Prior debate protocols allowed a dishonest AI to force an honest AI opponent to solve a computationally intractable problem in order to win. In contrast, prover-estimator debate incentivizes honest equilibrium behavior, even when...

Jun 17, 202589

Geoffrey Irving

Geoffrey Irving

Resolution (fka Sequent): scale and automation for higher confidence in alignment

DeepMind is hiring for the Scalable Alignment and Alignment Teams

UK AISI’s Alignment Team: Research Agenda

Prover-Estimator Debate: A New Scalable Oversight Protocol

Geoffrey Irving

Resolution (fka Sequent): scale and automation for higher confidence in alignment

DeepMind is hiring for the Scalable Alignment and Alignment Teams

UK AISI’s Alignment Team: Research Agenda

Prover-Estimator Debate: A New Scalable Oversight Protocol

Announcing our $160M grant from Coefficient Giving

Geoffrey Irving's Shortform

Resolution (fka Sequent): scale and automation for higher confidence in alignment

Research Areas in Cognitive Science (The Alignment Project by UK AISI)

The Alignment Project by UK AISI

The need to relativise in debate

Prover-Estimator Debate: A New Scalable Oversight Protocol