x

AI ALIGNMENT FORUM

AF

MATS Program — AI Alignment Forum

MATS Program

Edited by Ryan Kidd, et al. last updated 18th Mar 2026

The Machine Alignment, Transparency, and Security (MATS) Program is an independent research and educational seminar program that provides emerging researchers with mentorship, talks & workshops, research support, and connections with the SF Bay Area and London AI safety research communities.

Add Posts

5

5

Posts tagged MATS Program

3

134SolidGoldMagikarp (plus, prompt generation)

Jessica Rumbelow, mwatkins

3y

17

4

32SERI MATS Program - Winter 2022 Cohort

Ryan Kidd, Victor Warlop, Christian Smith

4y

0

5

140Understanding and controlling a maze-solving policy network

TurnTrout, peligrietzer, Ulisse Mini, Monte M, David Udell

3y

23

3

43Soft optimization makes the value target bigger

3y

4

3

25SERI ML Alignment Theory Scholars Program 2022

Ryan Kidd, Victor Warlop, ozhang

4y

0

2

56Finite Factored Sets in Pictures

Magdalena Wache

3y

2

2

57Recontextualization Mitigates Specification Gaming Without Modifying the Specification

ariana_azarbal, Victor Gillioz, TurnTrout, cloud

7mo

0

3

47Predictions for shard theory mechanistic interpretability results

TurnTrout, Ulisse Mini, peligrietzer

3y

6

1

33Modulating sycophancy in an RLHF model via activation steering

Nina Panickssery

3y

19

2

17Infra-Bayesian haggling

2y

1

2

14Normative vs Descriptive Models of Agency

3y

2

1

121Steering GPT-2-XL by adding an activation vector

TurnTrout, Monte M, David Udell, lisathiergart, Ulisse Mini

3y

63

1

145Transformers Represent Belief State Geometry in their Residual Stream

2y

4

1

111Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Jan Betley, Owain_Evans

1y

1

1

65models have some pretty funny attractor states

aryaj, Senthooran Rajamanoharan, Neel Nanda

3mo

0

Load More (15/158)

Add Posts