x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
AI Auditing — AI Alignment Forum
AI Auditing
Edited by
Raemon
last updated
4th Aug 2025
Formerly "auditing games"
Subscribe
Discussion
Subscribe
Discussion
Posts tagged
AI Auditing
Most Relevant
4
43
Automating Auditing: An ambitious concrete technical research proposal
evhub
4y
11
3
76
A transparency and interpretability tech tree
evhub
3y
10
1
82
Auditing language models for hidden objectives
Sam Marks
,
Johannes Treutlein
,
dmz
,
Sam Bowman
,
Hoagy
,
Carson Denison
,
Kei Nishimura-Gasparian
,
7vik
,
Akbir Khan
,
Austin Meek
,
Euan Ong
,
Christopher Olah
,
Fabien Roger
,
jeanne_
,
Meg
,
Drake Thomas
,
Adam Jermyn
,
Monte M
,
evhub
9mo
3
1
62
Towards Alignment Auditing as a Numbers-Go-Up Science
Sam Marks
4mo
8
1
31
Putting up Bumpers
Sam Bowman
7mo
7
2
22
What progress have we made on automated auditing?
Q
LawrenceC
1y
Q
0
1
18
Auditing games for high-level interpretability
Paul Colognese
3y
0