AI ALIGNMENT FORUM
DeepMind Alignment Team on Threat Models
A collection of posts presenting our understanding of and opinions on alignment threat models.
107
DeepMind alignment team opinions on AGI ruin arguments
Victoria Krakovna
5mo
5
45
Will Capabilities Generalise More?
Ramana Kumar
7mo
28
37
Clarifying AI X-risk
Zachary Kenton, Rohin Shah, David Lindner, Vikrant Varma, Victoria Krakovna, Mary Phuong, Ramana Kumar, Elliot Catt
3mo
14
32
Threat Model Literature Review
Zachary Kenton, Rohin Shah, David Lindner, Vikrant Varma, Victoria Krakovna, Mary Phuong, Ramana Kumar, Elliot Catt
3mo
3
32
Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Victoria Krakovna, Vikrant Varma, Ramana Kumar, Mary Phuong
6mo
1
20
Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Victoria Krakovna, Vikrant Varma, Ramana Kumar, Rohin Shah
2mo
4