AI ALIGNMENT FORUMTags
AF

Sharp Left Turn

EditHistorySubscribe
Discussion (0)
Help improve this page
EditHistorySubscribe
Discussion (0)
Help improve this page
Sharp Left Turn
Random Tag
Contributors
1Multicore

A Sharp Left Turn is a scenario where, as an AI trains, its capabilities generalize across many domains while the alignment properties that held at earlier stages fail to generalize to the new domains.

See also: Threat Models, AI Takeoff, AI Risk

Posts tagged Sharp Left Turn
5
85A central AI alignment problem: capabilities generalization, and the sharp left turn
Nate Soares
1y
18
2
36Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Victoria Krakovna, Vikrant Varma, Ramana Kumar, Mary Phuong
1y
1
1
52Evolution provides no evidence for the sharp left turn
Quintin Pope
5mo
12
1
16We may be able to see sharp left turns coming
Ethan Perez, Neel Nanda
1y
25
1
28Reframing inner alignment
davidad (David A. Dalrymple)
9mo
9
1
20Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Victoria Krakovna, Vikrant Varma, Ramana Kumar, Rohin Shah
10mo
5
1
23Smoke without fire is scary
Adam Jermyn
1y
7
1
21It matters when the first sharp left turn happens
Adam Jermyn
1y
1
1
9Disentangling inner alignment failures
Erik Jenner
1y
4
Add Posts