AI ALIGNMENT FORUM
Wikitags
AF

Subscribe
Discussion0
2

Sharp Left Turn

Subscribe
Discussion0
2
Written by Multicore, et al. last updated 30th Dec 2024

Sharp Left Turn is a scenario where, as an AI trains, its capabilities generalize across many domains while the alignment properties that held at earlier stages fail to generalize to the new domains.

See also: Threat Models, AI Takeoff, AI Risk

Posts tagged Sharp Left Turn
74A central AI alignment problem: capabilities generalization, and the sharp left turn
Nate Soares
3y
18
37Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Victoria Krakovna, Vikrant Varma, Ramana Kumar, Mary Phuong
3y
3
16We may be able to see sharp left turns coming
Ethan Perez, Neel Nanda
3y
25
84“Sharp Left Turn” discourse: An opinionated review
Steve Byrnes
4mo
13
23We don't understand what happened with culture enough
Jan_Kulveit
2y
3
30Reframing inner alignment
davidad (David A. Dalrymple)
3y
9
19Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Victoria Krakovna, Vikrant Varma, Ramana Kumar, Rohin Shah
3y
5
67Evolution provides no evidence for the sharp left turn
Quintin Pope
2y
12
46Response to Quintin Pope's Evolution Provides No Evidence For the Sharp Left Turn
Zvi
2y
0
23Smoke without fire is scary
Adam Jermyn
3y
9
22It matters when the first sharp left turn happens
Adam Jermyn
3y
1
9Disentangling inner alignment failures
Erik Jenner
3y
4
Add Posts