Sharp Left Turn — AI Alignment Forum

Sharp Left Turn

Edited by Multicore et al. Last updated 30th Dec 2024.

The Sharp Left Turn is a scenario in which, as an AI trains, its capabilities generalize across many domains while the alignment properties that held at earlier stages fail to generalize to those new domains.

See also: Threat Models, AI Takeoff, AI Risk

Posts tagged Sharp Left Turn

A central AI alignment problem: capabilities generalization, and the sharp left turn

Refining the Sharp Left Turn threat model, part 1: claims and mechanisms

Vika, Vikrant Varma, Ramana Kumar, Mary Phuong

We may be able to see sharp left turns coming

Ethan Perez, Neel Nanda

“Sharp Left Turn” discourse: An opinionated review

Refining the Sharp Left Turn threat model, part 2: applying alignment techniques

Vika, Vikrant Varma, Ramana Kumar, Rohin Shah

Evolution provides no evidence for the sharp left turn

It matters when the first sharp left turn happens

We don't understand what happened with culture enough

Reframing inner alignment

Response to Quintin Pope's Evolution Provides No Evidence For the Sharp Left Turn

Smoke without fire is scary

Disentangling inner alignment failures
