AI ALIGNMENT FORUM

Myopia

Edited by Dakara, abramdemski, et al. last updated 30th Dec 2024

Myopia refers to short-sightedness in planning and decision-making: a tendency to prioritize immediate or short-term outcomes while disregarding longer-term consequences.

The most extreme form of myopia occurs when an agent considers only immediate rewards, completely disregarding future consequences. In artificial intelligence contexts, a perfectly myopic agent would optimize solely for the current query or task without attempting to influence future outcomes.

Myopic agents demonstrate several notable properties:

  • Limited temporal scope in decision-making
  • Focus on immediate reward optimization
  • Reduced incentives to pursue instrumental subgoals (such as deception or power-seeking), since these pay off only over longer horizons
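The limited temporal scope above can be made concrete with discounting: a perfectly myopic agent is one whose discount factor is zero, so only the immediate reward enters its decision. Below is a minimal sketch (not from this article; all names and the toy reward sequences are illustrative) contrasting a myopic agent with a far-sighted one in a two-action setting.

```python
# Toy illustration: a myopic agent (gamma = 0) versus a far-sighted agent
# (gamma close to 1) choosing between a small immediate reward and a
# larger delayed one. All names here are hypothetical, for illustration.

def best_action(rewards, gamma, horizon):
    """Return the action maximizing discounted return over `horizon` steps.

    `rewards` maps each action to its per-step reward sequence.
    With gamma = 0, only the first-step reward matters (perfect myopia).
    """
    def discounted_return(seq):
        return sum(gamma ** t * r for t, r in enumerate(seq[:horizon]))
    return max(rewards, key=lambda a: discounted_return(rewards[a]))

# "grab": small reward now, nothing later.
# "wait": nothing now, a large reward later.
rewards = {"grab": [1, 0, 0], "wait": [0, 0, 10]}

print(best_action(rewards, gamma=0.0, horizon=3))  # myopic -> "grab"
print(best_action(rewards, gamma=0.9, horizon=3))  # far-sighted -> "wait"
```

The same structure underlies the safety interest in myopia: an agent that discounts the future to zero has no reason to sacrifice current-task performance for later influence.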
Posts tagged Myopia
  • 34 · Partial Agency, by abramdemski (6y, 16 comments)
  • 40 · The Credit Assignment Problem, by abramdemski (6y, 32 comments)
  • 62 · How LLMs are and are not myopic, by janus (2y, 7 comments)
  • 20 · Towards a mechanistic understanding of corrigibility, by evhub (6y, 25 comments)
  • 33 · Open Problems with Myopia, by Mark Xu, evhub (4y, 15 comments)
  • 24 · Steering Behaviour: Testing for (Non-)Myopia in Language Models, by Evan R. Murphy, Megan Kinniment (3y, 5 comments)
  • 32 · LCDT, A Myopic Decision Theory, by adamShimi, evhub (4y, 44 comments)
  • 17 · Defining Myopia, by abramdemski (6y, 13 comments)
  • 35 · Arguments against myopic training, by Richard_Ngo (5y, 38 comments)
  • 48 · You can still fetch the coffee today if you're dead tomorrow, by davidad (3y, 14 comments)
  • 91 · The Parable of Predict-O-Matic, by abramdemski (6y, 16 comments)
  • 72 · An overview of 11 proposals for building safe advanced AI, by evhub (5y, 32 comments)
  • 54 · Seeking Power is Often Convergently Instrumental in MDPs, by TurnTrout, Logan Riggs (6y, 34 comments)
  • 44 · MONA: Managed Myopia with Approval Feedback, by Seb Farquhar, David Lindner, Rohin Shah (7mo, 7 comments)
  • 39 · Thoughts on “Process-Based Supervision”, by Steven Byrnes (2y, 1 comment)