Value Learning

Oct 29, 2018

by rohinmshah

This is a sequence investigating the feasibility of one approach to AI alignment: value learning.

Preface to the sequence on value learning

Ambitious Value Learning

What is ambitious value learning?
The easy goal inference problem is still hard
Humans can be assigned any values whatsoever…
Latent Variables and Model Mis-Specification
Future directions for ambitious value learning

Goals vs Utility Functions

Ambitious value learning aims to give the AI the correct utility function in order to avoid catastrophe. Given the difficulty of this approach, we revisit the arguments for using utility functions in the first place.

Intuitions about goal-directed behavior
Coherence arguments do not imply goal-directed behavior
Will humans build goal-directed agents?
AI safety without goal-directed behavior

Narrow Value Learning

What is narrow value learning?
Ambitious vs. narrow value learning
Human-AI Interaction
Reward uncertainty
The human side of interaction
Following human norms
Future directions for narrow value learning
Conclusion to the sequence on value learning