AI ALIGNMENT FORUM
AF

Breaking Down Goal-Directed Behaviour

Jun 09, 2022 by Oliver Sourbut

When we speak about entities 'wanting' things, or having 'goal-directed behaviour', what do we mean?

Here I aim to take steps to break down 'goal-directed behaviour' into a conceptual framework of computational abstractions for which I offer tentative terminology, and which helps me to better understand and describe analogies and disanalogies between various goal-directed systems. The overarching motivation is to better understand goal-directed behaviour, in the sense of being able to better predict its (especially counterfactual and off-distribution) implications, its arisal, and other properties. Hopefully it is clear why I consider this worthwhile.

7Breaking Down Goal-Directed Behaviour
Oliver Sourbut
3y
0
12You Only Get One Shot: an Intuition Pump for Embedded Agency
Oliver Sourbut
3y
0
7Deliberation, Reactions, and Control: Tentative Definitions and a Restatement of Instrumental Convergence
Oliver Sourbut
3y
0
13Deliberation Everywhere: Simple Examples
Oliver Sourbut
3y
0