AI ALIGNMENT FORUM

Decision theory

Edited by Ruby, et al. Last updated 21st Mar 2025.

Decision Theory is the study of principles and algorithms for making correct decisions—that is, decisions that allow an agent to achieve better outcomes with respect to its goals. Every action at least implicitly represents a decision under uncertainty: in a state of partial knowledge, something has to be done, even if that something turns out to be nothing (call it "the null action"). Even if you don't know how you make decisions, decisions do get made, and so there has to be some underlying mechanism. What is it? And how can it be done better? Decision theory has the answers.

Note: this page needs to be updated with content regarding Functional Decision Theory, the latest theory from MIRI.

Related: Game Theory, Robust Agents, Utility Functions

A core idea in decision theory is expected utility maximization, which is usually intractable to calculate directly in practice but remains an invaluable theoretical concept. An agent assigns a utility to every possible outcome: a real number representing the goodness or desirability of that outcome. The mapping of outcomes to utilities is called the agent's utility function. (The agent's decisions are invariant under positive affine transformations of the utility function: that is, the utilities can be multiplied by a positive constant or shifted by any constant while resulting in all the same decisions.) For every action that the agent could take, sum the utilities of the possible outcomes, each weighted by its probability given that action: this is the expected utility of the action, and the action with the highest expected utility is to be chosen.
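
As a rough illustration of the calculation described above, here is a minimal Python sketch. The umbrella scenario, its probabilities and utilities, and the names `expected_utility`, `best_action`, `u`, and `u_rescaled` are made up for this example and do not come from the original page. The sketch also checks that rescaling the utilities by a positive constant and adding a constant leaves the chosen action unchanged.

```python
from typing import Callable, Dict

# A lottery maps each possible outcome to its probability (hypothetical toy data).
Lottery = Dict[str, float]


def expected_utility(lottery: Lottery, utility: Callable[[str], float]) -> float:
    """Sum of utility(outcome), each weighted by that outcome's probability."""
    return sum(p * utility(outcome) for outcome, p in lottery.items())


def best_action(actions: Dict[str, Lottery], utility: Callable[[str], float]) -> str:
    """Return the action whose lottery has the highest expected utility."""
    return max(actions, key=lambda a: expected_utility(actions[a], utility))


# Assumed example: each action induces a distribution over outcomes.
ACTIONS = {
    "take umbrella": {"dry, carrying umbrella": 1.0},
    "leave umbrella": {"dry, hands free": 0.7, "soaked": 0.3},
}

# Assumed utilities for the three outcomes.
UTILITIES = {"dry, carrying umbrella": 8.0, "dry, hands free": 10.0, "soaked": 0.0}


def u(outcome: str) -> float:
    return UTILITIES[outcome]


def u_rescaled(outcome: str) -> float:
    # Positive affine transformation: multiply by 3 (> 0), then shift by 5.
    return 3.0 * u(outcome) + 5.0


if __name__ == "__main__":
    for name, lottery in ACTIONS.items():
        print(name, expected_utility(lottery, u))
    print("chosen:", best_action(ACTIONS, u))
    print("chosen after rescaling:", best_action(ACTIONS, u_rescaled))
```

In this toy setup, taking the umbrella has expected utility 8 and leaving it has expected utility 7, so the first is chosen under both `u` and `u_rescaled`. Real decision problems rarely come with such a small, fully enumerated outcome space, which is why the direct calculation is usually intractable.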

Thought experiments

The limitations and pathologies of decision theories can be analyzed by considering the decisions they recommend in certain idealized situations that stretch the limits of decision theory's applicability. Some of the thought experiments more frequently discussed on LessWrong (LW) include:

  • Newcomb's problem
  • Counterfactual mugging
  • Parfit's hitchhiker
  • Smoker's lesion
  • Absentminded driver
  • Sleeping Beauty problem
  • Prisoner's dilemma
  • Pascal's mugging
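
To make the idea of comparing theories on a thought experiment more concrete, here is a hedged sketch of Newcomb's problem in its usual form: an opaque box contains $1,000,000 if a predictor foresaw that the agent would take only that box, and a transparent box always contains $1,000. The 99% predictor accuracy, the function names, and the simplified treatment of the two theories are assumptions for illustration only. The EDT-style calculation conditions the box contents on the action, the CDT-style calculation holds the contents fixed at some prior credence `p_full`, and neither is a definitive formalization.

```python
MILLION = 1_000_000
THOUSAND = 1_000
NEWCOMB_ACTIONS = ("one-box", "two-box")


def payoff(action: str, box_full: bool) -> int:
    """Dollar payoff given the action and whether the opaque box is full."""
    opaque = MILLION if box_full else 0
    transparent = THOUSAND if action == "two-box" else 0
    return opaque + transparent


def edt_value(action: str, accuracy: float) -> float:
    # EDT-style (assumed simplification): the action is treated as evidence
    # about the prediction, so P(box full) depends on the action taken.
    p_full = accuracy if action == "one-box" else 1.0 - accuracy
    return p_full * payoff(action, True) + (1.0 - p_full) * payoff(action, False)


def cdt_value(action: str, p_full: float) -> float:
    # CDT-style (assumed simplification): the contents are already fixed,
    # so the same P(box full) is used whichever action is taken.
    return p_full * payoff(action, True) + (1.0 - p_full) * payoff(action, False)


def edt_choice(accuracy: float = 0.99) -> str:
    return max(NEWCOMB_ACTIONS, key=lambda a: edt_value(a, accuracy))


def cdt_choice(p_full: float = 0.5) -> str:
    return max(NEWCOMB_ACTIONS, key=lambda a: cdt_value(a, p_full))


if __name__ == "__main__":
    print("EDT-style choice:", edt_choice())
    print("CDT-style choice:", cdt_choice())
```

Under these assumptions the EDT-style agent one-boxes, while the CDT-style agent two-boxes for every value of `p_full`, because two-boxing adds $1,000 whatever the opaque box holds. This kind of divergence is what makes the problem a useful test case.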

Commonly discussed decision theories

Standard theories well-known in academia:

  • CDT, Causal Decision Theory
  • EDT, Evidential Decision Theory

Theories invented by researchers associated with MIRI and LW:

  • FDT, Functional Decision Theory
  • TDT, Timeless Decision Theory
  • UDT, Updateless Decision Theory
  • ADT, Ambient Decision Theory (a variant of UDT)
  • FDT: Cheating Death in Damascus

Other decision theories are listed in A comprehensive list of decision theories.

Blog posts

  • Terminal Values and Instrumental Values
  • Decision Theories: A Less Wrong Primer by orthonormal
  • Decision Theory FAQ by lukeprog and crazy88

Introductions to logical decision theories by Eliezer Yudkowsky

  • For Computer Scientists
  • For Economists
  • For Analytic Philosophers
  • For Everyone Else

Sequence by AnnaSalamon

  • Decision theory: An outline of some upcoming posts
  • Confusion about Newcomb is confusion about counterfactuals
  • Why we need to reduce “could”, “would”, “should”
  • Why Pearl helps reduce “could” and “would”, but still leaves us with at least three alternatives

Sequence by orthonormal (Decision Theories: A Semi-Formal Analysis)

  • Part 0: Decision Theories: A Less Wrong Primer
  • Part I: The Problem with Naive Decision Theory
  • Part II: Causal Decision Theory and Substitution
  • Part III: Formalizing Timeless Decision Theory

See also

  • Instrumental rationality
  • Causality
  • Expected utility
  • Evidential Decision Theory
  • Timeless decision theory, Updateless decision theory
  • AIXI

Children

  • Expected utility formalism
  • Causal decision theories
  • and 3 more

Posts tagged Decision theory

  • Can you control the past? by Joe Carlsmith (50 points, 4y, 8 comments)
  • UDT shows that decision theory is more puzzling than ever by Wei Dai (94 points, 2y, 15 comments)
  • Decision Theory by abramdemski, Scott Garrabrant (34 points, 7y, 14 comments)
  • An Orthodox Case Against Utility Functions by abramdemski (61 points, 5y, 45 comments)
  • Dutch-Booking CDT: Revised Argument by abramdemski (22 points, 5y, 20 comments)
  • Coherence arguments do not entail goal-directed behavior by Rohin Shah (52 points, 7y, 50 comments)
  • Embedded Agency (full-text version) by Scott Garrabrant, abramdemski (54 points, 7y, 4 comments)
  • LCDT, A Myopic Decision Theory by adamShimi, evhub (32 points, 4y, 44 comments)
  • A Critique of Functional Decision Theory by wdmacaskill (30 points, 6y, 26 comments)
  • MIRI/OP exchange about decision theory by Rob Bensinger (22 points, 4y, 5 comments)
  • Responses to apparent rationalist confusions about game / decision theory by Anthony DiGiovanni (54 points, 2y, 0 comments)
  • Counterfactual Mugging Poker Game by Scott Garrabrant (39 points, 7y, 0 comments)
  • Troll Bridge by abramdemski (35 points, 6y, 50 comments)
  • What does it mean to apply decision theory? by abramdemski (23 points, 5y, 1 comment)
  • Comment on Coherence arguments do not imply goal directed behavior by Ronny Fernandez (15 points, 6y, 3 comments)