AI ALIGNMENT FORUM
AF

61
Wikitags
Main
3
LW Wiki

AIXI

Edited by Eliezer Yudkowsky, Brian Muhia, et al. last updated 6th Oct 2017
Requires: Solomonoff induction, Expected utility

Marcus Hutter's AIXI is the perfect rolling sphere of advanced agent theory - it's not realistic, but you can't understand more complicated scenarios if you can't envision the rolling sphere. At the core of AIXI is Solomonoff induction, a way of using infinite computing power to probabilistically predict binary sequences with (vastly) superintelligent acuity. Solomonoff induction proceeds roughly by considering all possible computable explanations, with prior probabilities weighted by their algorithmic simplicity, and updating their probabilities based on how well they match observation. We then translate the agent problem into a sequence of percepts, actions, and rewards, so we can use sequence prediction. AIXI is roughly the agent that considers all computable hypotheses to explain the so-far-observed relation of sensory data and actions to rewards, and then searches for the best strategy to maximize future rewards. To a first approximation, AIXI could figure out every ordinary problem that any human being or intergalactic civilization could solve. If AIXI actually existed, it wouldn't be a god; it'd be something that could tear apart a god like tinfoil.

Further information:

  • Marcus Hutter's book on AIXI
  • Marcus Hutter's gentler introduction
  • Wikpedia article on AIXI
  • LessWrong Wiki article on AIXI
  • AIXIjs: Interactive browser demo and General Reinforcement Learning tutorial (JavaScript)
Parents:
Central examples
Methodology of unbounded analysis
Children:
AIXI-tl
Subscribe
Discussion
3
Subscribe
Discussion
3
Posts tagged AIXI
11Potential Alignment mental tool: Keeping track of the types
Donald Hobson
4y
0
13Rebuttals for ~all criticisms of AIXI
Cole Wyeth
9mo
3
11Reflective AIXI and Anthropics
Diffractor
7y
14
7Failures of UDT-AIXI, Part 1: Improper Randomizing
Diffractor
7y
0
6The "best predictor is malicious optimiser" problem
Donald Hobson
5y
7
0Corrigibility for AIXI via double indifference
Stuart_Armstrong
9y
0
24Open Problems in AIXI Agent Foundations
Cole Wyeth
1y
0
14Free Will and Dodging Anvils: AIXI Off-Policy
Cole Wyeth
1y
0
7Summary of the Acausal Attack Issue for AIXI
Diffractor
4y
2
Add Posts