AI ALIGNMENT FORUM
AF

Wikitags

Embedded Agency

Edited by Ben Pace, Noosphere89, et al. last updated 4th Jan 2023

Embedded Agency is the problem that an understanding of the theory of rational agents must account for the fact that the agents we create (and we ourselves) are inside the world or universe we are trying to affect, and not separated from it. This is in contrast with much current basic theory of AI or Rationality (such as Solomonoff induction or Bayesianism) which implicitly supposes a separation between the agent and the-things-the-agent-has-beliefs about. In other words, agents in this universe do not have Cartesian or dualistic boundaries like much of philosophy assumes, and are instead reductionist, that is agents are made up of non-agent parts like bits and atoms.

Embedded Agency is not a fully formalized research agenda, but Scott Garrabrant and Abram Demski have written the canonical explanation of the idea in their sequence Embedded Agency. This points to many of the core confusions we have about rational agency and attempts to tie them into a single picture.

Subscribe
2
Subscribe
2
Discussion0
Discussion0
Posts tagged Embedded Agency
54Embedded Agency (full-text version)
Scott Garrabrant, abramdemski
7y
4
45Embedded Agents
abramdemski, Scott Garrabrant
7y
7
22Humans Are Embedded Agents Too
johnswentworth
6y
14
57Introduction to Cartesian Frames
Scott Garrabrant
5y
18
28Draft papers for REALab and Decoupled Approval on tampering
Jonathan Uesato, Ramana Kumar
5y
2
34Decision Theory
abramdemski, Scott Garrabrant
7y
14
33Robust Delegation
abramdemski, Scott Garrabrant
7y
2
28Subsystem Alignment
abramdemski, Scott Garrabrant
7y
3
25Embedded World-Models
abramdemski, Scott Garrabrant
7y
5
18Embedded Agency via Abstraction
johnswentworth
6y
16
26Embedded Curiosities
Scott Garrabrant, abramdemski
7y
0
36Updates and additions to "Embedded Agency"
Rob Bensinger, abramdemski
5y
1
12You Only Get One Shot: an Intuition Pump for Embedded Agency
Oliver Sourbut
3y
0
31Reducing LLM deception at scale with self-other overlap fine-tuning
Marc Carauleanu, Diogo de Lucena, Gunnar_Zarncke, Judd Rosenblatt, Cameron Berg, Mike Vaiana, Trent Hodgeson
6mo
9
38Infra-Bayesian physicalism: a formal theory of naturalized induction
Vanessa Kosoy
4y
11
Load More (15/60)
Add Posts