Decisions: Ontologically Shifting to Determinism

facts about the world that we cannot ignore

Updateless decisions are made by agents that know less, to an arbitrary degree. In UDT proper, there is no choice in how much an agent doesn't know, you just pick the best policy from a position of maximal ignorance. It's this policy that needs to respond to possible and counterfactual past/future observations, but the policy itself is no longer making decisions, the only decision was about picking the policy.

But in practice knowing too little leads to inability to actually compute (or even meaningfully "write down") an optimal decision/policy, it becomes necessary to forget less, and that leads to a decision about how much to forget. When forgetting something, you turn into a different, more ignorant agent. So the choice to forget something in particular is a choice to turn into a particular other agent. More generally, you would interact with this other agent instead of turning into it. Knowing too little is also a problem when there is no clear abstraction of preference that survives the amnesia.

This way, updateless decision making turns into acausal trade where you need to pick who to trade with. There is a change in perspective here, where instead of making a decision personally, you choose whose decision to follow. The object level decision itself is made by someone else, but you pick who to follow based on considerations other than the decision they make. This someone else could also be a moral principle, or common knowledge you have between yourself and another agent, this moral principle or common knowledge just needs to itself take the form of an agent. See also these comments.

[-]Chris_Leong3y20

UDT doesn't really counter my claim that Newcomb-like problems are problems in which we can't ignore that our decisions aren't independent of the state of the world when we make that decision, even though in UDT we know less. To make this clear in the example of Newcomb's, the policy we pick affects the prediction which then affects the results of the policy when the decision is made. UDT isn't ignoring the fact that our decision and the state of the world are tied together, even if it possibly represents it in a different fashion. The UDT algorithm takes this into account regardless of whether the UDT agent models this explicitly.

I'll get to talking about UDT rather than TDT soon. I intend for my next post to be about Counterfactual Mugging and why this is such a confusing problem.

[-]Vladimir_Nesov3y10

UDT still doesn't forget enough. Variations on UDT that move towards acausal trade with arbitrary agents are more obviously needed because UDT forgets too much, since that makes it impossible to compute in practice and forgetting less poses a new issue of choosing a particular updateless-to-some-degree agent to coordinate with (or follow). But not forgetting enough can also be a problem.

In general, an external/updateless agent (whose suggested policy the original agent follows) can forget the original preference, pursue a different version of it that has undergone an ontological shift. So it can forget the world and its laws, as long as the original agent would still find it to be a good idea to follow its policy (in advance, based on the updateless agent's nature, without looking at the policy). This updateless agent is shared among the counterfactual variants of the original agent that exist in the updateless agent's ontology, it's their chosen updateless core, the source of coherence in their actions.

[-]Chris_Leong3y10

How much do you think we should forget?

^{^}

We've made a slight simplification here by ignoring quantum mechanics. Quantum mechanics only shifts us from the state of the world being deterministic, to the probability distribution being deterministic. It doesn't provide scope for free will, so it doesn't avoid the ontological shift.

^{^}

I understand that the free-will/determinism debate is contentious, but I don't want to revisit it in this post.

^{^}

One alternate solution would be to toss out the notion of Newcomb-like problems. However, this seems difficult, as even if we are skeptical of perfect predictors for humans, we can set up this situation in code where Omega has access to the contestant program's source code. So I don't see this as a solution.

^{^}

If we adopt the perspective in which the counterfactual when we one-box includes $1 million in the box, while the counterfactual when we two-box lacks the million, it becomes unclear whether it is fair to compare these two counterfactuals, as it appears as though the agent is facing different problem setups in the two counterfactuals.

^{^}

Or any one of the best options in the event of a tie.

^{^}

The box that is either empty or contains the million.

^{^}

Technically, they generally employ a variant known as either Updateless Decision Theory or Functional Decision Theory, depending on the preferred terminology.

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

5

Decisions: Ontologically Shifting to Determinism

5

Our Naive Ontology Assumes Libertarian Free Will

Adapting to Determinism by Augmenting with Counterfactuals

In Favour of Consistent Counterfactuals

Ranking Consistent Counterfactuals in the Obvious Way

Modelling Decisions As Choices from Outside the Universe(s)

Application to Newcomb's Problem

But we're still Using the Naive Ontology!

So You're Saying we Should use Timeless Decision Theory? We Already Knew That!

What about Updateless Decision Theory?