Go hungry with Almost Free Lunches

Consider the following game, called "Almost Free Lunches" (EDIT: this seems to be a variant of the traveller's dilemma). You name any pound-and-pence amount between £0.00 and £1,000,000.00; your opponent does likewise. You then both receive whichever of the two named amounts is lower.

On top of that, the person who named the higher amount must give £0.02 to the other. If you tie, no extra money changes hands.
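To make the rules concrete, here is a minimal sketch of the payoff rule in Python. Working in integer pence and the function name `payoffs` are my own choices for illustration, not part of the original description.

```python
def payoffs(a: int, b: int) -> tuple[int, int]:
    """Return (payoff to A, payoff to B) in pence when A names a and B names b."""
    low = min(a, b)
    if a == b:
        return low, low          # tie: no extra money changes hands
    if a < b:
        return low + 2, low - 2  # B named the higher amount and pays 2p to A
    return low - 2, low + 2      # A named the higher amount and pays 2p to B


print(payoffs(100_000_000, 100_000_000))  # (100000000, 100000000): both name £1,000,000.00
print(payoffs(99_999_999, 100_000_000))   # (100000001, 99999997): A undercuts by a penny
print(payoffs(5, 0))                      # (-2, 2): naming anything against £0.00 costs 2p
```

The second line shows the temptation that drives the rest of the argument: undercutting by a single penny gains a penny compared with tying.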

What's the Nash equilibrium of this game? Well:

The only Nash equilibrium of Almost Free Lunches is for both of you to name £0.00.

Proof: Suppose player $A$ has a probability distribution $p_{A}$ over possible amounts to name, and player $B$ has a probability distribution $p_{B}$ over possible amounts. Let $m_{A}$ be the highest amount such that $p_{A} (m_{A})$ is non-zero; let $m_{B}$ be the same, for $B$ . Assume that $(p_{A}, p_{B})$ is a Nash equilibrium.

Assume further that $m_{A} \geq m_{B}$ (if that's not the case, then just switch the labels $A$ and $B$ ). Then either $m_{A} >$ £0.00 or $m_{A} =$ £0.00 (and hence both players select £0.00).

We'll now rule out $m_{A} >$ £0.00. If $m_{B} >$ £0.00, then player $A$ can improve their expected payoff by naming $m_{A}^{'} = m_{B} -$ £0.01 whenever they would have named $m_{A}$ . To see this, assume that player $B$ has said $n_{B}$ , and compare what player $A$ gets from $m_{A}^{'}$ with what they get from $m_{A}$ . If $n_{B} < m_{A}^{'} < m_{A}$ , then player $A$ can say $m_{A}^{'}$ just as well as $m_{A}$ : either choice gives them the same amount (namely, $n_{B} -$ £0.02).

There remain two other cases. If $n_{B} = m_{A}^{'}$ , then $m_{A}^{'}$ is superior to $m_{A}$ , getting $m_{A}^{'}$ (rather than $m_{A}^{'} -$ £0.02). And if $n_{B} = m_{B}$ , then $m_{A}^{'}$ gets $m_{A}^{'} +$ £0.02 $= m_{B} +$ £0.01, rather than $m_{B}$ (if $m_{A} = m_{B}$ ) or $m_{B} -$ £0.02 (if $m_{A} > m_{B}$ ). Since $p_{B} (m_{B})$ is non-zero, this last case happens with positive probability, so the switch strictly increases player $A$'s expected payoff.

Finally, if $m_{B} =$ £0.00, then player $B$ always says £0.00, so player $A$ gets -£0.02 unless they also say £0.00 (in which case they get £0.00).

Hence if $m_{A} >$ £0.00, then $p_{A}$ cannot be part of a Nash equilibrium. Thus $m_{A} =$ £0.00, and hence the only Nash equilibrium is at both players saying £0.00.
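As a sanity check on the proof, one can brute-force a scaled-down version of the game. The sketch below assumes a hypothetical cap of 50 pence (the name `CAP` and the helper `pure_nash_equilibria` are mine) and only looks at pure strategies, so it is weaker than the proof above, which also covers mixed strategies.

```python
from itertools import product

CAP = 50  # hypothetical small cap in pence, chosen so that brute force is cheap


def payoffs(a, b):
    """(payoff to A, payoff to B) in pence for named amounts a and b."""
    low = min(a, b)
    if a == b:
        return low, low
    return (low + 2, low - 2) if a < b else (low - 2, low + 2)


def pure_nash_equilibria(cap):
    """List every pure profile (a, b) where neither player gains by deviating alone."""
    amounts = range(cap + 1)
    equilibria = []
    for a, b in product(amounts, repeat=2):
        ua, ub = payoffs(a, b)
        a_cannot_improve = all(payoffs(x, b)[0] <= ua for x in amounts)
        b_cannot_improve = all(payoffs(a, y)[1] <= ub for y in amounts)
        if a_cannot_improve and b_cannot_improve:
            equilibria.append((a, b))
    return equilibria


print(pure_nash_equilibria(CAP))  # [(0, 0)]
```

Every other profile fails for the reason given in the proof: whoever is not already undercutting can gain by naming a penny less than the other player, or by matching £0.00.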

Pareto optimal

There are three Pareto-optimal outcomes: (£1,000,000.00, £1,000,000.00), (£1,000,000.01, £999,999.97), and (£999,999.97, £1,000,000.01). All of them are very much above the Nash Equilibrium.
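The same three-point pattern shows up in a scaled-down game. The following sketch assumes a hypothetical cap of 100 pence (£1.00) and enumerates the pure outcomes that no other pure outcome weakly improves on for both players; `CAP` and `is_dominated` are illustrative names.

```python
CAP = 100  # hypothetical small cap in pence, standing in for £1,000,000.00


def payoffs(a, b):
    """(payoff to A, payoff to B) in pence for named amounts a and b."""
    low = min(a, b)
    if a == b:
        return low, low
    return (low + 2, low - 2) if a < b else (low - 2, low + 2)


# Every payoff pair reachable with pure strategies.
outcomes = {payoffs(a, b) for a in range(CAP + 1) for b in range(CAP + 1)}


def is_dominated(outcome, candidates):
    """True if some other outcome is at least as good for both players."""
    return any(
        other != outcome and other[0] >= outcome[0] and other[1] >= outcome[1]
        for other in candidates
    )


pareto = sorted(o for o in outcomes if not is_dominated(o, outcomes))
print(pareto)  # [(97, 101), (100, 100), (101, 97)]
```

With the cap at 100 pence, the surviving outcomes are the tie at the cap and the two outcomes where one player undercuts the cap by exactly one penny, matching the three outcomes listed above.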

Minmax and maximin

The minmax and maximin values are also both terrible, and also equal to £0.00. This is not surprising, though, as minmax and maximin implicitly assume the other players are antagonistic to you, and are trying to keep your profits low.
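Both values can be checked by direct enumeration on a scaled-down game. This sketch assumes a hypothetical cap of 100 pence and restricts both players to pure strategies; `payoff_a` is an illustrative helper giving player A's payoff only.

```python
CAP = 100  # hypothetical small cap in pence


def payoff_a(a, b):
    """Player A's payoff in pence when A names a and B names b."""
    low = min(a, b)
    if a == b:
        return low
    return low + 2 if a < b else low - 2


amounts = range(CAP + 1)

# Maximin: A picks the amount whose worst case (over B's replies) is best.
maximin = max(min(payoff_a(a, b) for b in amounts) for a in amounts)

# Minmax: B picks the amount that makes A's best reply as poor as possible.
minmax = min(max(payoff_a(a, b) for a in amounts) for b in amounts)

print(maximin, minmax)  # 0 0
```

In both cases the antagonistic choice is to name £0.00: any positive amount named against it loses 2p, so the best a player can guarantee is £0.00.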

Arbitrary badness with two options

This shows that choosing the Nash Equilibrium can be worse than almost every other option. We can of course increase the maximal amount, and get the Nash Equilibrium to be arbitrarily worse than any reasonable solution (I would just say either £1,000,000.00 or £999,999.99, and leave it at that).

But we can also make the Nash Equilibrium arbitrarily close to the worst possible outcome, and that without even requiring more than two options for each player.

Assume that there are four ordered amounts of money/utility: $n_{3} > n_{2} > n_{1} > n_{0}$ . Each player can name $n_{2}$ or $n_{1}$ . If they both name the same amount, they each get that amount of utility. If they name different amounts, then the player naming $n_{2}$ gets $n_{0}$ , and the player naming $n_{1}$ gets $n_{3}$ .

By the same argument as above, the only Nash equilibrium is for both to name $n_{1}$ . The maximum possible amount is $n_{3}$ ; the maximum they can get if they both coordinate is $n_{2}$ , the Nash equilibrium is $n_{1}$ , and the worst option is $n_{0}$ . We can set $n_{1} = n_{0} + \epsilon$ and $n_{3} = n_{2} + \epsilon$ for arbitrarily tiny $\epsilon > 0$ , while setting $n_{2}$ to be larger than $n_{1}$ by some arbitrarily high amount.

So the situation is as bad as it could possibly be.
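Here is a small sketch of the two-option game with made-up numbers: $n_{0} = 0$, $n_{1} = 1$, $n_{2} = 1{,}000{,}000$ and $n_{3} = 1{,}000{,}001$, so the gap of 1 plays the role of the tiny amount added to $n_{0}$ and $n_{2}$. These values and the helper name `row_payoffs` are illustrative assumptions only.

```python
from fractions import Fraction


def row_payoffs(n0, n1, n2, n3):
    """Payoff to the row player, keyed by (row player's choice, other player's choice)."""
    assert n3 > n2 > n1 > n0
    return {
        ("n2", "n2"): n2,  # both name the high amount
        ("n2", "n1"): n0,  # row player names high and is undercut
        ("n1", "n2"): n3,  # row player undercuts
        ("n1", "n1"): n1,  # both name the low amount
    }


n0, n1, n2, n3 = 0, 1, 1_000_000, 1_000_001  # illustrative values only
game = row_payoffs(n0, n1, n2, n3)

# Naming n1 is strictly better than naming n2 whatever the other player does,
# so (n1, n1) is the only Nash equilibrium.
n1_dominates = all(game[("n1", other)] > game[("n2", other)] for other in ("n2", "n1"))
print(n1_dominates)  # True

# Where the equilibrium payoff sits between the worst outcome and coordination.
nash_share = Fraction(n1 - n0, n2 - n0)
print(nash_share)  # 1/1000000
```

Raising $n_{2}$ (and $n_{3}$ with it) pushes that share towards zero without changing the dominance check.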

Note that this is a variant of the prisoner's dilemma with different numbers. You could describe it as "Your companion goes to a hideous jail if and only if you defect (and vice versa). Those that don't defect will also get a dust speck in their eye."

AI ALIGNMENT FORUM

Nash equilibriums can be arbitrarily bad
