Buridan's ass in coordination games

[-]gjm7y20

This doesn't (I think) really have much to do with randomness as such. The relevant thing about R is that it's shared information that a hypothetical adversary doesn't get to see.

If $u_{y}$ isn't chosen adversarially, then our players don't care about pessimizing over $u_{y}$ but about something like an average over $u_{y}$ , and then R isn't needed. Or, if they are ultra-cautious people who universally care about worst cases, then they don't care about expectation w.r.t. R but about the worst case as R varies, and then R doesn't help. R only helps them when $u_{y}$ is chosen adversarially and R isn't; or when they care about pessimizing w.r.t $u_{y}$ but not w.r.t. R.

So the real conclusion here is not "sometimes shared randomness helps", it's "if some people are trying to coordinate in the presence of an adversary, anything they share and the adversary has no access to helps".

[-]paulfchristiano7y30

Sometimes you want to prove a theorem like "The algorithm works well." You generally need randomness if you want to find algorithms that work without strong assumptions on the environment, whether or not there is really an adversary (who knows what kinds of correlations exist in the environment, whether or not you call them an "adversary").

A bayesian might not like this, because they'd prefer prove theorems like "The algorithm works well on average for a random environment drawn from the prior the agents use," for which randomness is never useful.

But specifying the true prior is generally hideously intractable. So a slightly more wise Bayesian might want to prove statements like "The algorithm well on average for a random environment drawn from the real prior" where the "real prior" is some object that we can talk about but have no explicit access to. And now the wiser Bayesian is back to needing randomness.

[-]Wei Dai7y20

A bayesian might not like this, because they’d prefer prove theorems like “The algorithm works well on average for a random environment drawn from the prior the agents use,” for which randomness is never useful.

It seems like a bayesian can conclude that randomness is useful, if their prior puts significant weight on "the environment happens to contain something that iterates over my decision algorithm and returns its worst-case input, or something that's equivalent to or approximates this" (which they should, especially after updating on their own existence). I guess right now we don't know how to handle this in a naturalistic way (e.g., let both intentional and accidental adversaries fall out of some simplicity prior) and so are forced to explicitly assume the existence of adversaries (as in game theory and this post).

[-]jessicata7y20

This seems basically right. As discussed in the conclusion, there are reasons to care about worst-case performance other than literal adversaries.

[-]Dagon7y10

Sure, but non-adversarial cases (really, any cases where u is determined independently of strategies chosen) can just choose R as a fixed part of the strategy, rather than a random shared component determined later.

[-]jessicata7y30

That's right, but getting the worst-case guarantee requires this initial choice to be random.

[-]Dagon7y00

Nope. Random choice gives a specific value for R each game. The outcome for that iteration is IDENTICAL to the outcome if that R was chosen intentionally. Randomness only has game value as a mechanism to keep information from an adversarial actor.

[-]jessicata7y10

To be clear, by "worst-case guarantee" I mean "the expected utility is guaranteed to be pretty good regardless of $u_{y}$ ", which is unattainable without shared randomness (claim 1).

I think you are either misunderstanding or disagreeing with a lot of the terminology on randomized algorithms and worst-case guarantees that are commonly used in CS and statistics. This article is a decent introduction to this topic.

[-]Dagon7y10

I'm missing something (and I haven't digested the math, so maybe it's obvious but just missing from the narrative description). Is epsilon the same for both players, in that they see the same V, it just may not exactly match u? or is it different for each player, meaning for the same u, they have different V? From your analysis (risk of 0), it sounds like the latter.

In that case, I don't see how additional shared knowledge helps coordinate them, nor why it needs to be random rather than just a fixed value they agree on in advance. And certainly not why it matters if the additional random shared value is generated before or after the game starts.

If they don't have this additional source of shared randomness, can they just decide in their pre-game discussion to use R=0.5? Why or why not?

[-]jessicata7y20

$ϵ$ is the same for both players but $V_{1}$ and $V_{2}$ (the players' observations of $u_{y}$ ) are different, both sampled independently uniformly from $[u_{y} - ϵ, u_{y} + ϵ]$ .

If they decide on $R = 0.5$ then there exists some $u_{y}$ value for which they get a bad expected utility (see Claim 1).

[-]Dagon7y00

Sure, but that goes for a randomly-chosen R too. For every possible R, there is a u value for which they get bad outcomes. It doesn't get better by randomly choosing R.

[-]jessicata7y10

The assumption is that $R$ is chosen after $u_{y}$ . So for every $u_{y}$ the pair of policies gets a good expected utility. See the point on Bayesian algorithms in the conclusion for more on why "get a high expected utility regardless of $u_{y}$ " might be a desirable goal.

[-]Dagon7y00

R∼Uniform([0,1])

How can it possibly matter whether R is chosen before or after uy? R is completely independent of u, right? It's not a covert communication mechanism about the players' observations, it's a random value.

[-]jessicata7y20

If $u_{y}$ is chosen after $R$ then it might be chosen to depend on $R$ in such a way that the algorithm gets bad performance, e.g. using the method in the proof of Claim 1.

[-]Dagon7y10

Based on other comments, I realize I'm making an assumption for something you haven't specified. How is uy chosen? If it's random and independent, then my assertion holds, if it's selected by an adversary who knows the players' full strategies somehow, then R is just a way of keeping a secret from the adversary - sequence doesn't matter, but knowledge does.

[-]jessicata7y10

Claim 1 says there exists some $u_{y}$ value for which the algorithm gets high regret, so we might as well assume it's chosen to maximize regret.

Claim 2 says the algorithm has low regret regrardless of $u_{y}$ , so we might as well assume it's chosen to maximize regret.

[-]Dagon7y-10

uy and R are independently chosen from well-defined distributions. Regardless of sequence, neither knows the other and CANNOT be chosen based on the other. I'll see if I can find time tonight to figure out whether I'm saying your claim 1 is wrong (it dropped epsilon too soon from the floor value, but I'm not sure if it's more fundamentally problematic than that) or that your claim 2 is misleading.

My current expectation is that I'll find that your claim 2 results are available in situation 1, by using your given function with a pre-agreed value rather than a random one.

[-]Rohin Shah7y30

The theorems are of the form "For all uy, you get good outcomes" or "There exists a uy that causes bad outcomes".

When you want to prove statements of this form, uy is chosen adversarially, so it matters whether it is chosen before or after R.

uy and R are independently chosen from well-defined distributions.

What distribution is uy chosen from? That's not specified anywhere in the post.

[-]Laszlo_Treszkai7y00

True, they will fail to cooperate for some R, but the values of such R have a low probability. (But yeah, it's also required that uy and R are chosen independently—otherwise an adversary could just choose either so that it results in the players choosing different actions.)

The smoothness comes in from marginalising a random R. The coordination comes from making R and ε common knowledge, so they cooperate using the correlation in their observations—an interesting phenomenon.

(How can I write LaTeX in the comments?)

[-]jessicata7y10

(How can I write LaTeX in the comments?)

ctrl-4

[-]Donald Hobson7y10

If you assume a fixed probability distribution over possible $u_y$ that both players know when coordinating, then they can set up the rules they choose to make sure that they probably win. The extra random information is only useful because of the implicit "for all $u_y$". If some malicious person had overheard their strategy, and was allowed to choose $u_y$, but didn't have access to the random number source, then the random numbers are useful.

[-]avturchin7y10

I expected that Lamport paper would be mentioned, as it describes a known catastrophic mode for autonomous systems, connected with Buridan ass problem and infinite recursion about predicting future time of the problem solving. I think that this problem is underexplored for AI Safety, despite previous attempt to present it on LessWrong.

[-]cousin_it7y10

Is this a reinvention of correlated equilibrium?

[-]jessicata7y20

This is not a reinvention of correlated equilibrium, although that is related.

[-]cousin_it7y20

Looks like I was wrong again and shame on me. Correlated equilibrium can't be useful in a purely cooperative game, it's only useful if we're trying to minimax over many cooperative games. You even spelled it out in the post, but my eyes skipped over it. Sorry again.

[-]cousin_it7y20

Yeah, sorry, I was hasty. Looks like you're making a more interesting point: correlated equilibrium can be useful even in purely cooperative games. I wonder what would be the simplest example game...

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

20

Buridan's ass in coordination games

20

Claim 1: impossibility of solving the game using independent randomness

Claim 2: solving the game using shared randomness

Conclusion and directions for further research

Appendix: extension to arbitrary normal-form cooperative games

Players play according to $a^{g (u, R)}$ with high probability

$a^{g (u, R)}$ is near-optimal in expectation

Proving the result from these facts

20

Buridan's ass in coordination games

20

Claim 1: impossibility of solving the game using independent randomness

Claim 2: solving the game using shared randomness

Conclusion and directions for further research

Appendix: extension to arbitrary normal-form cooperative games

Players play according to ag(u,R) with high probability

ag(u,R) is near-optimal in expectation

Proving the result from these facts

Players play according to $a^{g (u, R)}$ with high probability

$a^{g (u, R)}$ is near-optimal in expectation