Cooperative Oracles: Stratified Pareto Optima and Almost Stratified Pareto Optima

[-]AlexMennen9y40

This notion of dependency seems too binary to me. Concretely, let's modify your example from the beginning so that $P_{3}$ must grant an extra $10^{- 10}$ utility to either $P_{1}$ or $P_{2}$ , and gets to decide which. Now, everyone's utility depends on everyone's actions, and the game is still zero-sum, so again, so any strategy profile with $p = q$ will be a stratified Pareto optimum. But it seems like $P_{1}$ and $P_{2}$ should ignore still ignore $P_{3}$ .

[-]Scott Garrabrant9y00

I agree with this. I think that the most interesting direction of future work is to figure out how to have better notions of dependency. I plan on writing some on this in the future, but basically we have not successfully figured out how to deal with this.

[-]Vanessa Kosoy8y00

In the infinite game example, I think that something doesn't add up in the definition of $f$ . A single-valued Kakutani map into a compact space is just a continuous map, but $f$ is not continuous.

[-]Diffractor8y00

It looks legitimate, actually.

Remember, $f$ is set-valued, so if $r_{i - 1} = 0$ , $f (r) = [0, 1]$ . In all other cases, $f (r) = \frac{r_{i - 1}}{2}$ . $f$ is a nonempty convex set-valued function, so all that's left is to show the closed graph property. If the limiting value of $r_{i - 1}$ is something other than 0, the closed graph property holds, and if the limiting value of $r_{i - 1}$ is 0, the closed graph property holds because $0 \in [0, 1]$ .

[-]Vanessa Kosoy8y00

Hi Alex!

I agree that the multimap you described is Kakutani and gives the correct fair set, but in the OP it says that if $r_{i - 1} = 0$ then $f (r) = r_{i}$ , not $f (r) = [0, 1]$ . Maybe I am missing something about the notation?

[-]Stuart_Armstrong9y00

How about using some conception of "coalition-stable"? In which an option has that property if there is no sub-coalition of players that can unilaterally increase their utility, whatever all the other players choose to do.

[-]paulfchristiano9y10

This is basically the core, though it's usually defined for cooperative games, where (a) utility is transferrable and (b) adding a new player never makes an existing player worse off. It's easy to generalize though.

[-]Stuart_Armstrong9y00

Can you use the maximum "swing" from the other player's decisions to get a non-binary measure of dependency?

It's similar to my idea here.

Then the dependency of player $P_{i}$ on $P_{j}$ would be ${max}_{a_{j}} U_{i} - {min}_{a_{j}} U_{i}$ , where $a_{j}$ are the actions of $P_{j}$ .

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

3

Cooperative Oracles: Stratified Pareto Optima and Almost Stratified Pareto Optima

3