Two Notions of Best Response

Wolfgang Spohn develops the concept of a "dependency equilibrium" based on a similar notion of evidential best response (Spohn 2007, 2010). A joint probability distribution $P$ is a dependency equilibrium if all actions of all players that have positive probability are evidential best responses. In case there are actions with zero probability, one evaluates a sequence $(P^{(i)})_{i \in N}$ of joint probability distributions such that ${lim}_{i \to \infty} P^{(i)} = P$ and $P^{(i)} (a) \neq 0$ for all actions $a$ and $i \in N$ . Using your notation of a probability matrix and a utility matrix, the expected utility of an action $a_{j}$ is then defined as the limit of the conditional expected utilities, $lim i \to \infty \frac{U_{j} P_{j}^{(i)}}{| P_{j}^{(i)} |}$ (which is defined for all actions). Say $P$ is a probability matrix with only one zero column, $P_{j}$ . It seems that you can choose an arbitrary nonzero vector $Q_{j}$ , $| Q_{j} | = 1$ to construct, e.g., a sequence of probability matrices $(\frac{i - 1}{i} P + [0, \dots, 0, \frac{1}{i} Q_{j}, 0, \dots, 0])_{i \in N} .$ The expected utilities in the limit for all other actions and the actions of the opponent shouldn't be influenced by this change. So you could choose $Q_{j}$ as the standard vector $e_{i}$ where $i$ is an index such that $U_{j, i} = min U_{j}$ . The expected utility of $a_{j}$ would then be $min U_{j}$ . Hence, this definition of best response in case there are actions with zero probability probably coincides with yours (at least for actions with positive probability—Spohn is not concerned with the question of whether a zero probability action is a best response or not).

The whole thing becomes more complicated with several zero rows and columns, but I would think it should be possible to construct sequences of distributions which work in that case as well.

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

0

0