LBIT Proofs 2: Propositions 10-18

Proposition 10: Mixture, updating, and continuous pushforward preserve the properties indicated by the diagram, and always produce an infradistribution.

We'll start with showing that mixture, updating, and continuous pushfoward are always infradistributions, and then turn to property verification.

We know from the last post that mixture, updating, and continuous pushfoward preserve all infradistribution properties (although you need to be careful about whether mixture preserves Lipschitzness, you need that the expected value of the Lipschitz constant is finite), but we added the new one about compact almost-support, so that's the only part we need to re-verify.

To show that mixture has compact almost-support, remember that
$(E_{ζ} h_{i}) (f) = E_{ζ} (h_{i} (f))$
Now, fix an $ϵ$ , we will craft a compact set that accounts for all but $ϵ$ of why functions have the expectation values they do. There is some n where $\sum_{i > n} ζ_{i} λ_{i}^{⊙} < \frac{ϵ}{2}$ , where $λ_{i}^{⊙}$ is the Lipschitz constant of the infradistribution $h_{i}$ . Then, let $C_{ϵ}$ be $⋃_{i \leq n} C_{i, \frac{ϵ}{2}}$ , the union of the compact $\frac{ϵ}{2}$ -almost-supports for the infradistributions $h_{i}, i \leq n$ . This is a finite union of compact sets, so it's compact.

Now we can go:
$| (E_{ζ} h_{i}) (f) - (E_{ζ} h_{i}) (f) | = | E_{ζ} (h_{i} (f)) - E_{ζ} (h_{i} (f^{'})) | \leq \sum_{i} ζ_{i} | h_{i} (f) - h_{i} (f^{'}) |$
$= \sum_{i \leq n} ζ_{i} | h_{i} (f) - h_{i} (f^{'}) | + \sum_{i > n} ζ_{i} | h_{i} (f) - h_{i} (f^{'}) | \leq \sum_{i \leq n} ζ_{i} \frac{ϵ}{2} d (f, f^{'}) + \sum_{i > n} ζ_{i} λ_{i}^{⊙} d (f, f^{'})$
$= d (f, f^{'}) (\frac{ϵ}{2} \sum_{i \leq n} ζ_{i} + \sum_{i > n} ζ_{i} λ_{i}^{⊙}) < d (f, f^{'}) (\frac{ϵ}{2} + \frac{ϵ}{2}) = ϵ d (f, f^{'})$
The first equality is reexpressing mixtures, and the first inequality is moving the expectation outside the absolute value which doesn't decrease value, then we break up the expectation for the second equality. The second inequality is because the gap between $h_{i} (f)$ and $h_{i} (f^{'})$ has a trivial upper bound from the Lipschitzness of $h_{i}$ , and for $i \leq n$ , we have that $f$ and $f^{'}$ agree on the union of the $\frac{ϵ}{2}$ -almost-supports for the $h_{i}, i \leq n$ , so a particular infradistribution, by the definition of an almost-support, has these two expectations having not-very-different values. Then we just pull the gap between $f$ and $f^{'}$ out, and use the fact that for the mixture to work, $\sum_{i} ζ_{i} λ_{i}^{⊙} < \infty$ , and we picked n big enough for that last tail of the infinite sum to be small. Then we're done.

Now, we will show compact almost-support for $h |^{g} L$ assuming $h$ has compact almost-support. Fix an $ϵ$ . Your relevant set for $support (L)$ will be
$C_{\frac{ϵ P_{h}^{g} (L)}{2}} \cap {x | L (x) \geq \frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}}}$
Where the first term is a compact set that is a $\frac{ϵ P_{h}^{g} (L)}{2}$ -almost-support for $h$ , and that last set is a sort of "this point must be likely enough". $λ^{⊙}$ will be the Lipschitz constant of the original $h$ . Yes, this intersection may be empty.

Now, here's how things go. Let $f$ and $f^{'}$ agree on that intersection. (if it's the empty set, then it can be any two functions). We can go:
$| (h |^{g} L) (f) - (h |^{g} L) (f^{'}) | = ∣ ∣ ∣ \frac{h (f ★^{L} g) - h (0 ★^{L} g)}{h (1 ★^{L} g) - h (0 ★^{L} g)} - \frac{h (f^{'} ★^{L} g) - h (0 ★^{L} g)}{h (1 ★^{L} g) - h (0 ★^{L} g)} ∣ ∣ ∣$
$= \frac{1}{h (1 ★^{L} g) - h (0 ★^{L} g)} | h (f ★^{L} g) - h (0 ★^{L} g) - h (f^{'} ★^{L} g) + h (0 ★^{L} g) |$
$= \frac{1}{P_{h}^{g} (L)} | h (f ★^{L} g) - h (f^{'} ★^{L} g) | = \frac{1}{P_{h}^{g} (L)} | h (L f + (1 - L) g) - h (L f^{'} + (1 - L) g) |$
So far, this is just a standard sequence of rewrites. The definition of the update, pulling the fraction out, using $P_{h}^{g} (L)$ to abbreviate the rescaling term, and unpacking what $★^{L}$ means.

Now, let's see how different $L f + (1 - L) g$ and $L f^{'} + (1 - L) g$ are on the set $C_{\frac{ϵ P_{h}^{g} (L)}{2}}$ . One of two things will occur. Our first possibility is that an $x$ in that compact set also has $L (x) \geq \frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}}$ . Then
$x \in C_{\frac{ϵ P_{h}^{g} (L)}{2}} \cap {x | L (x) \geq \frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}}}$
and $f, f^{'}$ were selected to be equal on that set, so the two functions will be identical on that point. Our second possibility is that $x$ in that compact set will have $L (x) < \frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}}$ . In that case,
$| L (x) f (x) + (1 - L (x)) g (x) - L (x) f^{'} (x) - (1 - L (x)) g (x) |$
$= | L (x) f (x) - L (x) f^{'} (x) | = L (x) | f (x) - f^{'} (x) | \leq \frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}} d (f, f^{'})$
Because $L (x) < δ$ .

Putting this together, $L f + (1 - L) g$ and $L f^{'} + (1 - L) g$ are only $\frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}} d (f, f^{'})$ apart when restricted to the compact set $C_{\frac{ϵ P_{h}^{g} (L)}{2}}$ . By Lemma 2, we can then show that
$| h (L f + (1 - L) g) - h (L f^{'} + (1 - L) g) | \leq λ^{⊙} \cdot \frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}} d (f, f^{'}) + \frac{ϵ P_{h}^{g} (L)}{2} d (L f + (1 - L) g, L f^{'} + (1 - L) g)$
And, we also know that:
$d (L f + (1 - L) g, L f^{'} + (1 - L) g) = d (L f, L f^{'}) \leq d (f, f^{'})$
Because $L \in [0, 1]$ . Making that substitution, we have:
$| h (L f + (1 - L) g) - h (L f^{'} + (1 - L) g) | \leq λ^{⊙} \cdot \frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}} d (f, f^{'}) + \frac{ϵ P_{h}^{g} (L)}{2} d (f, f^{'})$
$= \frac{ϵ P_{h}^{g} (L)}{2} d (f, f^{'}) + \frac{ϵ P_{h}^{g} (L)}{2} d (f, f^{'}) = ϵ P_{h}^{g} (L) d (f, f^{'})$

Backing up to earlier, we had established that
$| (h |^{g} L) (f) - (h |^{g} L) (f^{'}) | = \frac{1}{P_{h}^{g} (L)} | h (L f + (1 - L) g) - h (L f^{'} + (1 - L) g) |$
and from shortly above, we established that
$| h (L f + (1 - L) g) - h (L f^{'} + (1 - L) g) | \leq ϵ P_{h}^{g} (L) d (f, f^{'})$
Putting these together,
$| (h |^{g} L) (f) - (h |^{g} L) (f^{'}) | < ϵ d (f, f^{'})$
For any two functions $f$ and $f^{'}$ which agree on
$C_{\frac{ϵ P_{h}^{g} (L)}{2}} \cap {x | L (x) \geq \frac{ϵ P_{h}^{g} (L)}{2 λ^{⊙}}}$
Witnessing that said set is an $ϵ$ -almost-support for $h |^{g} L$ .

All we need to finish up is to show that this is a compact set in $support (L)$ equipped with the subspace topology. This can be done by observing that in the original space $X$ it's a compact set, due to being the intersection of a compact set and a closed set. In the subspace topology, if we try to make an open cover of it, all the open sets that cover it in the subspace topology are the restrictions of open sets in the original topology, so we have an open cover of this set in the original topology, and we can make a finite subcover, so it's compact in the subspace topology as well.

Thus, for any $ϵ$ , we can make a compact (in $support (L)$ ) $ϵ$ -almost-support for $h |^{g} L$ , so $h |^{g} L$ has compact almost-support and we've verified the last condition for an update of an infradistribution to be an update.

Now for deterministic pushfoward. Fix an $ϵ$ , and let your appropriate set for $g_{*} (h)$ be $g (C_{ϵ})$ where $C_{ϵ}$ is a compact $ϵ$ -almost-support for $h$ . The image of a compact set is compact, so that part is taken care of. We still need to check that it's an $ϵ$ -almost-support for $g_{*} (h)$ . Let $f, f^{'}$ be equal on this set. Then
$| g_{*} (h) (f) - g_{*} (h) (f^{'}) | = | h (f \circ g) - h (f^{'} \circ g) | \leq ϵ d (f \circ g, f^{'} \circ g)$
$= ϵ {sup}_{x} | f (g (x)), f^{'} (g (x)) | \leq ϵ {sup}_{y} | f (y), f^{'} (y) | = ϵ d (f, f^{'})$
And we're done. This is because, for any point $x \in C_{ϵ}$ , feeding it through $g$ makes a point in $g (C_{ϵ})$ , and feeding it through $f$ and $f^{'}$ produces identical results because they agree on $g (C_{ϵ})$ . Therefore, $f \circ g$ and $f^{'} \circ g$ agree on $C_{ϵ}$ and thus can have values only $ϵ d (f \circ g, f^{'} \circ g)$ apart, which is actually upper-bounded by $ϵ d (f, f^{'})$ . $g (C_{ϵ})$ is thus a compact $ϵ$ -almost-support for $g_{*} (h)$ , and this can be done for any $ϵ$ , so $g_{*} (h)$ has compact almost-support.

Since these three operations always produce infradistributions (as we've shown, we verified the last condition). Updating only has two properties to check, preserving homogenity when $g = 0$ and cohomogenity when $g = 1$ , so let's get that knocked out.

Homogenity using homogenity for h
$(h |^{0} L) (a f) = \frac{h (a f ★^{L} 0) - h (0 ★^{L} 0)}{h (1 ★^{L} 0) - h (0 ★^{L} 0)} = \frac{h (L a f) - h (0)}{h (1 ★^{L} 0) - h (0 ★^{L} 0)} = \frac{a h (L f)}{h (1 ★^{L} 0) - h (0 ★^{L} 0)}$
$= a \frac{h (L f)}{h (1 ★^{L} 0) - h (0 ★^{L} 0)} = a \frac{h (L f) - h (0)}{h (1 ★^{L} 0) - h (0 ★^{L} 0)} = a \frac{h (f ★^{L} 0) - h (0 ★^{L} 0)}{h (1 ★^{L} 0) - h (0 ★^{L} 0)} = a (h |^{0} L) (f)$
Cohomogenity using cohomogenity for h
$(h |^{1} L) (1 + a f) = \frac{h ((1 + a f) ★^{L} 1) - h (0 ★^{L} 1)}{h (1 ★^{L} 1) - h (0 ★^{L} 1)} = \frac{h (L + a L f + 1 - L) - h (1 - L)}{h (1) - h (1 - L)}$
$= \frac{h (1_{a} L f) - h (1 - L)}{1 - h (1 - L)} = \frac{(1 - a + a h (1 + L f)) - h (1 - L)}{1 - h (1 - L)}$
$= \frac{1 - a + a h (1 + L f) - h (1 - L) + a h (1 - L) - a h (1 - L)}{1 - h (1 - L)}$
$= \frac{1 - h (1 - L)}{1 - h (1 - L)} - \frac{a - a h (1 - L)}{1 - h (1 - L)} + \frac{a h (1 + L f) - a h (1 - L)}{1 - h (1 - L)}$
$= \frac{1 - h (1 - L)}{1 - h (1 - L)} - a \frac{1 - h (1 - L)}{1 - h (1 - L)} + a \frac{h (1 + L f) - h (1 - L)}{1 - h (1 - L)}$
$= 1 - a + a \frac{h (1 + L f) - h (1 - L)}{h (1) - h (1 - L)} = 1 - a + a \frac{h (L + L f + (1 - L)) - h (1 - L)}{h (L + (1 - L)) - h (1 - L)}$
$= 1 - a + a \frac{h ((1 + f) ★^{L} 1) - h (0 ★^{L} 1)}{h (1 ★^{L} 1) - h (0 ★^{L} 1)} = 1 - a + a (h |^{1} L) (1 + f)$

Now for mixtures, we'll verify homogenity, 1-Lipschitzness, cohomogenity, C-additivity, and crispness.

Homogenity:
$(E_{ζ} h_{i}) (a f) = E_{ζ} (h_{i} (a f)) = E_{ζ} (a h_{i} (f)) = a E_{ζ} (h_{i} (f)) = a (E_{ζ} h_{i}) (f)$
1-Lipschitz:
$| (E_{ζ} h_{i}) (f) - (E_{ζ} h_{i}) (f^{'}) | = | E_{ζ} (h_{i} (f)) - E_{ζ} (h_{i} (f^{'})) |$
$\leq E_{ζ} | h_{i} (f) - h_{i} (f^{'}) | \leq E_{ζ} d (f, f^{'}) = d (f, f^{'})$
Cohomogenity:
$(E_{ζ} h_{i}) (1 + a f) = E_{ζ} (h_{i} (1 + a f)) = E_{ζ} (1 - a + a h_{i} (1 + f))$
$= 1 - a + a E_{ζ} (h_{i} (1 + f)) = 1 - a + a (E_{ζ} h_{i}) (1 + f)$
C-additivity:
$(E_{ζ} h_{i}) (c) = E_{ζ} (h_{i} (c)) = E_{ζ} (c) = c$
Crispness: Observe that both homogenity and C-additivity are preserved, and crispness is equivalent to the conjunction of the two.

Now for deterministic pushforwards, we'll verify homogenity, 1-Lipschitzness, cohomogenity, C-additivity, crispness, and sharpness.

Homogenity:
$(g_{*} (h)) (a f) = h ((a f) \circ g) = h (a (f \circ g)) = a h (f \circ g) = a (g_{*} (h)) (f)$
1-Lipschitzness:
$| (g_{*} (h)) (f) - (g_{*} (h)) (f^{'}) | = | h (f \circ g) - h (f^{'} \circ g) | \leq d (f \circ g, f^{'} \circ g) \leq d (f, f^{'})$
Cohomogenity:
$(g_{*} (h)) (1 + a f) = h ((1 + a f) \circ g) = h (1 + a (f \circ g)) = 1 - a + a h (1 + (f \circ g))$
$= 1 - a + a h ((1 + f) \circ g) = 1 - a + a (g_{*} (h)) (1 + f)$
C-additivity:
$(g_{*} (h)) (c) = h (c \circ g) = h (c) = c$
Crispness: Both homogenity and C-additivity are preserved, so crispness is preserved too.

Sharpness:
$(g_{*} (h)) (f) = h (f \circ g) = {inf}_{x \in C} f (g (x)) = {inf}_{y \in g (C)} f (y)$
And $g (C)$ is the image of a compact set, so it's compact. And we're done!

Proposition 11: The inf of two infradistributions is always an infradistribution, and inf preserves the infradistribution properties indicated by the diagram at the start of this section.

We'll first verify the infradistribution properties of the inf, and then show it preserves the indicated properties if both components have them.

We must check monotonicity, concavity, normalization, Lipschitzness, and compact almost-support. For monotonicity, if $f^{'} \geq f$ , then
$inf (h_{1}, h_{2}) (f^{'}) = inf (h_{1} (f^{'}), h_{2} (f^{'})) \geq inf (h_{1} (f), h_{2} (f)) = inf (h_{1}, h_{2}) (f)$
This was done by monotonicity for the components. For concavity,
$inf (h_{1}, h_{2}) (p f + (1 - p) f^{'}) = inf (h_{1} (p f + (1 - p) f^{'}), h_{2} (p f + (1 - p) f^{'}))$
$\geq inf (p h_{1} (f) + (1 - p) h_{1} (f^{'}), p h_{2} (f) + (1 - p) h_{2} (f^{'}))$
$\geq inf (p h_{1} (f), p h_{2} (f)) + inf ((1 - p) h_{1} (f^{'}), (1 - p) h_{2} (f^{'}))$
$= p inf (h_{1} (f), h_{2} (f)) + (1 - p) inf (h_{1} (f^{'}), h_{2} (f^{'}))$
$= p inf (h_{1}, h_{2}) (f) + (1 - p) inf (h_{1}, h_{2}) (f^{'})$
The first $\geq$ happened because $h_{1}$ and $h_{2}$ are concave, the second is because $inf (a + b, c + d) \geq inf (a, c) + inf (b, d)$ .

For normalization,
$inf (h_{1}, h_{2}) (1) = inf (h_{1} (1), h_{2} (1)) = inf (1, 1) = 1$
And the same argument applies to 0, so the inf is normalized.

For Lipschitzness, the inf of two Lipschitz functions is Lipschitz.

That just leaves compact almost-support. Fix an arbitary $ϵ$ , and get a $C_{ϵ}^{1}$ compact $ϵ$ -almost-support for $h_{1}$ , and a $C_{ϵ}^{2}$ for $h_{2}$ . We will show that $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ is a compact $ϵ$ -almost-support for $inf (h_{1}, h_{2})$ . It's compact because it's a finite union of compact sets.

Now, let $f$ and $f^{'}$ agree on $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ . We can go:
$| inf (h_{1}, h_{2}) (f) - inf (h_{1}, h_{2}) (f^{'}) | = | inf (h_{1} (f), h_{2} (f)) - inf (h_{1} (f^{'}), h_{2} (f^{'})) |$
There are four possible cases for evaluating this quantity. In case 1, $h_{1} (f) \leq h_{2} (f)$ and $h_{1} (f^{'}) \leq h_{2} (f^{'})$ . Then our above term turns into $| h_{1} (f) - h_{1} (f^{'}) |$ . However, since $f$ and $f^{'}$ agree on $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ , they must agree on $C_{ϵ}^{1}$ , and only have expectations $\leq ϵ d (f, f^{'})$ apart. Case 2 where $h_{1} (f) \geq h_{2} (f)$ and $h_{1} (f^{'}) \geq h_{2} (f^{'})$ is symmetric and can be disposed of by a nearly identical argument, we just do it with $h_{2}$ and $C_{ϵ}^{2}$ .

Case 3 where $h_{1} (f) < h_{2} (f)$ and $h_{1} (f^{'}) > h_{2} (f^{'})$ takes a slightly fancier argument. We can go:
$- ϵ d (f, f^{'}) < h_{1} (f) - h_{1} (f^{'}) < h_{1} (f) - h_{2} (f^{'}) < h_{2} (f) - h_{2} (f^{'}) < ϵ d (f, f^{'})$
The end inequalities are because $f$ and $f^{'}$ agree on the $ϵ$ -almost-supports of $h_{1}$ and $h_{2}$ , respectively, from agreeing on the union. The two inner inequalities are derived from the assumed inequalities in Case 3.
Thus,
$| inf (h_{1} (f), h_{2} (f)) - inf (h_{1} (f^{'}), h_{2} (f^{'})) | = | h_{1} (f) - h_{2} (f^{'}) | < ϵ d (f, f^{'})$
Case 4 where the assumed starting inequalities go in the other direction is symmetric. So, no matter which infradistributions are lower in the two infs, we have
$| inf (h_{1}, h_{2}) (f) - inf (h_{1}, h_{2}) (f^{'}) | = | inf (h_{1} (f), h_{2} (f)) - inf (h_{1} (f^{'}), h_{2} (f^{'})) | < ϵ d (f, f^{'})$
And we're done, we made a compact almost-support for $inf (h_{1}, h_{2})$ assuming an arbitrary $ϵ$ . So the inf of two infradistributions is a infradistribution.

Now to verify homogenity, 1-Lipschitzness, cohomogenity, C-additivity, crispness, and sharpness preservation.

Homogenity:
$inf (h_{1}, h_{2}) (a f) = inf (h_{1} (a f), h_{2} (a f)) = inf (a h_{1} (f), a h_{2} (f))$
$= a inf (h_{1} (f), h_{2} (f)) = a inf (h_{1}, h_{2}) (f)$
1-Lipschitzness:
$| inf (h_{1}, h_{2}) (f) - inf (h_{1}, h_{2}) (f^{'}) | = | inf (h_{1} (f), h_{2} (f)) - inf (h_{1} (f^{'}), h_{2} (f^{'})) |$
Now we can split into four cases. In cases 1 and 2 where the infs turn into $h_{1} (f), h_{1} (f^{'})$ (and same for $h_{2}$ in case 2), we have:
$| inf (h_{1} (f), h_{2} (f)) - inf (h_{1} (f^{'}), h_{2} (f^{'})) | = | h_{1} (f) - h_{1} (f^{'}) | \leq d (f, f^{'})$
(and same for $h_{2}$ ), and we're done with those cases. In cases 3 and 4 where the infs turn into $h_{1} (f), h_{2} (f^{'})$ (and vice-versa for case 4), we have:
$- d (f, f^{'}) \leq h_{1} (f) - h_{1} (f^{'}) < h_{1} (f) - h_{2} (f^{'}) < h_{2} (f) - h_{2} (f^{'}) \leq d (f, f^{'})$
Because $h_{1}$ and $h_{2}$ are 1-Lipschitz. Thus,
$| inf (h_{1} (f), h_{2} (f)) - inf (h_{1} (f^{'}), h_{2} (f^{'})) | = | h_{1} (f) - h_{2} (f^{'}) | \leq d (f, f^{'})$
A symmetric argument works for case 4. So, no matter what,
$| inf (h_{1} (f), h_{2} (f)) - inf (h_{1} (f^{'}), h_{2} (f^{'})) | \leq d (f, f^{'})$
And we're done, the inf is 1-Lipschitz too.

Cohomogenity:
$inf (h_{1}, h_{2}) (1 + a f) = inf (h_{1} (1 + a f), h_{2} (1 + a f))$
$= inf (1 - a + a h_{1} (1 + f), 1 - a + a h_{2} (1 + f))$
$= 1 - a + a inf (h_{1} (1 + f), h_{2} (1 + f))$
$= 1 - a + a inf (h_{1}, h_{2}) (1 + f)$
C-additivity:
$inf (h_{1}, h_{2}) (c) = inf (h_{1} (c), h_{2} (c)) = inf (c, c) = c$
Crispness: Homogenity and C-additivity are both preserved, so crispness is preserved.

Sharpness:
$inf (h_{1}, h_{2}) (f) = inf (h_{1} (f), h_{2} (f)) = inf ({inf}_{x \in C_{1}} f (x), {inf}_{x \in C_{2}} f (x)) = {inf}_{x \in C_{1} \cup C_{2}} f (x)$
And we're done.

Proposition 12: $E_{inf (H_{1}, H_{2})} (f) = inf (E_{H_{1}} (f), E_{H_{2}} (f))$

$E_{inf (H_{1}, H_{2})} (f) = {inf}_{(m, b) \in inf (H_{1}, H_{2})} m (f) + b = {inf}_{(m, b) \in H_{1} \cup H_{2}} m (f) + b$
$= inf ({inf}_{(m, b) \in H_{1}} (m (f) + b), {inf}_{(m, b) \in H_{2}} (m (f) + b))$
$= inf (E_{H_{1}} (f), E_{H_{2}} (f))$

Proposition 13: If a family of infradistributions ${h_{i}}_{i \in I}$ has a shared upper bound on the Lipschitz constant, and for all $ϵ$ , there is a compact set $C_{ϵ}$ that is an $ϵ$ -almost support for all $h_{i}$ , then ${inf}_{i} h_{i}$ , defined as $({inf}_{i} h_{i}) (f) := {inf}_{i} (h_{i} (f))$ , is an infradistribution. Further, for all conditions listed in the table, if all the $h_{i}$ fulfill them, then ${inf}_{i} h_{i}$ fulfills the same property.

We'll first verify the infradistribution properties of the infinite inf, and then show it preserves the indicated properties if all components have them.

We must check monotonicity, concavity, normalization, Lipschitzness, and compact almost-support. For monotonicity, if $f^{'} \geq f$ , then
$({inf}_{i} h_{i}) (f^{'}) = {inf}_{i} (h_{i} (f^{'})) \geq {inf}_{i} (h_{i} (f)) = ({inf}_{i} h_{i}) (f)$
This was done by monotonicity for all components. For concavity,
$({inf}_{i} h_{i}) (p f + (1 - p) f^{'}) = {inf}_{i} (h_{i} (p f + (1 - p) f^{'})) \geq {inf}_{i} (p h_{i} (f) + (1 - p) h_{i} (f^{'}))$
$\geq {inf}_{i} (p h_{i} (f)) + {inf}_{i} ((1 - p) h_{i} (f^{'})) = p {inf}_{i} (h_{i} (f)) + (1 - p) {inf}_{i} (h_{i} (f^{'}))$
$= p ({inf}_{i} h_{i}) (f) + (1 - p) ({inf}_{i} h_{i}) (f^{'})$
The first $\geq$ happened because $h_{1}$ and $h_{2}$ are concave, the second is because ${inf}_{i} (a_{i} + b_{i}) \geq {inf}_{i} (a_{i}) + {inf}_{i} (b_{i})$ .

For normalization,
$({inf}_{i} h_{i}) (1) = {inf}_{i} (h_{i} (1)) = {inf}_{i} (1) = 1$
And the same argument applies to 0, so the inf is normalized.

For Lipschitzness, let $λ^{⊙}$ be your uniform upper bound on the Lipschitz constants of the $h_{i}$ . Then,
$| ({inf}_{i} h_{i}) (f) - ({inf}_{i} h_{i}) (f^{'}) | = | {inf}_{i} (h_{i} (f)) - {inf}_{i} (h_{i} (f^{'})) |$
And then, for all the $h_{i}$ , they only think those functions differ by $λ^{⊙} d (f, f^{'})$ or less, and the same property applies to the inf by picking a $h_{i}$ and $h_{i^{'}}$ that very very nearly attain the two minimums, and showing that if the infinimums were $> λ^{⊙} d (f, f^{'})$ apart, you could have $h_{i} (f^{'})$ appreciably undershoot $h_{i^{'}} (f^{'})$ , and in fact, undershoot ${inf}_{i} (h_{i} (f^{'}))$ , which is impossible. Thus,
$| {inf}_{i} (h_{i} (f)) - {inf}_{i} (h_{i} (f^{'})) | \leq λ^{⊙} d (f, f^{'})$
And we're done.

That just leaves compact almost-support. Fix an arbitary $ϵ$ . We know there is some $C_{ϵ}$ that is a compact $ϵ$ -almost-support for all the $h_{i}$ . We will show that $C_{ϵ}$ is an $ϵ$ -almost-support for ${inf}_{i} h_{i}$ .

Let $f$ and $f^{'}$ agree on $C_{ϵ}$ . We can go:
$| ({inf}_{i} h_{i}) (f) - ({inf}_{i} h_{i}) (f^{'}) | = | {inf}_{i} (h_{i} (f)) - {inf}_{i} (h_{i} (f^{'})) |$
Pick a $h_{i}$ and $h_{i^{'}}$ that very very very nearly attain the inf. Then we can approximately reexpress this quantity as:
$| inf (h_{i} (f), h_{i^{'}} (f)) - inf (h_{i} (f^{'}), h_{i^{'}} (f^{'})) |$
We're approximately in a case where $h_{i} (f) \leq h_{i^{'}} (f)$ and $h_{i} (f^{'}) \geq h_{i^{'}} (f^{'})$ , so we can go:
$- ϵ d (f, f^{'}) \leq h_{i} (f) - h_{i} (f^{'}) < h_{i} (f) - h_{i^{'}} (f^{'}) < h_{i^{'}} (f) - h_{i^{'}} (f^{'}) \leq ϵ d (f, f^{'})$
The end inequalities are because $f$ and $f^{'}$ agree on the $ϵ$ -almost-support of $h_{i}$ and $h_{i^{'}}$ . The two inner inequalities are derived from the assumed inequalities in our case. Thus,
$| {inf}_{i} (h_{i} (f)) - {inf}_{i} (h_{i} (f^{'})) | ≃ | h_{i} (f) - h_{i^{'}} (f^{'}) | \leq ϵ d (f, f^{'})$
And we're done, we made a compact almost-support for ${inf}_{i} h_{i}$ assuming an arbitrary $ϵ$ . So the inf of this family of infradistributions is a infradistribution.

Now to verify homogenity, 1-Lipschitzness, cohomogenity, C-additivity, crispness, and sharpness preservation.

Homogenity:
$({inf}_{i} h_{i}) (a f) = {inf}_{i} (h_{i} (a f)) = {inf}_{i} (a h_{i} (f))$
$= a {inf}_{i} (h_{i} (f)) = a ({inf}_{i} h_{i}) (f)$
1-Lipschitzness: Same as the Lipschitz argument, everyone has a Lipschitz constant of 1, so the inf has the same Lipschitz constant.

Cohomogenity:
$({inf}_{i} h_{i}) (1 + a f) = {inf}_{i} (h_{i} (1 + a f))$
$= {inf}_{i} (1 - a + a h_{i} (1 + f))$
$= 1 - a + a {inf}_{i} (h_{i} (1 + f))$
$= 1 - a + a ({inf}_{i} h_{i}) (1 + f)$
C-additivity:
$({inf}_{i} h_{i}) (c) {inf}_{i} (h_{i} (c)) = {inf}_{i} (c) = c$
Crispness: Homogenity and C-additivity are both preserved, so crispness is preserved.

Sharpness:
$({inf}_{i} h_{i}) (f) = inf (h_{i} (f)) = {inf}_{i} ({inf}_{x \in C_{i}} f (x)) = {inf}_{x \in ⋃_{i} C_{i}} f (x) = {inf}_{x \in ¯ ¯¯¯¯¯¯¯¯¯ ¯ ⋃_{i} C_{i}} f (x)$
We do have to check whether or not $⋃_{i} C_{i}$ is compact, however. We'll start by showing that for an arbitrary $C_{i}$ , any compact set $K$ where $C_{i} ⊈ K$ can't be an $ϵ$ -support of $h_{i}$ for any $ϵ < 1$ . The proof proceeds as follows:

Let $x^{*}$ be some point in $C_{i}$ but not in $K$ . It must be some finite distance away from $K$ . Craft a continuous function $f^{0}$ supported on $K \cup {x^{*}}$ . $f^{0}$ is 1 on $K$ and 0 on ${x^{*}}$ . Use the Tietze extension theorem to extend $f^{0}$ to all of $X$ . Then
$| h_{i} (f^{0}) - h_{i} (1) | = | {inf}_{x \in C_{i}} f^{0} (x) - {inf}_{x \in C_{i}} 1 | = | f^{0} (x^{*}) - 1 | = | 0 - 1 | = 1$
However, $f^{0}$ and 1 agree on $K$ , so $K$ can't be an $ϵ$ -almost-support for any $ϵ < 1$ .

Thus, in order for there to be a compact set $C_{ϵ}$ that's an $ϵ$ -almost-support for all $h_{i}$ , it must be that $\forall i : C_{i} \subseteq C_{ϵ}$ . Then
$¯ ¯¯¯¯¯¯¯¯¯¯ ¯ ⋃_{i} C_{i} \subseteq C_{ϵ}$
because all the $C_{i}$ are in it and $C_{ϵ}$ is closed. So, the closure of our union is a closed subset of a compact set and thus is compact, so ${inf}_{i} h_{i}$ is minimizing over a compact set and thus is crisp.

Proposition 14: If $sup (h_{1}, h_{2}) (0) = 0$ and $sup (h_{1}, h_{2}) (1) = 1$ , then the supremum is an infradistribution.

The supremum is defined as:
$sup (h_{1}, h_{2}) (f) = {sup}_{f_{1}, f_{2}, p : p f_{1} + (1 - p) f_{2} \leq f} p h_{1} (f_{2}) + (1 - p) h_{2} (f_{2})$
We'll verify the infradistribution properties of the sup.

We must check monotonicity, concavity, normalization, Lipschitzness, and compact almost-support. For monotonicity, if $f^{'} \geq f$ , then
$sup (h_{1}, h_{2}) (f) = {sup}_{f_{1}, f_{2}, p : p f_{1} + (1 - p) f_{2} \leq f} p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})$
$\leq {sup}_{f_{1}, f_{2}, p : p f_{1} + (1 - p) f_{2} \leq f^{'}} p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2}) = sup (h_{1}, h_{2}) (f^{'})$
This was done by $f^{'} \geq f$ so there's more options available. For concavity,
$q sup (h_{1}, h_{2}) (f) + (1 - q) sup (h_{1}, h_{2}) (f^{'})$
$= q {sup}_{f_{1}, f_{2}, p : p f_{1} + (1 - p) f_{2} \leq f} p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})$
$+ (1 - q) {sup}_{f_{1}^{'}, f_{2}^{'}, p^{'} : p^{'} f_{1}^{'} + (1 - p^{'}) f_{2}^{'} \leq f} p^{'} h_{1} (f_{1}^{'}) + (1 - p^{'}) h_{2} (f_{2}^{'})$
Pick your $f_{1}, f_{2}, f_{1}^{'}, f_{2}^{'}, p, p^{'}$ that very very very nearly attain the supremum.
$≃ q p h_{1} (f_{1}) + q (1 - p) h_{2} (f_{2}) + (1 - q) p^{'} h_{1} (f_{1}^{'}) + (1 - q) (1 - p^{'}) h_{2} (f_{2}^{'})$
$= (q p + (1 - q) p^{'}) (\frac{q p}{q p + (1 - q) p^{'}} h_{1} (f_{1}) + \frac{(1 - q) p^{'}}{q p + (1 - q) p^{'}} h_{1} (f_{1}^{'}))$
$+ (q (1 - p) + (1 - q) (1 - p^{'})) (\frac{q (1 - p)}{q (1 - p) + (1 - q) (1 - p^{'})} h_{2} (f_{2}) + \frac{(1 - q) (1 - p^{'})}{q (1 - p) + (1 - q) (1 - p^{'})} h_{2} (f_{2}^{'}))$
$\leq (q p + (1 - q) p^{'}) h_{1} (\frac{q p}{q p + (1 - q) p^{'}} f_{1} + \frac{(1 - q) p^{'}}{q p + (1 - q) p^{'}} f_{1}^{'})$
$+ (q (1 - p) + (1 - q) (1 - p^{'})) h_{2} (\frac{q (1 - p)}{q (1 - p) + (1 - q) (1 - p^{'})} f_{2} + \frac{(1 - q) (1 - p^{'})}{q (1 - p) + (1 - q) (1 - p^{'})} f_{2}^{'})$
Also, we can verify that:
$(q p + (1 - q) p^{'}) (\frac{q p}{q p + (1 - q) p^{'}} f_{1} + \frac{(1 - q) p^{'}}{q p + (1 - q) p^{'}} f_{1}^{'})$
$+ (q (1 - p) + (1 - q) (1 - p^{'})) (\frac{q (1 - p)}{q (1 - p) + (1 - q) (1 - p^{'})} f_{2} + \frac{(1 - q) (1 - p^{'})}{q (1 - p) + (1 - q) (1 - p^{'})} f_{2}^{'})$
$= q p f_{1} + (1 - q) p^{'} f_{1}^{'} + q (1 - p) f_{2} + (1 - q) (1 - p^{'}) f_{2}^{'}$
$= q (p f_{1} + (1 - p) f_{2}) + (1 - q) (p^{'} f_{1}^{'} + (1 - p^{'}) f_{2}^{'}) \leq q f + (1 - q) f^{'}$
Therefore, it is a suitable parameter and pair of functions to lower bound
$q f + (1 - q) f^{'}$ . Accordingly
$(q p + (1 - q) p^{'}) h_{1} (\frac{q p}{q p + (1 - q) p^{'}} f_{1} + \frac{(1 - q) p^{'}}{q p + (1 - q) p^{'}} f_{1}^{'})$
$+ (q (1 - p) + (1 - q) (1 - p^{'})) h_{2} (\frac{q (1 - p)}{q (1 - p) + (1 - q) (1 - p^{'})} f_{2} + \frac{(1 - q) (1 - p^{'})}{q (1 - p) + (1 - q) (1 - p^{'})} f_{2}^{'})$
$\leq {sup}_{p^{*}, f_{1}^{*}, f_{2}^{*} : p^{*} f_{1}^{*} + (1 - p^{*}) f_{2}^{*} \leq q f + (1 - q) f^{'}} p^{*} h_{1} (f_{1}^{*}) + (1 - p^{*}) h_{2} (f_{2}^{*}) = sup (h_{1}, h_{2}) (q f + (1 - q) f^{'})$
Putting all this together, and picking better and better approximations to the two suprema, we can conclude that:
$q sup (h_{1}, h_{2}) (f) + (1 - q) sup (h_{1}, h_{2}) (f^{'}) \leq sup (h_{1}, h_{2}) (q f + (1 - q) f^{'})$
And we have concavity.

For normalization, we're assuming it holds at the start.

Lipschitzness takes a slightly more involved argument. Pick two functions $f$ and $f^{'}$ , and without loss of generality, assume $sup (h_{1}, h_{2}) (f) \geq sup (h_{1}, h_{2}) (f^{'})$ . Now, what we can do is pick a $p$ , $f_{1}$ and $f_{2}$ which approximately obtain the defining supremum for $f$ , so we have:
$sup (h_{1}, h)_{2}) (f) ≃ p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})$
Now, we can note two things. First,
$p (f_{1} - d (f, f^{'})) + (1 - p) (f_{2} - d (f, f^{'})) = (p f_{1} + (1 - p) f_{2}) - d (f, f^{'}) \leq f - d (f, f^{'}) \leq f^{'}$
Therefore, the same $p$ , and $f_{1} - d (f, f^{'})$ , and $f_{2} - d (f, f^{'})$ are suitable things to lower-bound the value of $sup (h_{1}, h_{2}) (f^{'})$ . In particular, we have:
$sup (h_{1}, h_{2}) (f^{'}) = {sup}_{q, f_{1}^{'}, f_{2}^{'} : q f_{1}^{'} + (1 - q) f_{2}^{'} \leq f^{'}} q h_{1} (f_{1}^{'}) + (1 - q) h_{2} (f_{2}^{'})$
$\geq p h_{1} (f_{1} - d (f, f^{'})) + (1 - p) h_{2} (f_{2} - d (f, f^{'}))$
Also, we have the result that:
$p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2}) - p h_{1} (f_{1} - d (f, f^{'})) - (1 - p) h_{2} (f_{2} - d (f, f^{'}))$
$= | (p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})) - (p h_{1} (f_{1} - d (f, f^{'})) - (1 - p) h_{2} (f_{2} - d (f, f^{'}))) |$
$\leq | p h_{1} (f_{1}) - p h_{1} (f_{1} - d (f, f^{'})) | + | (1 - p) h_{2} (f_{2}) - (1 - p) h_{2} (f_{2} - d (f, f^{'})) |$
$= p | h_{1} (f_{1}) - h_{1} (f_{1} - d (f, f^{'})) | + (1 - p) | h_{2} (f_{2}) - h_{2} (f_{2} - d (f, f^{'})) |$
$\leq p (λ_{1}^{⊙} \cdot d (f, f^{'})) + (1 - p) (λ_{2}^{⊙} \cdot d (f, f^{'})) \leq max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'})$
Because of Lipschitzness of $h_{1}$ and $h_{2}$ . Now we can begin showing our inequalities. So, we've shown that:
$sup (h_{1}, h_{2}) (f^{'}) \geq p h_{1} (f_{1} - d (f, f^{'})) + (1 - p) h_{2} (f_{2} - d (f, f^{'}))$
Therefore,
$p h_{1} (f_{1} - d (f, f^{'})) + (1 - p) h_{2} (f_{2} - d (f, f^{'})) - sup (h_{1}, h_{2}) (f^{'}) \leq 0$
With this result, we can go:
$max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'})$
$\geq max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'}) + p h_{1} (f_{1} - d (f, f^{'})) + (1 - p) h_{2} (f_{2} - d (f, f^{'})) - sup (h_{1}, h_{2}) (f^{'})$
Let's save this result for a bit later.
Also, we had:
$p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2}) - p h_{1} (f_{1} - d (f, f^{'})) - (1 - p) h_{2} (f_{2} - d (f, f^{'})) \leq max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'})$
And we also picked $p$ and $f_{1}$ and $f_{2}$ to approximately attain the supremum, so we know:
$p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2}) ≃ sup (h_{1}, h_{2}) (f)$
Therefore, we approximately have:
$sup (h_{1}, h_{2}) (f) - p h_{1} (f_{1} - d (f, f^{'})) - (1 - p) h_{2} (f_{2} - d (f, f^{'})) \leq max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'})$
Reshuffling this around a bit, we have:
$max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'}) + p h_{1} (f_{1} - d (f, f^{'})) + (1 - p) h_{2} (f_{2} - d (f, f^{'})) \geq sup (h_{1}, h_{2}) (f)$
Using this with our saved result, we can get:
$max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'})$
$\geq max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'}) + p h_{1} (f_{1} - d (f, f^{'})) + (1 - p) h_{2} (f_{2} - d (f, f^{'})) - sup (h_{1}, h_{2}) (f^{'})$
$\geq sup (h_{1}, h_{2}) (f) - sup (h_{1}, h_{2}) (f^{'}) \geq 0$
That last inequality was because we assumed at the start without loss of generality that $f$ got an equal or higher expectation than $f^{'}$ .
Therefore, we have our result that, in general,
$| sup (h_{1}, h_{2}) (f) - sup (h_{1}, h_{2}) (f^{'}) | \leq max (λ_{1}^{⊙}, λ_{2}^{⊙}) d (f, f^{'})$
And thus, the supremum of two Lipschitz infradistributions is Lipschitz. That just leaves compact almost-support, which is quite tricky to show.

Fix an arbitary $ϵ$ , and get a $C_{ϵ}^{1}$ compact $ϵ$ -almost-support for $h_{1}$ , and a $C_{ϵ}^{2}$ for $h_{2}$ . We will show that $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ is a compact $ϵ$ -almost-support for $sup (h_{1}, h_{2})$ . It's compact because it's a finite union of compact sets.

Now, let $f$ and $f^{'}$ agree on $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ . Without loss of generality, assume that $sup (h_{1}, h_{2}) (f) \geq sup (h_{1}, h_{2}) (f^{'})$ (if not, flip $f$ and $f^{'}$ ). We'll show that they have similar expectations by showing that $sup (h_{1}, h_{2}) (f) - sup (h_{1}, h_{2}) (f^{'})$ is below a small number (we already know that it's above 0 by our without-loss-of-generality assumption).

We can go:
$sup (h_{1}, h_{2}) (f) = {sup}_{p, f_{1}, f_{2} : p f_{1} + (1 - p) f_{2} \leq f} p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2}) ≃ p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})$
Where we picked a particular $p, f_{1}, f_{2}$ spectacularly close to the highest possible value s.t. $p f_{1} + (1 - p) f_{2} \leq f$ . In particular, if $p$ is 0 or 1, we can ensure that $f_{1}$ or $f_{2}$ is $f$ itself, by monotonicity of $h_{1}$ or $h_{2}$ respectively.

For successive arguments, we need $p \in (0, 1)$ so we have to address those endpoints. Assume $p = 1$ . Then, $sup (h_{1}, h_{2}) (f) ≃ h_{1} (f)$ . Then, we have:
$sup (h_{1}, h_{2}) (f) - sup (h_{1}, h_{2}) (f^{'})$
$≃ h_{1} (f) - sup (h_{1}, h_{2}) (f^{'})$
$\leq h_{1} (f^{'}) + ϵ d (f, f^{'}) - sup (h_{1}, h_{2}) (f^{'}) \leq ϵ d (f, f^{'})$
The way this works is our substitution, and then using that $f$ and $f^{'}$ are identical on $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ , and so are identical on $C_{ϵ}^{1}$ , which is $ϵ$ -almost-support of $h_{1}$ , we can upper-bound $h_{1} (f)$ with $h_{1} (f^{'}) + ϵ d (f, f^{'})$ . And then, we just use that $h_{1} (f^{'}) \leq sup (h_{1}, h_{2}) (f^{'})$ . If $p = 0$ , the exact same argument works, just with $h_{2}$ and $C_{ϵ}^{2}$ instead. That leaves the case where $p \in (0, 1)$ , which requires far more involved arguments.

As a recap, we're assuming that $sup (h_{1}, h_{2}) (f) \geq sup (h_{1}, h_{2}) (f^{'})$ , and that $sup (h_{1}, h_{2}) (f) ≃ p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})$ , and $p \in (0, 1)$ . Now, we're going to pick out a continuous function with some special properties, so let the set-valued function $ψ : X \to R^{2}$ be defined as: If $x \in C_{ϵ}^{1} \cup C_{ϵ}^{2}$ , then $ψ (x) = (f_{1} (x), f_{2} (x))$ . Otherwise, $ψ (x)$ equals the intersection of:
$[f_{1} (x) - d (f, f^{'}), f_{1} (x) + d (f, f^{'})] \times [f_{2} (x) - d (f, f^{'}), f_{2} (x) + d (f, f^{'})]$
and
${(y, z) | p y + (1 - p) z \leq f^{'} (x)}$
We'll find a continuous selection of this set-valued function, so let's start checking the properties needed to invoke the Michael selection theorem. We need that $X$ is paracompact (all polish spaces are paracompact, check), that $R^{2}$ is a Banach space (check), that for all $x$ , $ψ (x)$ is convex (it's either a single point or the intersection of a rectangle and a half-space, which is convex in both cases), closed (yup, it's either a point or the intersection of two closed sets, ie closed), nonempty, and lower-hemicontinuous.

Nonemptiness isn't too bad to show. It's nonempty for all points in our compact set of interest (the set consisting of a single point), and for x not in said set, $(f_{1} (x) - d (f, f^{'}), f_{2} (x) - d (f, f^{'}))$ witnesses the nonemptiness, because:
$p (f_{1} (x) - d (f, f^{'})) + (1 - p) (f_{2} (x) - d (f, f^{'})) = p f_{1} (x) + (1 - p) f_{2} (x) - d (f, f^{'})$
$\leq f (x) - d (f, f^{'}) \leq f^{'} (x)$
Lower-hemicontinuity is much more challenging to establish. Again, we have a sequence $x_{n}$ limiting to $x$ , a point $(y, z) \in ψ (x)$ , and we must find a subsequence $(y_{m}, z_{m}) \in ψ (x_{m})$ which limits to $(y, z)$ .

We can divide into three cases. In the first case, $x$ lies in $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ , and infinitely many members of the sequence lie in said set. In particular, since $x$ lies in the compact set, the $(y, z)$ pair associated with it must be $(f_{1} (x), f_{2} (x))$ . Then we can isolate that particular subsequence that lies in the compact set, and have $(y_{m}, z_{m})$ be $(f_{1} (x_{m}), f_{2} (x_{m}))$ , which, by continuity of $f_{1}$ and $f_{2}$ , and the definition of $ψ (x_{m})$ for $x_{m}$ in the compact set, lie in $ψ (x_{m})$ and limit to $(y, z)$ ie $(f_{1} (x), f_{2} (x))$ .

In preparation for the second and third cases, we'll show that the function $ψ^{'} : X \to R^{2}$ which just takes the second branch of the $ψ$ function is continuous w.r.t. the Hausdorff-metric. Ie, for all $x$ ,
$ψ^{'} (x) := [f_{1} (x) - d (f, f^{'}), f_{1} (x) + d (f, f^{'})] \times [f_{2} (x) - d (f, f^{'}), f_{2} (x) + d (f, f^{'})]$
$\cap {(y, z) | p y + (1 - p) z \leq f^{'} (x)}$
is continuous when the space of compact subsets of $R^{2}$ is equipped with a Hausdorff distance.

Accordingly, let $x_{m}$ limit to $x$ . Our task is to show that, no matter how tiny of a number you name, you can find a tail of the $x_{m}$ sequence where the Hausdorff distance between $ψ^{'} (x_{m})$ and $ψ^{'} (x)$ is that tiny.

Specifically, we'll show that for all $δ$ , there is some $m$ where all later $x_{m}$ have $ψ^{'} (x_{m})$ within $\frac{2 δ}{p} + \frac{2 δ}{1 - p} + 4 δ$ Hasdorff distance of $ψ^{'} (x)$ . Because $p \in (0, 1)$ and we can shrink $δ$ to 0, this shows that the function $ψ^{'}$ is continuous in Hausdorff-distance.

Because $f_{1}$ and $f_{2}$ and $f^{'}$ are continuous functions, there's some very very large $m$ where $f_{1}$ , $f_{2}$ , and $f^{'}$ will only vary by $δ$ from that point forward, regardless of which $δ$ you pick. Pick some arbitrary $(y_{m}, z_{m}) \in ψ^{'} (x_{m})$ . We'll show that it's close to a $(y, z) \in ψ^{'} (x)$ , and the argument will only depend on distances, not position in sequence, so we can flip it to show the other half of Hausdorff-distance (all points in $ψ^{'} (x)$ are close to a point in $ψ^{'} (x_{m})$ ).

We can divide into four possible cases. In cases 1 and 2, we have the following property holding.
$y_{m} \geq f_{1} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{p} + δ$
With the negation for cases 3 and 4.

And in cases 1 and 3, we have:
$z_{m} \geq f_{2} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{1 - p} + δ$
With the negation for cases 2 and 4.

In cases 1 and 2, you can let your selected $y$ point be $y_{m} - \frac{2 δ}{p}$ . We have the result that $y \in [f_{1} (x) - d (f, f^{'}), f_{1} (x) + d (f, f^{'})]$ , because:
$f_{1} (x) + d (f, f^{'}) \geq f_{1} (x_{m}) - δ + d (f, f^{'}) \geq y_{m} - δ \geq y_{m} - \frac{2 δ}{p} = y$
$\geq f_{1} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{p} + δ - \frac{2 δ}{p} = f_{1} (x_{m}) - d (f, f^{'}) + δ \geq f_{1} (x) - d (f, f^{'})$
In order, the first inequality is because $f_{1}$ only varies by $δ$ over such tiny distances due to continuity of $f_{1}$ , the second inequality is $y_{m}$ being paired with something to be in $ψ^{'} (x_{m})$ so it has a known upper bound on its value, then the third inequality is because $p < 1$ , the equality is our definition of our $y$ , then for the next inequality using the fact that we're assuming that $y_{m}$ has a particular lower bound since we're in cases 1 and 2, Then there's just a cancellation, and $f_{1}$ only varying by $δ$ over such tiny distances.

You can use nearly identical arguments in cases 1 and 3 to get that, when you define $z$ to be $z_{m} - \frac{2 δ}{1 - p}$ . you have the result that $z \in [f_{2} (x) - d (f, f^{'}), f_{2} (x) + d (f, f^{'})]$

Now, in cases 3 and 4, we can let $y$ be: $f_{1} (x) - d (f, f^{'})$ , and then we have:
$- δ = f_{1} (x) - δ - d (f, f^{'}) - (f_{1} (x) - d (f, f^{'}))$
$= f_{1} (x) - δ - d (f, f^{'}) - y \leq f_{1} (x_{m}) - d (f, f^{'}) - y \leq y_{m} - y$
$\leq f_{1} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{p} + δ - y \leq f_{1} (x) + δ - d (f, f^{'}) + \frac{2 δ}{p} + δ - y$
$= f_{1} (x) - d (f, f^{'}) + \frac{2 δ}{p} + 2 δ - y = f_{1} (x) - d (f, f^{'}) + \frac{2 δ}{p} + 2 δ - (f_{1} (x) - d (f, f^{'}))$
$= \frac{2 δ}{p} + 2 δ$
The first equality is just pair-creation, then the second one is packing up the definition of $y$ . The first inequality is because $f_{1}$ only varies by $δ$ over that distance, the second inequality is because $y_{m} \in ψ^{'} (x_{m})$ so it's got the usual lower bound, then the next inequality after that is because we're in cases 3 and 4 so
$y_{m} < f_{1} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{p} + δ$
Then, it's just another " $f_{1}$ doesn't change much over the tiny distance", moving the $δ$ 's together, unpacking $y$ , and cancelling out. The net result is that we have:
$| y_{m} - y | \leq \frac{2 δ}{p} + 2 δ$
You can use nearly identical arguments in cases 2 and 4 to get that, when you define $z$ to be be $f_{2} (x) - d (f, f^{'})$ you have the result that $| z_{m} - z | \leq \frac{2 δ}{1 - p} + 2 δ$ .

At this point, we can resume our progress on the four cases and go "ok, in case 1, we have..."
$y_{m} \geq f_{1} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{p} + δ$
$z_{m} \geq f_{2} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{1 - p} + δ$
And we know that those properties lead to $y$ being defined as $y_{m} - \frac{2 δ}{p}$ and $z$ being defined as $z_{m} - \frac{2 δ}{p}$ . And we know that in that case,
$(y, z) \in [f_{1} (x) - d (f, f^{'}), f_{1} (x) + d (f, f^{'})] \times [f_{2} (x) - d (f, f^{'}), f_{2} (x) + d (f, f^{'})]$
So, all we have to check is that $p y + (1 - p) z \leq f^{'} (x)$ in order to conclude that $(y, z) \in ψ^{'} (x)$ . Let's do that.
$p y + (1 - p) z = p (y_{m} - \frac{2 δ}{p}) + (1 - p) (z_{m} - \frac{2 δ}{1 - p}) = p y_{m} + (1 - p) z_{m} - 4 δ$
$\leq f^{'} (x_{m}) - 4 δ \leq f^{'} (x) + δ - 3 δ = f^{'} (x) - 2 δ < f^{'} (x)$
And we have that $(y, z) \in ψ^{'} (x)$ , accordingly. The first equality was unpacking definitions, then the second was some cancellation, and then the first inequality was because $(y_{m}, z_{m}) \in ψ^{'} (x_{m})$ by assumption so we have $p y_{m} + (1 - p) z_{m} \leq f^{'} (x_{m})$ . The second inequality was because $f^{'}$ doesn't change much over such tiny distances and then it's just trivial cleanup.

Thus, when we picked a point $(y_{m}, z_{m}) \in ψ^{'} (x_{m})$ where $x_{m}$ is sufficiently close to $x$ , and we're in case 1, we have that there are points $(y, z) \in ψ^{'} (x)$ , and
$d ((y_{m}, z_{m}), (y, z)) = | y_{m} - y | + | z_{m} - z | = ∣ ∣ y_{m} - y_{m} + \frac{2 δ}{p} ∣ ∣ + ∣ ∣ z_{m} - z_{m} + \frac{2 δ}{(1 - p)} ∣ ∣$
$= \frac{2 δ}{p} + \frac{2 δ}{(1 - p)}$
This is from the definitions of $y$ and $z$ in Case 1.

Now, let's address case 2, where
$y_{m} \geq f_{1} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{p} + δ$
$z_{m} < f_{2} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{1 - p} + δ$
In this case, $y$ is defined as $y_{m} - \frac{2 δ}{p}$ , and $z$ is defined as $f_{2} (x) - d (f, f^{'})$
And we know that in that case,
$(y, z) \in [f_{1} (x) - d (f, f^{'}), f_{1} (x) + d (f, f^{'})] \times [f_{2} (x) - d (f, f^{'}), f_{2} (x) + d (f, f^{'})]$
(the first part on the $y$ is the same argument from case 1, the second interval is from the value of $z$ )
So, all we have to check is that $p y + (1 - p) z \leq f^{'} (x)$ in order to conclude that $(y, z) \in ψ^{'} (x)$ . We know that
$- δ \leq z_{m} - z$
So we can flip this a bit to get
$z \leq z_{m} + δ$
Accordingly, from that, we get:
$p y + (1 - p) z \leq p (y_{m} - \frac{2 δ}{p}) + (1 - p) (z_{m} + δ) = p y_{m} + (1 - p) z_{m} - 2 δ + (1 - p) δ$
$\leq f^{'} (x_{m}) - δ \leq f^{'} (x)$
And we have that $(y, z) \in ψ^{'} (x)$ , accordingly.The first inequality was definition unpacking and the inequality we just got, then the first equality is just breaking things up a bit, then the second inequality is just observing that $1 - p \leq 1$ , and then $f^{'}$ doesn't change much over such tiny distances.

Thus, when we picked a point $(y_{m}, z_{m}) \in ψ^{'} (x_{m})$ where $x_{m}$ is sufficiently close to $x$ , and we're in case 2, we have that there are points $(y, z) \in ψ^{'} (x)$ , and
$d ((y_{m}, z_{m}), (y, z)) = | y_{m} - y | + | z_{m} - z |$
$\leq ∣ ∣ y_{m} - y_{m} + \frac{2 δ}{p} ∣ ∣ + \frac{2 δ}{1 - p} + 2 δ = \frac{2 δ}{p} + \frac{2 δ}{(1 - p)} + 2 δ$
This is from the definitions of $y$ and $z$ in Case 2, and the fact that in case 2 we can derive $| z_{m} - z | \leq \frac{2 δ}{1 - p} + 2 δ$

Extremely similar arguments to case 2 dispatch case 3 with a resolution of the corresponding $(y, z)$ lying in $ψ^{'} (x)$ and
$d ((y_{m}, z_{m}), (y, z)) \leq \frac{2 δ}{p} + \frac{2 δ}{(1 - p)} + 2 δ$

Finally, for case 4, we have:
$y_{m} < f_{1} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{p} + δ$
$z_{m} < f_{2} (x_{m}) - d (f, f^{'}) + \frac{2 δ}{1 - p} + δ$
In this case, $y$ is defined as $f_{1} (x) - d (f, f^{'})$ , and $z$ is defined as $f_{2} (x) - d (f, f^{'})$
Trivially, we have:
$(y, z) \in [f_{1} (x) - d (f, f^{'}), f_{1} (x) + d (f, f^{'})] \times [f_{2} (x) - d (f, f^{'}), f_{2} (x) + d (f, f^{'})]$
So, all we have to check is that $p y + (1 - p) z \leq f^{'} (x)$ in order to conclude that $(y, z) \in ψ^{'} (x)$ . To do this, we have:
We know that
$- δ \leq z_{m} - z$
So we can flip this a bit to get
$z \leq z_{m} + δ$
Accordingly, from that, we get:
$p y + (1 - p) z = p (f_{1} (x) - d (f, f^{'}) + (1 - p) (f_{2} (x) - d (f, f^{'}))$
$= p f_{1} (x) + (1 - p) f_{2} (x) - d (f, f^{'}) \leq f (x) - d (f, f^{'}) \leq f^{'} (x)$
(because $p f_{1} + (1 - p) f_{2} \leq f$ )
And we have that $(y, z) \in ψ^{'} (x)$ , accordingly.

Thus, when we picked a point $(y_{m}, z_{m}) \in ψ^{'} (x_{m})$ where $x_{m}$ is sufficiently close to $x$ , and we're in case 4, we have that there are points $(y, z) \in ψ^{'} (x)$ , and
$d ((y_{m}, z_{m}), (y, z)) = | y_{m} - y | + | z_{m} - z | \leq \frac{2 δ}{1 - p} + 2 δ + \frac{2 δ}{1 - p} + 2 δ = \frac{2 δ}{p} + \frac{2 δ}{(1 - p)} + 4 δ$
This is from the definitions of $y$ and $z$ in Case 4, and the fact that in case 4 we can derive $| y_{m} - y | \leq \frac{2 δ}{1 - p} + 2 δ$ (and same for $z_{m}$ )

These 4 cases were exhaustive, so we now know that, given any $x$ and sequence of points $x_{m}$ limiting to $x$ , and any $δ$ , there is a tail of sufficiently large m's where the distance from any point in $ψ^{'} (x_{m})$ to $ψ^{'} (x)$ is $\frac{2 δ}{p} + \frac{2 δ}{(1 - p)} + 4 δ$ or less. We can also flip $x_{m}$ and $x$ and use our four cases (our argument is symmetric) to show that actually, this is a bound on the Hausdorff distance between $ψ^{'} (x_{m})$ and $ψ^{'} (x)$ . $δ$ was arbitrary, as was the sequence $x_{m}$ and the $x$ , so this means that $ψ^{'}$ is continuous in Hausdorff-distance.

Ok, we're a bit in the weeds here, how does that help? Well, we were trying to verify the compact almost-support property for the supremum. This requires, as part of it, getting a continuous function with some special properties. We're going to apply a selection function to get it, but we could only take care of the prerequisites that aren't lower-hemicontinuity. And to show lower-Hemicontinuity in general, we needed to take this detour through showing that the modified set-valued function is continuous in Hausdorff-distance. So let's pop back up the stack.

One level back up the stack, we were trying to show lower-Hemicontinuity. It is the property that given any sequence $x_{n}$ which limits to $x$ , and any $(y, z) \in ψ (x)$ , there is some subsequence $x_{m}$ and $(y_{m}, z_{m}) \in ψ (x_{m})$ where $(y_{m}, z_{m})$ limits to $ψ (x_{m})$ . We dispatched the case where infinitely many elements of the sequence were in our $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ , leaving two cases. There's the case where only finitely elements of that sequence are in that compact set, but the limit point $x$ lies in that set. There's also the case where the limit point $x$ doesn't lie in that set.

Dealing with case 3, we have a sequence $x_{n}$ heading to $x$ . Strip off all the $x_{n}$ that lie in the compact set, making your $x_{m}$ . And let $(y_{m}, z_{m})$ be whichever point in $ψ (x_{m})$ is closest to $(y, z)$ . Now, by how they were defined, $ψ (x_{m}) = ψ^{'} (x_{m})$ , and $ψ^{'}$ is continuous in Hausdorff-distance, so "take the closest point" is definitely going to get you the convergence you seek to your arbitrarily selected $(y, z) \in ψ (x)$ point.

For case 2, where we're limiting to $x$ from outside the compact set, all we need to show is that $ψ (x) \subseteq ψ^{'} (x)$ (we don't necessarily have equality because $ψ$ and $ψ^{'}$ start being different on that compact set), in order to get a sequence $(y_{m}, z_{m})$ converging to the $(y, z) \in ψ (x)$ point. So, let's do this. Because $x$ lies in $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ , we have that $ψ (x) = (f_{1} (x), f_{2} (x))$ .
The conditions for $(y, z)$ to be in $ψ^{'} (x)$ are:
$(y, z) \in [f_{1} (x) - d (f, f^{'}), f_{1} (x) + d (f, f^{'})] \times [f_{2} (x) - d (f, f^{'}), f_{2} (x) + d (f, f^{'})]$
Which is obviously true for $f_{1} (x), f_{2} (x)$ , and:
$p y + (1 - p) z \leq f^{'} (x)$
Which is the case because:
$p f_{1} (x) + (1 - p) f_{2} (x) \leq f (x) = f^{'} (x)$
By how $f$ was made, and $f^{'} = f$ on that compact set.

Thus, we're done, we verified lower-hemicontinuity for $ψ$ in all the cases, so we can invoke the Michael selection theorem and get a continuous selection $f^{*} : X \to R^{2}$ with three valuable properties. Let's abbreviate $p r_{1} (f^{*} (x))$ as $f_{1}^{'}$ , for notational convenience. It's projecting it to the first coordinate. $f_{2}^{'}$ is defined similarly.

Our first notable property is:
$x \in C_{ϵ}^{1} \cup C_{ϵ}^{2} \to f_{1}^{'} (x) = f_{1} (x) \land f_{2}^{'} (x) = f_{2} (x)$
(ie, projecting $f^{*}$ down to the two coordinates makes functions which perfectly mimic $f_{1}$ and $f_{2}$ on the compact set of interest)

Our second one is:
$d (f_{1}, f_{1}^{'}) \leq d (f, f^{'})$
And the same for $d (f_{2}, f_{2}^{'})$ .

And our third notable property is that:
$p f_{1}^{'} + (1 - p) f_{2}^{'} \leq f^{'}$
But why do these properties hold of our selection function? Well, when $x$ lies in that compact set, $ψ (x) = (f_{1} (x), f_{2} (x))$ , so our selection function is forced to have its projections mimic $f_{1}$ and $f_{2}$ on said compact set, taking care of the first one.

For our second property, we have:
$f^{*} (x) \in ψ (x) \subseteq [f_{1} (x) - d (f, f^{'}), f_{1} (x) + d (f, f^{'})] \times [f_{2} (x) - d (f, f^{'}), f_{2} (x) + d (f, f^{'})]$
Accordingly, we know that the projections to the two coordinates can't be too far away from $f_{1}$ and $f_{2}$ respectively.

For our third property, we have:
$f^{*} (x) \in ψ (x) \subseteq {(y, z) | p y + (1 - p) z \leq f^{'} (x)}$
Accordingly, the projections to the two coordinates can't mix to exceed the function $f^{'}$ .

So, where to from here? Well, we have:
$| sup (h_{1}, h_{2}) (f) - sup (h_{1}, h_{2}) (f^{'}) | = sup (h_{1}, h_{2}) (f) - sup (h_{1}, h_{2}) (f^{'})$
$≃ p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2}) - sup (h_{1}, h_{2}) (f^{'})$
$\leq p (h_{1} (f_{1}^{'}) + ϵ d (f, f^{'})) + (1 - p) h_{2} (f_{2}^{'}) + ϵ d (f, f^{'})) - sup (h_{1}, h_{2}) (f^{'})$
$= p h_{1} (f_{1}^{'}) + (1 - p) h_{2} (f_{2}^{'}) - sup (h_{1}, h_{2}) (f^{'}) + ϵ d (f, f^{'}) \leq ϵ d (f, f^{'})$
Here's why. We assumed at the very start that without loss of generality, we'd take $f$ to be the one with higher expectation value. We found a $p, f_{1}, f_{2}$ that nearly replicated the expectation value of $sup (h_{1}, h_{2}) (f)$ . $f_{1}^{'}$ copies $f_{1}$ on a compact almost-support of $h_{1}$ , namely $C_{ϵ}^{1}$ , and we also have $d (f_{1}^{'}, f_{1}) \leq d (f^{'}, f)$ , and similar for $f_{2}^{'}$ and $f_{2}$ . And finally, since $p f_{1}^{'} + (1 - p) f_{2}^{'} \leq f^{'}$ , that mix must have lower value than $sup (h_{1}, h_{2}) (f^{'})$ . And we're done! $f$ and $f^{'}$ were arbitrary except that they agreed on $C_{ϵ}^{1} \cup C_{ϵ}^{2}$ , a compact set, and we got:
$sup (h_{1}, h_{2}) (f) - sup (h_{1}, h_{2}) (f^{'}) \leq ϵ d (f, f^{'})$
Witnessing that said set is a compact $ϵ$ -almost-support. $ϵ$ was arbitrary, so $sup (h_{1}, h_{2})$ is compactly-almost-supported. This is the last condition needed to check to see that it's an infradistribution.

Proposition 15: All three characterizations of the supremum given in Definition 13 are identical.

So, the first characterization we gave was:
$sup (h_{1}, h_{2}) (f) := {sup}_{p, f_{1}, f_{2} : p f_{1} + (1 - p) f_{2} \leq f} p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})$
And the second characterization was the least infradistribution greater than $h_{1}, h_{2}$ in the information ordering.

And the third characterization was as the concave monotone hull of $f \mapsto sup (h_{1} (f), h_{2} (f))$ .

We will use ${sup}_{1}, {sup}_{2}, {sup}_{3}$ for these three characterizations of the supremum of two infradistributions and show that they are equal.

Let's begin showing this.
${sup}_{2} (h_{1}, h_{2}) (f) \geq {sup}_{ζ \in Δ N, f_{i} \in \prod_{i} C_{B} (X) : E_{ζ} f_{i} \leq f} ({sup}_{2} (h_{1}, h_{2}) (E_{ζ} (f_{i})))$
This occurs by monotonicity, any mix of functions which undershoots $f$ must get a lower score because ${sup}_{2} (h_{1}, h_{2})$ is an infradistribution.
$\geq {sup}_{ζ \in Δ N, f_{i} \in \prod_{i} C_{B} (X) : E_{ζ} f_{i} \leq f} E_{ζ} ({sup}_{2} (h_{1}, h_{2}) (f))$
This is because of convexity of ${sup}_{2} (h_{1}, h_{2})$ , since it's an infradistribution. The value of the mix is as good or better than the mix of the values.
$\geq {sup}_{ζ \in Δ N, f_{i} \in \prod_{i} C_{B} (X) : E_{ζ} f_{i} \leq f} E_{ζ} (sup (h_{1} (f_{i}), h_{2} (f_{i}))) = {sup}_{3} (h_{1}, h_{2}) (f)$
This is because ${sup}_{2} (h_{1}, h_{2}) \geq h_{1}$ (and same for $h_{2}$ ), so making that swap decreases the value. Also, this quantity is the concave monotone hull of the supremum of $h_{1}, h_{2}$ . Why? Well, $sup (h_{1} (f), h_{2} (f))$ is our first attempt at assessing the value of a function $f$ . However, it isn't necessarily monotone. So, ${sup}_{f^{*} \leq f} sup (h_{1} (f^{*}), h_{2} (f^{*}))$ is the monotone hull, we're saying that if there's a value below you that outscores you, then you should update the value of $f$ to be big enough. And then, to get the concave monotone hull, we replace the lower bound on $f$ with a countable/arbitrary finite mix of functions because any concave function should have the value of the mix be $\geq$ the mix of the values, so we have to bump the value of $f^{*}$ up to at least the mix of the values to not violate concavity. Anyways, now that we know this is ${sup}_{3} (h_{1}, h_{2}) (f)$ , we can go further to:
$\geq {sup}_{p, f_{1}, f_{2} : p f_{1} + (1 - p) f_{2} \leq f} (p sup (h_{1} (f_{1}), h_{2} (f_{1})) + (1 - p) sup (h_{1} (f_{2}), h_{2} (f_{2})))$
This is lower because now we're specializing to only certain sorts of probability distributions over $N$ , those that are only supported on the first two values, so it's harder to attain suprema. And now,
$\geq {sup}_{p, f_{1}, f_{2} : p f_{1} + (1 - p)_{2} \leq f} (p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})) = {sup}_{1} (h_{1}, h_{2}) (f)$
We swapped out the supremum for a specific term in it in order to do this, and used our given definition of ${sup}_{1}$ . And then we can specialize $p$ to 1 and $f_{1}$ to $f$ itself, to get
$\geq h_{1} (f)$
Similarly, we could specialize to $p = 0$ and $f_{2} = f$ to get $\geq h_{2} (f)$ . So taking stock of what we have,
${sup}_{2} (h_{1}, h_{2}) (f) \geq {sup}_{3} (h_{1}, h_{2}) (f) \geq {sup}_{1} (h_{1}, h_{2}) (f) \geq sup (h_{1} (f), h_{2} (f))$
For all functions, so:
${sup}_{2} (h_{1}, h_{2}) \geq {sup}_{3} (h_{1}, h_{2}) \geq {sup}_{1} (h_{1}, h_{2}) \geq h_{1}$
(and same for $h_{2}$ ) We recall that in Proposition 14 we proved that ${sup}_{1}$ always makes an infradistribution. Since ${sup}_{1}$ is above both component infradistributions, and ${sup}_{2}$ was defined as the least infradistribution that is above $h_{1}$ and $h_{2}$ , we must have equality, and
${sup}_{2} (h_{1}, h_{2}) = {sup}_{3} (h_{1}, h_{2}) = {sup}_{1} (h_{1}, h_{2}) \geq h_{1}$
(and same for $h_{2}$ ) And we've shown the three definitions of the supremum are identical.

Proposition 16:
$E_{sup (H_{1}, H_{2})} (f) = {sup}_{p, f_{1}, f_{2} : p f_{1} + (1 - p) f_{2} \leq f} p E_{H_{1}} (f_{1}) + (1 - p) E_{H_{2}} (f_{2})$

To recap,
$sup (H_{1}, H_{2}) := H_{1} \cap H_{2}$
Now, $sup (H_{1}, H_{2})$ can be turned into a concave monotone functional $C_{B} (X) \to R$ , by LF-duality. Further, it's convex, closed, and upper-complete due to being the intersection of two convex closed upper-complete sets. Let's use $h$ to refer to its corresponding functional. Then:
$h (f) = E_{sup (H_{1}, H_{2})} (f) = {inf}_{(m, b) \in H_{1} \cap H_{2}} m (f) + b$
$\geq {inf}_{(m, b) \in H_{1}} m (f) + b = E_{H_{1}} (f) = h_{1} (f)$
And the same applies to $H_{2}$ , and this applies to all functions, so $h \geq h_{1}$ (and same for $h_{2}$ ).

We know from Proposition 15 that the least concave monotone functional above $h_{1}$ and $h_{2}$ is $sup (h_{1}, h_{2})$ , so $h \geq sup (h_{1}, h_{2}) \geq h_{1}$ (and same for $h_{2}$ ) Call the corresponding set of $sup (h_{1}, h_{2})$ as $H_{sup}$ . Thus, translating this information ordering back to sets,
$sup (H_{1}, H_{2}) \subseteq H_{sup} \subseteq H_{1}$
And same for $H_{2}$ . Therefore.
$sup (H_{1}, H_{2}) \subseteq H_{sup} \subseteq H_{1} \cap H_{2} = sup (H_{1}, H_{2})$
Therefore, all the subsets must be actual equalities, and so in particular we have:
$H_{sup} = sup (H_{1}, H_{2})$
Then we can go:
$E_{sup (H_{1}, H_{2})} (f) = sup (h_{1}, h_{2}) (f)$
$= {sup}_{p, f_{1}, f_{2}} (p h_{1} (f_{1}) + (1 - p) h_{2} (f_{2})) = {sup}_{p, f_{1}, f_{2} : p f_{1} + (1 - p) f_{2} \leq f} (p E_{H_{1}} (f_{1}) + (1 - p) E_{H_{2}} (f_{2}))$
By $sup (H_{1}, H_{2})$ being equivalent to the infradistribution set induced by $sup (h_{1}, h_{2})$ , expanding our definition of the sup, and translating back. And we're done!

Proposition 17: For any property in the table at the start of this section, $sup (h_{1}, h_{2})$ will fulfill the property if both components fulfill the property.

The way to show this is to use the alternate characterizations of supremum as intersection of the infradistribution sets, and the alternate characterizations of the various properties in terms of properties of minimal points.

We will make an observation used in all further proofs of properties. In order for $H_{1}$ to have $(λ μ, b)$ in it, there must be a minimal point of the form $(λ μ, b_{1})$ with $b_{1} \leq b$ below it. Similarly, for $H_{2}$ to contain $(λ μ, b)$ , there must be a minimal point of the form $(λ μ, b_{2})$ below it, with $b_{2} \leq b$ .

Thus, for $(λ μ, b)$ to lie in $(H_{1} \cap H_{2})^{min}$ , $(λ μ, b_{1}) \in H_{1}^{min}$ and $(λ μ, b_{2}) \in H_{2}^{min}$ and $b = sup (b_{1}, b_{2})$ . Part of this is because said point lies in $H_{1}$ and $H_{2}$ , the other part is because $(λ μ, sup (b_{1}, b_{2}))$ is the lowest possible point in $H_{1} \cap H_{2}$ associated with a measure component of $λ μ$ , and it's the minimal. This observation will be used for all future sub-proofs in this proposition.

Homogenity: This is equivalent to "all minimal points have $b = 0$ ", so if $(λ μ, b) \in (H_{1} \cap H_{2})^{min}$ , then $(λ μ, 0) \in H_{1}^{min}$ (homogenity for $H_{1}$ ), and same for $H_{2}^{min}$ , so $b = sup (0, 0) = 0$ .

1-Lipschitzness: This is equivalent to "all minimal points have $λ \leq 1$ ", so if $(λ μ, b) \in (H_{1} \cap H_{2})^{min}$ , then $(λ μ, b_{1}) \in H_{1}^{min}$ , and $λ \leq 1$ (1-Lipschitzness of $H_{1}$ ), so $λ \leq 1$ .

Cohomogenity: This is equivalent to "all minimal points have $λ + b = 1$ ", so if $(λ μ, b) \in (H_{1} \cap H_{2})^{min}$ , then $(λ μ, b_{1}) \in H_{1}^{min}$ , and $λ + b_{1} = 1$ (cohomogenity of $H_{1}$ ), and $(λ μ, b_{2}) \in H_{2}^{min}$ , and $λ + b_{2} = 1$ , so $b_{1} = b_{2}$ . Then, $λ + b = λ + sup (b_{1}, b_{2}) = λ + b_{1} = 1$ .

C-additivity: This is equivalent to "all minimal points have $λ = 1$ ", so if $(λ μ, b) \in (H_{1} \cap H_{2})^{min}$ , then $(λ μ, b_{1}) \in H_{1}^{min}$ , and $λ = 1$ (C-additivity of $H_{1}$ ), so $λ = 1$ .

Crispness: This is equivalent to the conjunction of homogenity and C-additivity, both of which are preserved, so crispness is preserved as well.

Sharpness: Because all sharp infradistributions are crisp, $(H_{1} \cap H_{2})^{min}$ must be composed entirely of probability distributions if $H_{1}$ and $H_{2}$ are sharp. If any of the probability distributions in $(H_{1} \cap H_{2})^{min}$ aren't supported on $C_{1}$ (the compact set associated with the sharp infradistribution $H_{1}$ ), then they aren't in $H_{1}$ , which is impossible. Symmetric arguments apply to $H_{2}$ . Thus, $(H_{1} \cap H_{2})^{min}$ only has probability distributions supported on $C_{1} \cap C_{2}$ . If there was any probability distribution supported on that set that was missing from $(H_{1} \cap H_{2})^{min}$ , then it'd be present in $H_{1}$ and $H_{2}$ , and thus present in $H_{1} \cap H_{2}$ , and minimal, so we have a contradiction. Therefore $(H_{1} \cap H_{2})^{min}$ consists of all probability distributions supported on $C_{1} \cap C_{2}$ which is a compact set, so the supremum is sharp as well.

Proposition 18: If a family of infradistributions $h_{i}$ is directifiable, then ${sup}_{i} h_{i}$ (defined as the functional corresponding to the set $⋂_{i} H_{i}$ ) exists and is an infradistribution. Further, for all conditions listed in the table, if all the $h_{i}$ fulfill them, then ${sup}_{i} h_{i}$ fulfills the same property.

A family of infradistributions being directifiable is equivalent to "for any collection of finitely many infradistributions, the supremum exists". We also know that the supremum is exactly equivalent to set intersection. So, we'll show that directifiability (any collection of finitely many infradistributions has a supremum) implies that the intersection of all the infradistribution sets has the exact properties of a set-form infradistribution.

We have six properties to check. Nonemptiness, normalization (the existence of a point $(λ μ, 0)$ , existence of a point $(λ μ, b)$ with $λ + b = 1$ and nonexistence of points with $λ + b < 1$ ), closure, convexity, upper-completion, and compact-projection (the measure components of the infradistribution are contained in a compact set of measures).

For closure, it's the intersection of closed sets, so it's closed. For convexity, it's the intersection of convex sets, so it's convex. For upper-completion, it's the intersection of upper-complete sets, so it's upper-complete. For compact-projection, the measure components of the countable intersection are contained within the countable intersection of the sets of measure components, which is contained in a compact set, so it fulfills that property too.

This just leaves nonemptiness and normalization. We'll show normalization, which automatically implies nonemptiness. The nonexistence of points with $λ + b < 1$ is definitely not preserved under intersection.

However, the compact-projection property means that for any infradistribution set $H_{i}$ , the intersection of it with the surface of a-measures where $λ + b = 1$ is compact, so we're intersecting a bunch of compact sets. Due to the existence of supremum infradistributions for each collection of finitely many infradistributions (directifiability), we have the nonempty finite intersection property needed to conclude that the intersection of compact sets is nonempty. The same argument applies to the existence of a point with $b = 0$ . The presence of those two points witnesses nonemptiness and normalization.

These are the last two conditions we needed to conclude the set represents an infradistribution, so the infinite supremum exists and is the infradistribution we need.

For preservation of the various properties, we can just reuse the arguments from Proposition 17 with only trivial modifications.

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

4

LBIT Proofs 2: Propositions 10-18

4