Suppose we have a bunch of earthquake sensors spread over an area. They are not perfectly reliable (in terms of either false positives or false negatives), but some are more reliable than others. How can we aggregate the sensor data to detect earthquakes?

A “naive” seismologist without any statistics background might try assigning different numerical scores to each sensor, roughly indicating how reliable their positive and negative results are, just based on the seismologist’s intuition. Sensor $i$ gets a score $s_i^+$ when it’s going off, and $s_i^-$ when it’s not. Then, the seismologist can add up the scores for all sensors going off at a given time, plus the scores for sensors not going off, to get an aggregate “earthquake score”. Assuming the seismologist has decent intuitions for the sensors, this will probably work just fine.
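
To make the procedure concrete, here’s a minimal sketch in Python; the sensor names and score values are hypothetical, chosen purely for illustration.

```python
# Hypothetical intuition-based reliability scores.
# score_on[i] is added when sensor i is going off; score_off[i] when it is not.
score_on  = {"A": 2.0, "B": 0.5, "C": 3.0}    # a reliable sensor firing is strong evidence
score_off = {"A": -1.5, "B": -0.2, "C": -2.5} # a reliable sensor staying silent counts against

def earthquake_score(readings):
    """Sum the appropriate score for every sensor, firing or silent."""
    return sum(score_on[s] if firing else score_off[s]
               for s, firing in readings.items())

print(earthquake_score({"A": True, "B": False, "C": True}))  # 2.0 - 0.2 + 3.0 = 4.8
```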

It turns out that this procedure is equivalent to a Naive Bayes model.

Naive Bayes is a causal model in which there is some parameter $\Theta$ in the environment which we want to know about - in our example, whether or not there’s an earthquake happening. We can’t observe $\Theta$ directly, but we can measure it indirectly via some data $X$ - in our example, the outputs from the earthquake sensors. The measurements may not be perfectly accurate, but their failures are at least independent given the true value of $\Theta$ - one sensor isn’t any more or less likely to be wrong when another sensor is wrong.

We can represent this picture with a causal diagram:

(Diagram: a single unobserved node $\Theta$, with an arrow from $\Theta$ to each of the observations $X_1, X_2, \ldots, X_n$.)

From the diagram, we can read off the model’s equation: $P[\Theta, X] = P[\Theta] \prod_i P[X_i | \Theta]$. We’re interested mainly in the posterior probability $P[\Theta | X] = \frac{1}{Z} P[\Theta] \prod_i P[X_i | \Theta]$ (with normalizer $Z = P[X]$) or, in log odds form,

$$\ln \frac{P[\Theta = 1 | X]}{P[\Theta = 0 | X]} = \ln \frac{P[\Theta = 1]}{P[\Theta = 0]} + \sum_i \ln \frac{P[X_i | \Theta = 1]}{P[X_i | \Theta = 0]}$$

Stare at that equation, and it’s not hard to see how the seismologist’s procedure turns into a Naive Bayes model: the seismologist’s intuitive scores $s_i^+$ and $s_i^-$ for each sensor correspond to the “evidence” terms $\ln \frac{P[X_i | \Theta = 1]}{P[X_i | \Theta = 0]}$ from the sensor. The “earthquake score” then corresponds to the posterior log odds of an earthquake (with the prior log odds implicitly set to zero, i.e. even odds). The seismologist has unwittingly adopted a statistical model. Note that this is still true regardless of whether the scores used are well-calibrated or whether the assumptions of the model hold - the seismologist is implicitly using this model, and whether the model is correct is an entirely separate question.
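
Here’s a sketch of that correspondence in code, with made-up sensor accuracies: if each score is the matching log likelihood ratio, then the add-all-the-scores calculation computes exactly the posterior log odds, up to the prior term.

```python
import math

# Hypothetical per-sensor accuracies: P[X_i = 1 | quake] and P[X_i = 1 | no quake].
p_fire_quake    = {"A": 0.9, "B": 0.6, "C": 0.95}
p_fire_no_quake = {"A": 0.1, "B": 0.4, "C": 0.05}
prior_quake = 0.01  # P[quake], purely illustrative

# Well-calibrated scores are log likelihood ratios.
score_on  = {s: math.log(p_fire_quake[s] / p_fire_no_quake[s])
             for s in p_fire_quake}
score_off = {s: math.log((1 - p_fire_quake[s]) / (1 - p_fire_no_quake[s]))
             for s in p_fire_quake}

def posterior_log_odds(readings):
    """Prior log odds plus the seismologist's earthquake score."""
    prior_term = math.log(prior_quake / (1 - prior_quake))
    score = sum(score_on[s] if firing else score_off[s]
                for s, firing in readings.items())
    return prior_term + score

print(posterior_log_odds({"A": True, "B": False, "C": True}))
```

The only piece the intuitive procedure drops is the prior term; everything else is the same sum of per-sensor scores as before.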

The Embedded Naive Bayes Equation

Let’s formalize this a bit.

We have some system which takes in data $X$, computes some stuff, and spits out some result $f(X)$. We want to know whether a Naive Bayes model is embedded in $f$. Conceptually, we imagine that $f(X)$ parameterizes a probability distribution over some unobserved parameter $\Theta$ - we’ll write $P[\Theta; f(X)]$, where the “;” is read as “parameterized by”. For instance, we could imagine a normal distribution over $\Theta$, in which case $f(X)$ might be the mean and variance (or any encoding thereof) computed from our input data. In our earthquake example, $\Theta$ is a binary variable, so $f(X)$ is just some encoding of the probability that $\Theta = 1$.
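
As a concrete (and purely illustrative) choice of encoding for the binary case: suppose $f(X)$ is a single real number, read as the log odds that $\Theta = 1$. Then $P[\Theta; f(X)]$ might be decoded like so:

```python
import math

def decode(theta, f_of_x):
    """P[theta ; f(X)], reading f(X) as the log odds that theta = 1."""
    p_one = 1.0 / (1.0 + math.exp(-f_of_x))  # sigmoid of the log odds
    return p_one if theta == 1 else 1.0 - p_one
```

Any invertible re-encoding of the same information would do just as well; nothing below depends on this particular choice.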

Now let’s write the actual equation defining an embedded Naive Bayes model. We assert that $P[\Theta; f(X)]$ is the same as $P[\Theta | X]$ under the model, i.e.

$$P[\Theta; f(X)] = \frac{1}{Z} P[\Theta] \prod_i P[X_i | \Theta]$$

We can transform to log odds form - dividing by the same expression evaluated at a fixed reference value $\Theta_0$ - to get rid of the $Z$:

$$\ln \frac{P[\Theta; f(X)]}{P[\Theta_0; f(X)]} = \ln \frac{P[\Theta]}{P[\Theta_0]} + \sum_i \ln \frac{P[X_i | \Theta]}{P[X_i | \Theta_0]}$$

Let’s pause for a moment and go through that equation. We know the function $f$, and we want the equation to hold for all values of $X$. $\Theta$ is some hypothetical thing out in the environment - we don’t know what it corresponds to, we just hypothesize that the system is modelling something it can’t directly observe. As with $X$, we want the equation to hold for all values of $\Theta$. The unknowns in the equation are the probability functions $P[\Theta; f(X)]$, $P[X_i | \Theta]$, and $P[\Theta]$. To make it clear what’s going on, let’s remove the probability notation for a moment, and just use functions $g$ and $h$, with $\Theta$ written as a subscript:

$$g_\Theta(f(X)) = \sum_i h^i_\Theta(X_i) + a_\Theta$$

This is a functional equation: for each value of $\Theta$, we want to find functions $g_\Theta$, $h^i_\Theta$, and a constant $a_\Theta$ such that the equation holds for all possible $X$ values. The solutions $g_\Theta$ and $h^i_\Theta$ can then be decoded to give our probability functions $P[\Theta; f(X)]$ and $P[X_i | \Theta]$, while $a_\Theta$ can be decoded to give our prior $P[\Theta]$. Each possible $\Theta$-value corresponds to a different set of solutions $g_\Theta$, $h^i_\Theta$, $a_\Theta$.

This particular functional equation is a variant of Pexider’s equation; you can read all about it in Aczél’s Lectures on Functional Equations and Their Applications, chapter 3. For our purposes, the most important point is: depending on the function $f$, the equation may or may not have a solution. In other words, there is a meaningful sense in which some functions $f$ do embed a Naive Bayes model, and others do not. Our seismologist’s procedure does embed a Naive Bayes model: let $g_\Theta$ be the identity function, let $a_\Theta$ be zero, and let $h^i_\Theta(X_i)$ be sensor $i$’s score ($s_i^+$ if it’s going off, $s_i^-$ if not), and we have a solution to the embedding equation with $f$ given by our seismologist’s add-all-the-scores calculation (although this is not the only solution). On the other hand, a procedure computing $f(X) = X_1 X_2 + X_3$ for real-valued inputs $X_1$, $X_2$, $X_3$ would not embed a Naive Bayes model: with this $f$, the embedding equation would not have any solutions.
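
Here’s a minimal numerical sanity check of that last claim, for the special case where $g_\Theta$ is the identity: if $f(X) = \sum_i h^i(X_i) + a$, then every “mixed second difference” of $f$ in two coordinates must vanish. The score-sum passes this test, while $X_1 X_2 + X_3$ fails it. (A nonzero mixed difference only rules out identity-$g$ solutions; ruling out every admissible $g$ takes the full functional-equation machinery.)

```python
def mixed_difference(f, x, i, j, d=1.0):
    """Evaluate f around x on a "rectangle" in coordinates i and j.
    This is zero for all x whenever f is additively separable."""
    def bump(moves):
        y = list(x)
        for k, step in moves:
            y[k] += step
        return f(y)
    return bump([(i, d), (j, d)]) - bump([(i, d)]) - bump([(j, d)]) + bump([])

additive  = lambda x: 2.0 * x[0] + 0.5 * x[1] + 3.0 * x[2]  # score-sum style
entangled = lambda x: x[0] * x[1] + x[2]                    # X1*X2 + X3

print(mixed_difference(additive,  [1.0, 2.0, 3.0], 0, 1))  # 0.0
print(mixed_difference(entangled, [1.0, 2.0, 3.0], 0, 1))  # 1.0 - not separable
```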

Comments

This seems interesting, but I think there are some assumptions that have been left out? What prevents us from taking $g_\Theta$ and $h^i_\Theta$ to be constants, which would give solutions for any $f$?

I am indeed leaving out some assumptions, mainly because I am not yet convinced of which assumptions are "right". The simplest assumption - used by Aczél - is that $g$ and $h$ are monotonic. But that's usually chosen more for mathematical convenience than for any principled reason, as far as I can tell. We certainly want some assumptions which rule out the trivial constant solution, but I'm not sure what they should be.


What do you mean by embedded here? It seems you are asking the question "does a particular input-output behavior / computation correspond to some Naive Bayes model", which is not what I would intuitively think of as "embedded Naive Bayes".

Here's the use case I have in mind. We have some neural network or biological cell or some other system performing a computation. It's been optimized via gradient descent/evolution, and we have some outside-view arguments saying that optimal reasoning should approximate Bayesian inference. We also know that the "true" behavior of the environment is causal - so optimal reasoning for our system should approximate Bayesian reasoning on some causal model of the environment.

The problem, then, is to go check whether the system actually is approximating Bayesian reasoning over some causal model, and what that causal model is. In other words, we want to check whether the system has a particular causal model (e.g. a Naive Bayes model) of its input data embedded within it.

What do you imagine "embedded" to mean?

I usually imagine the problems of embedded agency (at least when I'm reading LW/AF), where the central issue is that the agent is a part of its environment (in contrast to the Cartesian model, where there is a clear, bright line dividing the agent and the environment). Afaict, "embedded Naive Bayes" is something that makes sense in a Cartesian model, which I wasn't expecting.

It's not that big a deal, but if you want to avoid that confusion, you might want to change the word "embedded". I kind of want to say "The Intentional Stance towards Naive Bayes", but that's not right either.

Ok, that's what I was figuring. My general position is that the problems of agents embedded in their environment reduce to problems of abstraction, i.e. world-models embedded in computations which do not themselves obviously resemble world-models. At some point I'll probably write that up in more detail, although the argument remains informal for now.

The immediately important point is that, while the OP makes sense in a Cartesian model, it also makes sense without a Cartesian model. We can just have some big computation, and pick a little chunk of it at random, and say "does this part here embed a Naive Bayes model?" In other words, it's the sort of thing you could use to detect agenty subsystems, without having a Cartesian boundary drawn in advance.