AI ALIGNMENT FORUM
AF

Wikitags
Main
13
Refresher
1

Conditional probability: Refresher

Edited by So8res, et al. last updated 11th Oct 2016

P(left∣right) is the probability that the thing on the left is true assuming the thing on the right is true, and it's defined as P(left∧right)P(right).

Thus, P(yellow∣banana) is the probability that a banana is yellow ("the probability of yellowness given banana"), while P(banana∣yellow) is the probability that a yellow object is a banana ("the probability of banana, given yellowness").[1]

P(x∧y) is used to denote the probability of both x and y being true simultaneously (according to some probability_distribution P). P(x∣y), pronounced "the conditional probability of x, given y", is defined to be the quantity

P(x∧y)P(y).

For example, in the Diseasitis problem, P(sick∣positive) is the probability that a patient is sick given a positive test result, and it's calculated by taking the 18% patients who are sick and have positive test results, and dividing by all 42% of the patients who got positive test results. That is, P(sick∣positive) = P(sick∧positive)P(positive).

Using a frequency diagram, we can visualize P(sick∣positive) as the probability of drawing a sick result from a bag of only those people in the population who got a positive result.

diseasitis frequency

bag of 18 and 24 patients

The "given" operator in P(x∣y) tells us to assume that y is true, to restrict our attention to only possible cases where y is true, and then ask about the probability of x within those cases.

Note that P(positive∣sick) is not the same as P(sick∣positive). To find the probability that a patient has a positive result given that they're sick, we can visualize taking the 20 sick patients and putting them in a group, and then asking the probability that a randomly selected one will have a positive result, which will be 1820=0.9 — so P(positive∣sick)=90%, while P(sick∣positive)≈43%. Mixing up which one is which is an unfortunate source of of many practical errors when you're trying to do these calculations using only the formal notation, at least until you get used to it. Just remember that P(left∣right) is the probability of the thing on the left given that the thing on the right is true.

  1. ^︎

    In general, P(v) is an abbreviation of P(V=v) for some variable V, which is assumed to be known from the context. For example, P(yellow) might stand for P(ColorOfNextObjectInBag=yellow) where ColorOfNextObjectInBag is a variable in our probability_distribution P, and yellow is one of the values that that variable can take on.

Parents:
Conditional probability
1
1
Discussion0
Discussion0