Posterior probability

Source: Wikipedia, the free encyclopedia.

The posterior probability is a type of

Bayes' rule.[1] From an epistemological perspective, the posterior probability contains everything there is to know about an uncertain proposition (such as a scientific hypothesis, or parameter values), given prior knowledge and a mathematical model describing the observations available at a particular time.[2] After the arrival of new information, the current posterior probability may serve as the prior in another round of Bayesian updating.[3]

In the context of

highest posterior density interval (HPDI).[4] But while conceptually simple, the posterior distribution is generally not tractable and therefore needs to be either analytically or numerically approximated.[5]

Definition in the distributional case

In variational Bayesian methods, the posterior probability is the probability of the parameters given the evidence , and is denoted .

It contrasts with the likelihood function, which is the probability of the evidence given the parameters: .

The two are related as follows:

Given a prior belief that a probability distribution function is and that the observations have a likelihood , then the posterior probability is defined as

,[6]

where is the normalizing constant and is calculated as

for continuous , or by summing over all possible values of for discrete .[7]

The posterior probability is therefore

proportional to the product Likelihood · Prior probability.[8]

Example

Suppose there is a school with 60% boys and 40% girls as students. The girls wear trousers or skirts in equal numbers; all boys wear trousers. An observer sees a (random) student from a distance; all the observer can see is that this student is wearing trousers. What is the probability this student is a girl? The correct answer can be computed using Bayes' theorem.

The event is that the student observed is a girl, and the event is that the student observed is wearing trousers. To compute the posterior probability , we first need to know:

Given all this information, the posterior probability of the observer having spotted a girl given that the observed student is wearing trousers can be computed by substituting these values in the formula:

An intuitive way to solve this is to assume the school has N students. Number of boys = 0.6N and number of girls = 0.4N. If N is sufficiently large, total number of trouser wearers = 0.6N+ 50% of 0.4N. And number of girl trouser wearers = 50% of 0.4N. Therefore, in the population of trousers, girls are (50% of 0.4N)/(0.6N+ 50% of 0.4N) = 25%. In other words, if you separated out the group of trouser wearers, a quarter of that group will be girls. Therefore, if you see trousers, the most you can deduce is that you are looking at a single sample from a subset of students where 25% are girls. And by definition, chance of this random student being a girl is 25%. Every Bayes-theorem problem can be solved in this way.[9]

Calculation

The posterior probability distribution of one

prior probability distribution by the likelihood function, and then dividing by the normalizing constant
, as follows:

gives the posterior probability density function for a random variable given the data , where

  • is the prior density of ,
  • is the likelihood function as a function of ,
  • is the normalizing constant, and
  • is the posterior density of given the data .[10]

Credible interval

Posterior probability is a conditional probability conditioned on randomly observed data. Hence it is a random variable. For a random variable, it is important to summarize its amount of uncertainty. One way to achieve this goal is to provide a credible interval of the posterior probability.[11]

Classification

In

class-membership probabilities
. While statistical classification methods by definition generate posterior probabilities, Machine Learners usually supply membership values which do not induce any probabilistic confidence. It is desirable to transform or rescale membership values to class-membership probabilities, since they are comparable and additionally more easily applicable for post-processing.[12]

See also

References

  1. .
  2. .
  3. ^ Etz, Alex (2015-07-25). "Understanding Bayes: Updating priors via the likelihood". The Etz-Files. Retrieved 2022-08-18.
  4. .
  5. .
  6. .
  7. ISBN 978-1-4398-4095-5.{{cite book}}: CS1 maint: multiple names: authors list (link
    )
  8. ^ Ross, Kevin. Chapter 8 Introduction to Continuous Prior and Posterior Distributions | An Introduction to Bayesian Reasoning and Methods.
  9. ^ "Bayes' theorem - C o r T e x T". sites.google.com. Retrieved 2022-08-18.
  10. ^ "Posterior probability - formulasearchengine". formulasearchengine.com. Retrieved 2022-08-19.
  11. ^ Clyde, Merlise; Çetinkaya-Rundel, Mine; Rundel, Colin; Banks, David; Chai, Christine; Huang, Lizzy. Chapter 1 The Basics of Bayesian Statistics | An Introduction to Bayesian Thinking.
  12. S2CID 199007973
    .

Further reading