Statistical population

Source: Wikipedia, the free encyclopedia.

In statistics, a population is a set of similar items or events which is of interest for some question or experiment.[1] A statistical population can be a group of existing objects (e.g. the set of all stars within the Milky Way galaxy) or a hypothetical and potentially infinite group of objects conceived as a generalization from experience (e.g. the set of all possible hands in a game of poker).[2] A common aim of statistical analysis is to produce information about some chosen population.[3]

In

sample statistics
.

Mean

The population mean, or population

discrete probability distribution
of a random variable X, the mean is equal to the sum over every possible value weighted by the probability of that value; that is, it is computed by taking the product of each possible value x of X and its probability p(x), and then adding all these products together, giving .
continuous probability distribution. Not every probability distribution has a defined mean (see the Cauchy distribution
for an example). Moreover, the mean can be infinite for some distributions.

For a finite population, the population mean of a property is equal to the arithmetic mean of the given property, while considering every member of the population. For example, the population mean height is equal to the sum of the heights of every individual—divided by the total number of individuals. The

sample mean may differ from the population mean, especially for small samples. The law of large numbers states that the larger the size of the sample, the more likely it is that the sample mean will be close to the population mean.[8]

Sub population

A subset of a population that shares one or more additional properties is called a sub population. For example, if the population is all Egyptian people, a sub population is all Egyptian males; if the population is all pharmacies in the world, a sub population is all pharmacies in Egypt. By contrast, a sample is a subset of a population that is not chosen to share any additional property.

Descriptive statistics may yield different results for different sub populations. For instance, a particular medicine may have different effects on different sub populations, and these effects may be obscured or dismissed if such special sub populations are not identified and examined in isolation.

Similarly, one can often estimate parameters more accurately if one separates out sub populations: the distribution of heights among people is better modeled by considering men and women as separate sub populations, for instance.

Populations consisting of sub populations can be modeled by

bimodal distribution; otherwise, it simply has a wide peak. Further, it will exhibit overdispersion
relative to a single normal distribution with the given variation. Alternatively, given two sub populations with the same mean but different standard deviations, the overall population will exhibit high kurtosis, with a sharper peak and heavier tails (and correspondingly shallower shoulders) than a single distribution.

See also

References

  1. ^ "Glossary of statistical terms: Population". Statistics.com. Retrieved 22 February 2016.
  2. ^ Weisstein, Eric W. "Statistical population". MathWorld.
  3. ISBN 978-0-7167-4773-4. Archived from the original
    on 2005-02-09.
  4. ^ "Glossary of statistical terms: Sample". Statistics.com. Retrieved 22 February 2016.
  5. .
  6. ^ Elementary Statistics by Robert R. Johnson and Patricia J. Kuby, p. 279
  7. ^ Weisstein, Eric W. "Population Mean". mathworld.wolfram.com. Retrieved 2020-08-21.
  8. ^ Schaum's Outline of Theory and Problems of Probability by Seymour Lipschutz and Marc Lipson, p. 141

External links