Mathematical statistics

Statistics |
---|
![]() |
Part of a series on | ||
Mathematics | ||
---|---|---|
|
||
![]() | ||
Mathematical statistics is the application of
Introduction
Statistical data collection is concerned with the planning of studies, especially with the
Data analysis is divided into:
- descriptive statistics – the part of statistics that describes data, i.e. summarises the data and their typical properties.
- inferential statistics – the part of statistics that draws conclusions from data (using some model for the data): For example, inferential statistics involves selecting a model for the data, checking whether the data fulfill the conditions of a particular model, and with quantifying the involved uncertainty (e.g. using confidence intervals).
While the tools of data analysis work best on data from randomized studies, they are also applied to other kinds of data. For example, from
Topics
The following are some of the important topics in mathematical statistics:[6][7]
Probability distributions
A
A probability distribution can either be
Special distributions
- Normal distribution, the most common continuous distribution
- Bernoulli distribution, for the outcome of a single Bernoulli trial (e.g. success/failure, yes/no)
- independentoccurrences
- Negative binomial distribution, for binomial-type observations but where the quantity of interest is the number of failures before a given number of successes occurs
- Geometric distribution, for binomial-type observations but where the quantity of interest is the number of failures before the first success; a special case of the negative binomial distribution, where the number of successes is one.
- Discrete uniform distribution, for a finite set of values (e.g. the outcome of a fair die)
- Continuous uniform distribution, for continuously distributed values
- Poisson distribution, for the number of occurrences of a Poisson-type event in a given period of time
- Exponential distribution, for the time before the next Poisson-type event occurs
- Gamma distribution, for the time before the next k Poisson-type events occur
- sample variance of normally distributed samples (see chi-squared test)
- chi squared variable; useful for inference regarding the mean of normally distributed samples with unknown variance (see Student's t-test)
- Beta distribution, for a single probability (real number between 0 and 1); conjugate to the Bernoulli distribution and binomial distribution
Statistical inference
Statistical inference is the process of drawing conclusions from data that are subject to random variation, for example, observational errors or sampling variation.[8] Initial requirements of such a system of procedures for inference and induction are that the system should produce reasonable answers when applied to well-defined situations and that it should be general enough to be applied across a range of situations. Inferential statistics are used to test hypotheses and make estimations using sample data. Whereas descriptive statistics describe a sample, inferential statistics infer predictions about a larger population that the sample represents.
The outcome of statistical inference may be an answer to the question "what should be done next?", where this might be a decision about making further experiments or surveys, or about drawing a conclusion before implementing some organizational or governmental policy. For the most part, statistical inference makes propositions about populations, using data drawn from the population of interest via some form of random sampling. More generally, data about a random process is obtained from its observed behavior during a finite period of time. Given a parameter or hypothesis about which one wishes to make inference, statistical inference most often uses:
- a statistical model of the random process that is supposed to generate the data, which is known when randomization has been used, and
- a particular realization of the random process; i.e., a set of data.
Regression
In
Many techniques for carrying out regression analysis have been developed. Familiar methods, such as linear regression, are parametric, in that the regression function is defined in terms of a finite number of unknown parameters that are estimated from the data (e.g. using ordinary least squares). Nonparametric regression refers to techniques that allow the regression function to lie in a specified set of functions, which may be infinite-dimensional.
Nonparametric statistics
Nonparametric statistics are values calculated from data in a way that is not based on parameterized families of probability distributions. They include both descriptive and inferential statistics. The typical parameters are the expectations, variance, etc. Unlike parametric statistics, nonparametric statistics make no assumptions about the probability distributions of the variables being assessed.[9]
Non-parametric methods are widely used for studying populations that take on a ranked order (such as movie reviews receiving one to four stars). The use of non-parametric methods may be necessary when data have a
As non-parametric methods make fewer assumptions, their applicability is much wider than the corresponding parametric methods. In particular, they may be applied in situations where less is known about the application in question. Also, due to the reliance on fewer assumptions, non-parametric methods are more robust.
One drawback of non-parametric methods is that since they do not rely on assumptions, they are generally less
Another justification for the use of non-parametric methods is simplicity. In certain cases, even when the use of parametric methods is justified, non-parametric methods may be easier to use. Due both to this simplicity and to their greater robustness, non-parametric methods are seen by some statisticians as leaving less room for improper use and misunderstanding.
Statistics, mathematics, and mathematical statistics
Mathematical statistics is a key subset of the discipline of
Mathematicians and statisticians like
See also
References
- ISBN 978-0-387-21718-5.
- ISBN 0824706609.
- ISBN 0387945466.
- ISBN 978-0-521-67105-7
- ^ ISBN 978-0-521-12390-7.
- ^ Hogg, R. V., A. Craig, and J. W. McKean. "Intro to Mathematical Statistics." (2005).
- ^ Larsen, Richard J. and Marx, Morris L. "An Introduction to Mathematical Statistics and Its Applications" (2012). Prentice Hall.
- ISBN 978-0-19-954145-4
- ^ "Research Nonparametric Methods". Carnegie Mellon University. Retrieved August 30, 2022.
- ^ a b "Nonparametric Tests". sphweb.bumc.bu.edu. Retrieved 2022-08-31.
- )
- ^ Wald, Abraham (1950). Statistical Decision Functions. John Wiley and Sons, New York.
- ISBN 0-387-94919-4.
- ^
ISBN 0-387-98502-6.
- ^ Bickel, Peter J.; Doksum, Kjell A. (2001). Mathematical Statistics: Basic and Selected Topics. Vol. 1 (Second (updated printing 2007) ed.). Pearson Prentice-Hall.
- ISBN 0-387-96307-3.
- ^ Liese, Friedrich & Miescke, Klaus-J. (2008). Statistical Decision Theory: Estimation, Testing, and Selection. Springer.
Further reading
- ISBN 90-5699-018-7
- Virtual Laboratories in Probability and Statistics (Univ. of Ala.-Huntsville)
- StatiBot, interactive online expert system on statistical tests.
- Ray, Manohar; Sharma, Har Swarup (1966). Mathematical Statistics. Ram Prasad & Sons. ISBN 978-9383385188