Bonferroni correction

In statistics, the Bonferroni correction is a method to counteract the multiple comparisons problem.

Background

The method is named for its use of the

Bonferroni inequalities.^[1]

confidence intervals was proposed by Olive Jean Dunn.^[2]

The Bonferroni correction compensates for that increase by testing each individual hypothesis at a significance level of $\alpha /m$ , where $\alpha$ is the desired overall alpha level and $m$ is the number of hypotheses.[4] For example, if a trial is testing $m=20$ hypotheses with a desired overall $\alpha =0.05$ , then the Bonferroni correction would test each individual hypothesis at $\alpha =0.05/20=0.0025$ . Similarly, when constructing confidence intervals for $m$ parameters, each individual confidence interval can be computed at the $1-\alpha /m$ confidence level to achieve an overall confidence level of $1-\alpha$ .

The Bonferroni correction can also be applied as a p-value adjustment: Using that approach, instead of adjusting the alpha level, each p-value is multiplied by the number of tests (with adjusted p-values that exceed 1 then being reduced to 1), and the alpha level is left unchanged. The significance decisions reached will be the same as when using the alpha-level adjustment approach.

Definition

Let $H_{1},\ldots ,H_{m}$ be a family of null hypotheses and let $p_{1},\ldots ,p_{m}$ be their corresponding p-values. Let $m$ be the total number of null hypotheses, and let $m_{0}$ be the number of true null hypotheses (which is presumably unknown to the researcher). The family-wise error rate (FWER) is the probability of rejecting at least one true $H_{i}$ , that is, of making at least one

type I error

. The Bonferroni correction rejects the null hypothesis for each

p_{i}\leq {\frac {\alpha }{m}}

, thereby controlling the FWER at

\leq \alpha

. Proof of this control follows from Boole's inequality, as follows:

{\text{FWER}}=P\left\{\bigcup _{i=1}^{m_{0}}\left(p_{i}\leq {\frac {\alpha }{m}}\right)\right\}\leq \sum _{i=1}^{m_{0}}\left\{P\left(p_{i}\leq {\frac {\alpha }{m}}\right)\right\}=m_{0}{\frac {\alpha }{m}}\leq \alpha .

This control does not require any assumptions about dependence among the p-values or about how many of the null hypotheses are true.^[5]

Extensions

Generalization

Rather than testing each hypothesis at the $\alpha /m$ level, the hypotheses may be tested at any other combination of levels that add up to $\alpha$ , provided that the level of each test is decided before looking at the data.^[6] For example, for two hypothesis tests, an overall $\alpha$ of 0.05 could be maintained by conducting one test at 0.04 and the other at 0.01.

Confidence intervals

The procedure proposed by Dunn

confidence intervals

. If one establishes

m

confidence intervals, and wishes to have an overall confidence level of

1-\alpha

, each individual confidence interval can be adjusted to the level of

1-{\frac {\alpha }{m}}

.^[2]

Continuous problems

When searching for a signal in a continuous parameter space there can also be a problem of multiple comparisons, or look-elsewhere effect. For example, a physicist might be looking to discover a particle of unknown mass by considering a large range of masses; this was the case during the Nobel Prize winning detection of the Higgs boson. In such cases, one can apply a continuous generalization of the Bonferroni correction by employing Bayesian logic to relate the effective number of trials, $m$ , to the prior-to-posterior volume ratio.^[7]

Alternatives

There are alternative ways to control the

expected number of Type I errors per family (the per-family Type I error rate).^[8]

Criticism

With respect to FWER control, the Bonferroni correction can be conservative if there are a large number of tests and/or the test statistics are positively correlated.^[9]

The correction comes at the cost of increasing the probability of producing

statistical power.^[10]^[9] There is not a definitive consensus on how to define a family in all cases, and adjusted test results may vary depending on the number of tests included in the family of hypotheses.^{[citation needed]} Such criticisms apply to FWER

control in general, and are not specific to the Bonferroni correction.

References

^ Bonferroni, C. E., Teoria statistica delle classi e calcolo delle probabilità, Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commerciali di Firenze 1936
^
doi:10.1080/01621459.1961.10482090
.

ISBN 978-0-521-62394-0
.

ISBN 9781461381228
.

S2CID 22086583
.

PMID 8014990
.

S2CID 220830693
.

doi:10.22237/jmasm/1430453040
.

^
doi:10.1034/j.1600-0706.2003.12010.x
.

doi:10.1093/beheco/arh107
.

External links

Bonferroni, Sidak online calculator

Retrieved from "https://en.wikipedia.org/w/index.php?title=Bonferroni_correction&oldid=1220441047"

[1] Bonferroni, C. E., Teoria statistica delle classi e calcolo delle probabilità, Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commerciali di Firenze 1936

[Dunn1961-2] 
doi:10.1080/01621459.1961.10482090
.

[3] ISBN 978-0-521-62394-0
.

[4] ISBN 9781461381228
.

[5] S2CID 22086583
.

[pmid8014990-6] PMID 8014990
.

[Bayer2020-7] S2CID 220830693
.

[8] :10.22237/jmasm/1430453040
.

[Moran2003-9] 
doi:10.1034/j.1600-0706.2003.12010.x
.

[Nakagawa2004-10] :10.1093/beheco/arh107
.

[1]

[2]

[3]

[5]

[6]

[7]

[8]

[9]

[10]