Endogeneity (econometrics)
![]() | This article may be too technical for most readers to understand.(January 2023) |
In
Besides simultaneity, correlation between explanatory variables and the error term can arise when an unobserved or
Exogeneity versus endogeneity
In a
When the explanatory variables are not stochastic, then they are strong exogenous for all the parameters.
If the
Static models
The following are some common sources of endogeneity.
Omitted variable
In this case, the endogeneity comes from an uncontrolled
Assume that the "true" model to be estimated is
but is omitted from the regression model (perhaps because there is no way to measure it directly). Then the model that is actually estimated is
where (thus, the term has been absorbed into the error term).
If the correlation of and is not 0 and separately affects (meaning ), then is correlated with the error term .
Here, is not exogenous for and , since, given , the distribution of depends not only on and , but also on and .
Measurement error
Suppose that a perfect measure of an independent variable is impossible. That is, instead of observing , what is actually observed is where is the measurement error or "noise". In this case, a model given by
can be written in terms of observables and error terms as
Since both and depend on , they are correlated, so the OLS estimation of will be biased downward.
Measurement error in the dependent variable, , does not cause endogeneity, though it does increase the variance of the error term.
Simultaneity
Suppose that two variables are codetermined, with each affecting the other according to the following "structural" equations:
Estimating either equation by itself results in endogeneity. In the case of the first structural equation, . Solving for while assuming that results in
- .
Assuming that and are uncorrelated with ,
- .
Therefore, attempts at estimating either structural equation will be hampered by endogeneity.
Dynamic models
The endogeneity problem is particularly relevant in the context of
Let the model be y = f(x, z) + u. If the variable x is sequential exogenous for parameter , and y does not cause x in the Granger sense, then the variable x is strongly/strictly exogenous for the parameter .
Simultaneity
Generally speaking, simultaneity occurs in the dynamic model just like in the example of static simultaneity above.
See also
- Virtuous circle and vicious circle
- Heterogeneity
- Dependent and independent variables
Footnotes
- exogenous change on the demand curve.
References
- ISBN 978-0-324-66054-8.
- ISBN 0-02-365070-2.
- ISSN 1048-9843.
- ISBN 0-07-032679-7.
Further reading
- Greene, William H. (2012). Econometric Analysis (Sixth ed.). Upper Saddle River: Pearson. ISBN 978-0-13-513740-6.
- Kennedy, Peter (2008). A Guide to Econometrics (Sixth ed.). Malden: Blackwell. p. 139. ISBN 978-1-4051-8257-7.
- ISBN 0-02-365070-2.