Panel analysis

Panel (data) analysis is a statistical method, widely used in social science, epidemiology, and econometrics, to analyze two-dimensional (typically cross-sectional and longitudinal) panel data.^[1] The data are usually collected over time for the same individuals, and a regression is then run over these two dimensions. Multidimensional analysis is an econometric method in which data are collected over more than two dimensions (typically time, individuals, and some third dimension).^[2]

A common panel data regression model is $y_{it}=a+bx_{it}+\varepsilon _{it}$, where $y$ is the dependent variable, $x$ is the independent variable, $a$ and $b$ are coefficients, and $i$ and $t$ index individuals and time. Assumptions about the error term $\varepsilon _{it}$ determine whether the model is a fixed effects or a random effects model. In a fixed effects model, $\varepsilon _{it}$ is assumed to vary non-stochastically over $i$ or $t$, which makes the model analogous to a dummy variable model in one dimension. In a random effects model, $\varepsilon _{it}$ is assumed to vary stochastically over $i$ or $t$, which requires special treatment of the error variance matrix.^[3]
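
As a sketch of how these assumptions are often written down (the decomposition and the symbols $\mu_i$, $\nu_{it}$ and $\sigma_\mu^2$ are assumptions introduced here for illustration, not notation from the text above), the error can be split into an individual-specific part and an idiosyncratic part:

$$\varepsilon_{it} = \mu_i + \nu_{it}, \qquad y_{it} = a + b x_{it} + \mu_i + \nu_{it}.$$

Under this reading, a fixed effects model treats each $\mu_i$ as a constant to be estimated (one dummy variable per individual). A random effects model treats $\mu_i$ as a random draw with variance $\sigma_\mu^2$. If $\nu_{it}$ is additionally assumed to be uncorrelated over time and with $\mu_i$, errors of the same individual are correlated, with $\operatorname{Cov}(\varepsilon_{it}, \varepsilon_{is}) = \sigma_\mu^2$ for $t \neq s$, which is why the error variance matrix needs special treatment.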

Panel data analysis has three more-or-less independent approaches:

independently pooled panels;
random effects models;
fixed effects models or first differenced models.

The choice among these methods depends on the objective of the analysis and on whether the explanatory variables can be treated as exogenous.

Independently pooled panels

Key assumption: There are no unique attributes of individuals within the measurement set, and no universal effects across time.
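
A minimal sketch of what pooling means in practice, assuming a balanced panel stored as NumPy arrays of shape (N, T): all observations are stacked and a single ordinary least squares regression is run, ignoring which individual or period each observation comes from. The function name `pooled_ols` and the simulated values are hypothetical and chosen only for illustration.

```python
import numpy as np

def pooled_ols(y, x):
    """Pooled OLS of y on a constant and x, ignoring the panel structure.

    y, x: arrays of shape (N, T) for N individuals observed over T periods.
    Returns the estimates (a_hat, b_hat) of the model y = a + b*x + error.
    """
    y_flat = np.asarray(y, dtype=float).ravel()
    x_flat = np.asarray(x, dtype=float).ravel()
    # Design matrix: a column of ones for the intercept, then the regressor.
    X = np.column_stack([np.ones_like(x_flat), x_flat])
    coef, *_ = np.linalg.lstsq(X, y_flat, rcond=None)
    return float(coef[0]), float(coef[1])

# Simulated panel with no individual or time effects (illustrative values).
rng = np.random.default_rng(0)
N, T = 200, 5
x = rng.normal(size=(N, T))
y = 1.0 + 2.0 * x + rng.normal(scale=0.5, size=(N, T))

a_hat, b_hat = pooled_ols(y, x)
print(f"a_hat = {a_hat:.3f}, b_hat = {b_hat:.3f}")
```

Because the simulated data satisfy the pooling assumption, the estimates should land close to the true values of 1 and 2 used in the simulation.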

Fixed effects models

Key assumption: There are unique attributes of individuals that do not vary across time. These attributes may or may not be correlated with the individual regressors. To test whether fixed effects, rather than random effects, are needed, the Wu-Hausman test can be used.
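
One common way to estimate a fixed effects model is the within (demeaning) transformation, sketched below for a balanced panel with a single regressor. Subtracting each individual's time average removes any time-constant attribute, whether or not it is correlated with the regressor. The function name `within_estimator`, the variable `mu` for the individual attribute, and the simulated values are assumptions made for this illustration.

```python
import numpy as np

def within_estimator(y, x):
    """Fixed effects (within) slope estimate for a balanced panel.

    y, x: arrays of shape (N, T). Each individual's time mean is
    subtracted, which wipes out any time-constant individual effect.
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    y_dm = y - y.mean(axis=1, keepdims=True)
    x_dm = x - x.mean(axis=1, keepdims=True)
    return float((x_dm * y_dm).sum() / (x_dm ** 2).sum())

# Simulated panel in which the individual effect mu is correlated with x.
rng = np.random.default_rng(1)
N, T = 300, 6
mu = rng.normal(size=(N, 1))                    # time-constant attribute
x = 0.8 * mu + rng.normal(size=(N, T))          # regressor depends on mu
y = 2.0 * x + mu + rng.normal(scale=0.5, size=(N, T))

b_fe = within_estimator(y, x)

# Pooled OLS slope for comparison; it absorbs part of mu into the slope.
x_c, y_c = x - x.mean(), y - y.mean()
b_pooled = float((x_c * y_c).sum() / (x_c ** 2).sum())

print(f"within slope = {b_fe:.3f}, pooled slope = {b_pooled:.3f}")
```

In this simulation the true slope is 2. The within estimate should be close to it, while the pooled slope is pushed upward because the omitted attribute is positively correlated with the regressor.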

Random effects models

Key assumption: There are unique, time-constant attributes of individuals that are not correlated with the individual regressors. Under this assumption, pooled OLS can be used to derive unbiased and consistent estimates of parameters even when time-constant attributes are present, but random effects will be more efficient.

Random effects is a feasible generalised least squares technique which is asymptotically more efficient than pooled OLS when time-constant attributes are present. Random effects adjusts for the serial correlation which is induced by unobserved time-constant attributes.
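
The random effects adjustment described above can be sketched as a quasi-demeaning procedure for a balanced panel with one regressor. The variance components are estimated here from a within and a between regression, which is one textbook choice among several and not necessarily the method used in the cited references. The function name `random_effects_fgls`, the symbols `mu` and `theta`, and the simulated values are assumptions made for this illustration.

```python
import numpy as np

def random_effects_fgls(y, x):
    """Feasible GLS (quasi-demeaning) random effects estimate.

    y, x: arrays of shape (N, T). Returns (a_hat, b_hat, theta), where
    theta is the fraction of each individual mean that is subtracted.
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    N, T = y.shape
    y_bar = y.mean(axis=1, keepdims=True)
    x_bar = x.mean(axis=1, keepdims=True)

    # Step 1: within regression gives the idiosyncratic error variance.
    y_dm, x_dm = y - y_bar, x - x_bar
    b_within = (x_dm * y_dm).sum() / (x_dm ** 2).sum()
    sigma2_nu = ((y_dm - b_within * x_dm) ** 2).sum() / (N * (T - 1) - 1)

    # Step 2: between regression on individual means; its residual
    # variance estimates sigma2_mu + sigma2_nu / T.
    X_b = np.column_stack([np.ones(N), x_bar.ravel()])
    coef_b, *_ = np.linalg.lstsq(X_b, y_bar.ravel(), rcond=None)
    resid_b = y_bar.ravel() - X_b @ coef_b
    sigma2_between = (resid_b ** 2).sum() / (N - 2)
    sigma2_mu = max(sigma2_between - sigma2_nu / T, 0.0)

    # Step 3: quasi-demean and run OLS on the transformed data.
    theta = 1.0 - np.sqrt(sigma2_nu / (sigma2_nu + T * sigma2_mu))
    y_star = (y - theta * y_bar).ravel()
    X_star = np.column_stack([
        np.full(N * T, 1.0 - theta),       # transformed intercept column
        (x - theta * x_bar).ravel(),
    ])
    coef, *_ = np.linalg.lstsq(X_star, y_star, rcond=None)
    return float(coef[0]), float(coef[1]), float(theta)

# Simulated panel in which the individual effect mu is uncorrelated with x.
rng = np.random.default_rng(2)
N, T = 300, 6
mu = rng.normal(size=(N, 1))
x = rng.normal(size=(N, T))
y = 1.0 + 2.0 * x + mu + rng.normal(scale=0.5, size=(N, T))

a_re, b_re, theta = random_effects_fgls(y, x)
print(f"a = {a_re:.3f}, b = {b_re:.3f}, theta = {theta:.3f}")
```

A value of `theta` near 0 reduces the procedure to pooled OLS, and a value near 1 makes it approach the within (fixed effects) transformation, so the estimator sits between the two depending on how much of the error variance comes from the time-constant attribute.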

References

[1] Maddala, G. S. (2001). Introduction to Econometrics (Third ed.). New York: Wiley. ISBN 0-471-49728-2.
[2] Davies, A.; Lahiri, K. (1995). "A New Framework for Testing Rationality and Measuring Aggregate Shocks Using Panel Data". Journal of Econometrics. 68 (1): 205–227. doi:10.1016/0304-4076(94)01649-K.
[3] Hsiao, C.; Lahiri, K.; Lee, L.; et al., eds. (1999). Analysis of Panels and Limited Dependent Variable Models. Cambridge: Cambridge University Press. ISBN 0-521-63169-6.

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.
