Quasi-maximum likelihood estimate

In statistics a quasi-maximum likelihood estimate (QMLE), also known as a pseudo-likelihood estimate or a composite likelihood estimate, is an estimate of a parameter θ in a statistical model that is formed by maximizing a function that is related to the logarithm of the likelihood function, but in discussing the consistency and (asymptotic) variance-covariance matrix, we assume some parts of the distribution may be mis-specified.[1][2] In contrast, the maximum likelihood estimate maximizes the actual log likelihood function for the data and model. The function that is maximized to form a QMLE is often a simplified form of the actual log likelihood function. A common way to form such a simplified function is to use the log-likelihood function of a misspecified model that treats certain data values as being independent, even when in actuality they may not be. This removes any parameters from the model that are used to characterize these dependencies. Doing this only makes sense if the dependency structure is a nuisance parameter with respect to the goals of the analysis.

As long as the quasi-likelihood function that is maximized is not oversimplified, the QMLE (or composite likelihood estimate) is consistent and asymptotically normal. It is less efficient than the maximum likelihood estimate, but may only be slightly less efficient if the quasi-likelihood is constructed so as to minimize the loss of information relative to the actual likelihood.[3] Standard approaches to statistical inference that are used with maximum likelihood estimates, such as the formation of confidence intervals, and statistics for model comparison,[4] can be generalized to the quasi-maximum likelihood setting.

Pooled QMLE for Poisson models

Pooled QMLE is a technique that allows estimating parameters when panel data is available with Poisson outcomes. For instance, one might have information on the number of patents files by a number of different firms over time. Pooled QMLE does not necessarily contain unobserved effects (which can be either random effects or fixed effects), and the estimation method is mainly proposed for these purposes. The computational requirements are less stringent, especially compared to fixed-effect Poisson models, but the trade off is the possibly strong assumption of no unobserved heterogeneity. Pooled refers to pooling the data over the different time periods 'T, while QMLE refers to the quasi-maximum likelihood technique.

The Poisson distribution of given is specified as follows:[5]

the starting point for Poisson pooled QMLE is the conditional mean assumption. Specifically, we assume that for some in a compact parameter space B, the conditional mean is given by[5]

The compact parameter space condition is imposed to enable the use of M-estimation techniques, while the conditional mean reflects the fact that the population mean of a Poisson process is the parameter of interest. In this particular case, the parameter governing the Poisson process is allowed to vary with respect to the vector .[5] The function m can, in principle, change over time even though it is often specified as static over time.[6] Note that only the conditional mean function is specified, and we will get consistent estimates of as long as this mean condition is correctly specified. This leads to the following first order condition, which represents the quasi-log likelihood for the pooled Poisson estimation:[5]

A popular choice is , as Poisson processes are defined over the positive real line.[6] This reduces the conditional moment to an exponential index function, where is the linear index and exp is the link function.[7]

See also

References

  1. Lindsay, Bruce G. (1988). "Composite likelihood methods". Statistical inference from stochastic processes (Ithaca, NY, 1987). Contemporary Mathematics. 80. Providence, RI: American Mathematical Society. pp. 221–239. doi:10.1090/conm/080/999014. MR 0999014.
  2. MacKinnon, James (2004). Econometric Theory and Methods. New York, New York: Oxford University Press. ISBN 978-0-19-512372-2.
  3. Cox, D.R.; Reid, Nancy (2004). "A note on pseudo-likelihood constructed from marginal densities". Biometrika. 91 (3): 729–737. CiteSeerX 10.1.1.136.7476. doi:10.1093/biomet/91.3.729.
  4. Varin, Cristiano; Vidoni, Paolo (2005). "A note on composite likelihood inference and model selection" (PDF). Biometrika. 92 (3): 519–528. doi:10.1093/biomet/92.3.519.
  5. Cameron, C. A. and P. K. Trivedi (2015) Count Panel Data, Oxford Handbook of Panel Data, ed. by B. Baltagi, Oxford University Press, pp. 233–256
  6. Wooldridge, J. (2002): Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass.
  7. McCullagh, P. and J. A. Nelder (1989): Generalized Linear Models, CRC Monographs on Statistics and Applied Probability (Book 37), 2nd Edition, Chapman and Hall, London.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.