An equivalence result for moment equations when data are missing at random

ISSN 2475-4269

CN 31-2182/O1

Valentin Patilea

Univ Rennes, Ensai, CNRS, CREST-UMR 9194, Rennes, France

valentin.patilea@ensai.fr

Pages 199-207 | Received 19 Dec. 2018, Accepted 21 Sep. 2019, Published online: 09 Oct. 2019,

Abstract
Full Article
References
Citations

ABSTRACT

We consider general statistical models defined by moment equations when data are missing at random. Using the inverse probability weighting, such a model is shown to be equivalent with a model for the observed variables only, augmented by a moment condition defined by the missing mechanism. Our framework covers a large class of parametric and semiparametric models where we allow for missing responses, missing covariates and any combination of them. The equivalence result is stated under minimal technical conditions and sheds new light on various aspects of interest in the missing data literature, as for instance the efficiency bounds and the construction of the efficient estimators, the restricted estimators and the imputation.

References

Ai, C., & Chen, X. (2003). Efficient estimation of models with conditional moment restrictions containing unknown functions. Econometrica, 71, 1795–1843. doi: 10.1111/1468-0262.00470 [Crossref], [Web of Science ®], [Google Scholar]
Ai, C., & Chen, X. (2007). Estimation of possibly misspecified semiparametric conditional moment restriction models with different conditioning variables. Journal of Econometrics, 141, 5–43. doi: 10.1016/j.jeconom.2007.01.013 [Crossref], [Web of Science ®], [Google Scholar]
Ai, C., & Chen, X. (2012). The semiparametric efficiency bound for models of sequential moment restrictions containing unknown functions. Journal of Econometrics, 170, 442–457. Thirtieth Anniversary of Generalized Method of Moments. doi: 10.1016/j.jeconom.2012.05.015 [Crossref], [Web of Science ®], [Google Scholar]
Chen, X., Hong, H., & Tarozzi, A. (2008). Semiparametric efficiency in GMM models with auxiliary data. The Annals of Statistics, 36, 808–843. doi: 10.1214/009053607000000947 [Crossref], [Web of Science ®], [Google Scholar]
Chen, S. X., & Van Keilegom, I. (2013). Estimation in semiparametric models with missing data. Annals of the Institute of Statistical Mathematics, 65, 785–805. doi: 10.1007/s10463-012-0393-6 [Crossref], [Web of Science ®], [Google Scholar]
Chen, X., Wan, A. T. K., & Zhou, Y. (2014). Efficient quantile regression analysis with missing observations. Journal of the American Statistical Association, 110(510), 723–741. doi: 10.1080/01621459.2014.928219 [Taylor & Francis Online], [Web of Science ®], [Google Scholar]
Chen, X., Wan, A. T. K., & Zhou, Y. (2015). Efficient quantile regression analysis with missing observations. Journal of the American Statistical Association, 110, 723–741. doi: 10.1080/01621459.2014.928219 [Taylor & Francis Online], [Web of Science ®], [Google Scholar]
Cheng, P. E. (1994). Nonparametric estimation of mean functionals with data missing at random. Journal of the American Statistical Association, 89, 81–87. doi: 10.1080/01621459.1994.10476448 [Taylor & Francis Online], [Web of Science ®], [Google Scholar]
Domínguez, M. A., & Lobato, I. N. (2004). Consistent estimation of models defined by conditional moment restrictions. Econometrica, 72, 1601–1615. doi: 10.1111/j.1468-0262.2004.00545.x [Crossref], [Web of Science ®], [Google Scholar]
Graham, B. S. (2011). Efficiency bounds for missing data models with semiparametric restrictions. Econometrica, 79, 437–452. doi: 10.3982/ECTA7379 [Crossref], [Web of Science ®], [Google Scholar]
Heitjan, D. F., & Rubin, D. B. (1991). Ignorability and coarse data. The Annals of Statistics, 19, 2244–2253. doi: 10.1214/aos/1176348396 [Crossref], [Web of Science ®], [Google Scholar]
Hristache, M., & Patilea, V. (2016). Semiparametric efficiency bounds for conditional moment restriction models with different conditioning variables. Econometric Theory, 32, 917–946. doi: 10.1017/S0266466615000080 [Crossref], [Web of Science ®], [Google Scholar]
Hristache, M., & Patilea, V. (2017). Conditional moment models with data missing at random. Biometrika, 104, 735–742. doi: 10.1093/biomet/asx025 [Crossref], [Web of Science ®], [Google Scholar]
Lavergne, P., & Patilea, V. (2013). Smooth minimum distance estimation and testing with conditional estimating equations: uniform in bandwidth theory. Journal of Econometrics, 177, 47–59. doi: 10.1016/j.jeconom.2013.05.006 [Crossref], [Web of Science ®], [Google Scholar]
Little, R., & Rubin, D. (2002). Statistical analysis with missing data. Wiley series in probability and mathematical statistics. Probability and mathematical statistics. John Wiley & Sons, Inc., Hoboken, New Jersey. [Google Scholar]
Müller, U. U. (2009). Estimating linear functionals in nonlinear regression with responses missing at random. The Annals of Statistics, 37, 2245–2277. doi: 10.1214/08-AOS642 [Crossref], [Web of Science ®], [Google Scholar]
Prokhorov, A., & Schmidt, P. (2009). GMM redundancy results for general missing data problems. Journal of Econometrics, 151, 47–55. doi: 10.1016/j.jeconom.2009.03.010 [Crossref], [Web of Science ®], [Google Scholar]
Robins, J. M., & Gill, R. D. (1997). Non-response models for the analysis of non-monotone ignorable missing data. Statistics in Medicine, 16, 39–56. doi: 10.1002/(SICI)1097-0258(19970115)16:1<39::AID-SIM535>3.0.CO;2-D [Crossref], [Web of Science ®], [Google Scholar]
Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika, 70, 41–55. doi: 10.1093/biomet/70.1.41 [Crossref], [Web of Science ®], [Google Scholar]
Rubin, D. B. (1976). Inference and missing data. Biometrika, 63, 581–592. doi: 10.1093/biomet/63.3.581 [Crossref], [Web of Science ®], [Google Scholar]
Tan, Z. (2011). Efficient restricted estimators for conditional mean models with missing data. Biometrika, 98, 663–684. doi: 10.1093/biomet/asr007 [Crossref], [Web of Science ®], [Google Scholar]
Tsiatis, A. (2007). Semiparametric theory and missing data. New York: Springer-Verlag. [Google Scholar]
van der Laan, M. J., & Robins, J. M. (2003). Unified methods for censored longitudinal data and causality. New York: Springer-Verlag. [Crossref], [Google Scholar]
Wang, D., & Chen, S. X. (2009). Empirical likelihood for estimating equations with missing values. The Annals of Statistics, 37, 490–517. doi: 10.1214/07-AOS585 [Crossref], [Web of Science ®], [Google Scholar]
Wei, Y., Ma, Y., & Carroll, R. J. (2012). Multiple imputation in quantile regression. Biometrika, 99, 423–438. doi: 10.1093/biomet/ass007 [Crossref], [Web of Science ®], [Google Scholar]
Wooldridge, J. M. (2007). Inverse probability weighted estimation for general missing data problems. Journal of Econometrics, 141, 1281–1301. doi: 10.1016/j.jeconom.2007.02.002 [Crossref], [Web of Science ®], [Google Scholar]

Archives

References

Authors

About the Journal

Links

Search

Archives