Review Articles

MLE with datasets from populations having shared parameters

Jun Shao ,

School of Statistics, East China Normal University, Shanghai, People's Republic of China

Xinyan Wang

Department of Statistics, University of Wisconsin, Madison, WI, USA

xwang2587@wisc.edu

Pages | Received 04 Aug. 2022, Accepted 01 Feb. 2023, Published online: 04 Mar. 2023,
  • Abstract
  • Full Article
  • References
  • Citations

We consider maximum likelihood estimation with two or more datasets sampled from different populations with shared parameters. Although more datasets with shared parameters can increase statistical accuracy, this paper shows how to handle heterogeneity among different populations for correctness of estimation and inference. Asymptotic distributions of maximum likelihood estimators are derived under either regular cases where regularity conditions are satisfied or some non-regular situations. A bootstrap variance estimator for assessing performance of estimators and/or making large sample inference is also introduced and evaluated in a simulation study.

References

  • Efron, B., & Tibshirani, R. J. (1993). An introduction to the bootstrap. New York: Chapman and Halll/CRC. 
  • Kim, H. J., Wang, Z., & Kim, J. K (2021). Survey data integration for regression analysis using model calibration. arXiv 2107.06448.
  • Lohr, S. L., & Raghunathan, T. E. (2017). Combining survey data with other data sources. Statistical Science32(2), 293–312. https://doi.org/10.1214/16-STS584 
  • Merkouris, T. (2004). Combining independent regression estimators from multiple surveys. Journal of the American Statistical Association99(468), 1131–1139. https://doi.org/10.1198/016214504000000601 
  • Rao, J. N. K. (2021). On making valid inferences by integrating data from surveys and other sources. Sankhya B83(1), 242–272. https://doi.org/10.1007/s13571-020-00227-w 
  • Shao, J. (2003). Mathematical statistics. 2nd ed. Springer. 
  • Yang, S., & Kim, J. K. (2020). Statistical data integration in survey sampling: A review. Japanese Journal of Statistics and Data Science3(2), 625–650. https://doi.org/10.1007/s42081-020-00093-w 
  • Zhang, Y., Ouyang, Z., & Zhao, H. (2017). A statistical framework for data integration through graphical models with application to cancer genomics. The Annals of Applied Statistics11(1), 161–184. https://doi.org/10.1214/16-AOAS998 
  • Zieschang, K. D. (1990). Sample weighting methods and estimation of totals in the consumer expenditure survey. Journal of the American Statistical Association85(412), 986–1001. https://doi.org/10.1080/01621459.1990.10474969 

To cite this article: Jun Shao & Xinyan Wang (2023) MLE with datasets from populations having shared parameters, Statistical Theory and Related Fields, 7:3, 213-222, DOI: 10.1080/24754269.2023.2180185 To link to this article: https://doi.org/10.1080/24754269.2023.2180185