多元零膨胀几何分布的参数估计
Parameter Estimation for the Multivariate Zero-Inflated Geometric Distribution
摘要: 本文探讨了多元计数数据中常见的零膨胀现象,并提出了一个针对多元零膨胀几何(ZIG)分布的理论与回归框架。该模型能够有效区分结构性零和额外零,适用于多元建模场景。文章系统推导了该分布的联合概率函数、累积分布函数、矩特征以及条件分布性质,基于期望最大化(EM)算法和Fisher评分算法建立了参数估计方法,并讨论了假设检验和置信区间构建问题。模拟研究表明,所提出的估计方法具有良好的有限样本性质。
Abstract: This paper addresses the common zero inflation phenomenon in multivariate count data and proposes a theoretical and regression framework for a multivariate zero-inflated geometric (ZIG) distribution. This model can effectively distinguish between structural zeros and extra zeros and is applicable to multivariate modeling scenarios. The article systematically derived the joint probability function, cumulative distribution function, moment characteristics, and conditional distribution properties of this distribution, established parameter estimation methods based on the EM algorithm and Fisher scoring algorithm, and discussed hypothesis testing and confidence interval construction issues. Simulation studies show that the proposed estimation method has good finite sample properties.
文章引用:连禹晴, 卢飞龙. 多元零膨胀几何分布的参数估计[J]. 应用数学进展, 2026, 15(5): 544-557. https://doi.org/10.12677/aam.2026.155250

参考文献

[1] Cameron, A.C. and Trivedi, P.K. (2013) Regression Analysis of Count Data. 2nd Edition, Cambridge University Press. [Google Scholar] [CrossRef
[2] Warton, D.I. (2005) Many Zeros Does Not Mean Zero Inflation: Comparing the Goodness-Of-Fit of Parametric Models to Multivariate Abundance Data. Environmetrics, 16, 275-289. [Google Scholar] [CrossRef
[3] Böhning, D., Dietz, E., Schlattmann, P., Mendonça, L. and Kirchner, U. (1999) The Zero-Inflated Poisson Model and the Decayed, Missing and Filled Teeth Index in Dental Epidemiology. Journal of the Royal Statistical Society Series A: Statistics in Society, 162, 195-209. [Google Scholar] [CrossRef
[4] Welsh, A.H., Cunningham, R.B., Donnelly, C.F. and Lindenmayer, D.B. (1996) Modelling the Abundance of Rare Species: Statistical Models for Counts with Extra Zeros. Ecological Modelling, 88, 297-308. [Google Scholar] [CrossRef
[5] Crépon, B. and Duguet, E. (1997) Estimating the Innovation Function from Patent Numbers: GMM on Count Panel Data. Journal of Applied Econometrics, 12, 243-263. [Google Scholar] [CrossRef
[6] Hu, M., Pavlicova, M. and Nunes, E.V. (2011) Zero-Inflated and Hurdle Models of Count Data with Extra Zeros: Examples from an HIV-Risk Reduction Intervention Trial. The American Journal of Drug and Alcohol Abuse, 37, 367-375. [Google Scholar] [CrossRef] [PubMed]
[7] Min, Y. and Agresti, A. (2005) Random Effect Models for Repeated Measures of Zero-Inflated Count Data. Statistical Modelling, 5, 1-19. [Google Scholar] [CrossRef
[8] Greene, W.H. (1994) Accounting for Excess Zeros and Sample Selection in Poisson and Negative Binomial Regression Models. New York University.
[9] Lambert, D. (1992) Zero-Inflated Poisson Regression, with an Application to Defects in Manufacturing. Technometrics, 34, 1-14. [Google Scholar] [CrossRef
[10] Ridout, M., Demétrio, C.G.B. and Hinde, J. (1998) Models for Count Data with Many Zeros. Proceedings of the XIX International Biometric Conference, Cape Town, 14-18 December 1998, 179-192.
[11] Bakouch, H.S. and Ristić, M.M. (2010) Zero Truncated Poisson Integer-Valued AR(1) Model. Metrika, 72, 265-280. [Google Scholar] [CrossRef
[12] Jazi, M.A., Jones, G. and Lai, C. (2012) First‐Order Integer Valued AR Processes with Zero Inflated Poisson Innovations. Journal of Time Series Analysis, 33, 954-963. [Google Scholar] [CrossRef
[13] Dietz, E. and Böhning, D. (2000) On Estimation of the Poisson Parameter in Zero-Modified Poisson Models. Computational Statistics & Data Analysis, 34, 441-459. [Google Scholar] [CrossRef
[14] Srisuradetchai, P. and Dangsupa, K. (2023) On Interval Estimation of the Geometric Parameter in a Zero-Inflated Geometric Distribution. Thailand Statistician, 21, 93-109.
[15] Mallick, A. and Joshi, R. (2018) Parameter Estimation and Application of Generalized Inflated Geometric Distribution. Journal of Statistical Theory and Applications, 17, 491-519. [Google Scholar] [CrossRef
[16] Pandya, M., Pandya, H. and Pandya, S. (2012) Bayesian Inference on Mixture of Geometric with Degenerate Distribution: Zero Inflated Geometric Distribution. International Journal of Research and Reviews in Applied Sciences, 13, 53-66.
[17] Liu, Y. and Tian, G.L. (2015) Type I Multivariate Zero-Inflated Poisson Distribution with Applications. Computational Statistics & Data Analysis, 83, 200-222. [Google Scholar] [CrossRef
[18] Zhang, P., Chen, Z., Tzougas, G., et al. (2025) Multivariate Zero-Inflated INAR(1) Model with an Application in Automobile Insurance. North American Actuarial Journal, 29, 310-328. [Google Scholar] [CrossRef
[19] Self, S.G. and Liang, K. (1987) Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions. Journal of the American Statistical Association, 82, 605-610. [Google Scholar] [CrossRef
[20] Feng, Z. and McCulloch, C.E. (1992) Statistical Inference Using Maximum Likelihood Estimation and the Generalized Likelihood Ratio When the True Parameter Is on the Boundary of the Parameter Space. Statistics & Probability Letters, 13, 325-332. [Google Scholar] [CrossRef
[21] Jansakul, N. and Hinde, J.P. (2002) Score Tests for Zero-Inflated Poisson Models. Computational Statistics & Data Analysis, 40, 75-96. [Google Scholar] [CrossRef
[22] Li, C., Lu, J., Park, J., Kim, K., Brinkley, P.A. and Peterson, J.P. (1999) Multivariate Zero-Inflated Poisson Models and Their Applications. Technometrics, 41, 29-38. [Google Scholar] [CrossRef