基于NGINAR(p)模型的贝叶斯非参数预测
Bayesian Nonparametric Forecasting for NGINAR(p) Models
摘要: 本文将Bisaglia和Canale (2016)提出的贝叶斯非参数预测框架从INAR模型拓展至基于负二项稀疏算子的广义p阶整数值自回归模型(NGINAR(p))。该拓展利用负二项稀疏算子刻画高阶依赖结构与过度离散特征,并采用舍入高斯混合先验对创新项分布进行非参数建模,从而灵活捕捉计数数据中普遍存在的多峰、偏态等复杂分布形态。针对高阶模型的结构复杂性,构建了基于数据增强的Gibbs采样算法,实现自回归系数与创新项分布的联合后验推断,并直接获得整数值的h步向前预测分布。模拟研究与中国某地区月度总云量数据的实证分析表明,该方法在不同样本量、自回归系数及创新项分布设定下均表现出良好的预测精度与稳定性,在处理高阶复杂计数时间序列时具有显著的适应性与有效性。
Abstract: This paper extends the Bayesian nonparametric prediction framework proposed by Bisaglia and Canale (2016) from the INAR model to the generalized p-order integer-valued autoregressive model based on the negative binomial thinning operator (NGINAR(p)). This extension utilizes the negative binomial thinning operator to describe the high-order dependency structure and overdispersion characteristics, and employs a rounded Gaussian mixture prior for nonparametric modeling of the innovation term distribution, thereby flexibly capturing complex distributional forms such as multimodality and skewness commonly found in count data. To address the structural complexity of high-order models, a data augmentation-based Gibbs sampling algorithm is constructed to jointly infer the posterior distribution of the autoregressive coefficients and the innovation term distribution, and directly obtain the h-step ahead predictive distribution of integer values. Simulation studies and empirical analysis of monthly total cloud cover data from a certain region in China demonstrate that this method exhibits excellent prediction accuracy and stability under different sample sizes, autoregressive coefficient settings, and innovation term distribution assumptions, and shows significant adaptability and effectiveness in handling high-order complex count time series.
文章引用:姜姝, 卢飞龙. 基于NGINAR(p)模型的贝叶斯非参数预测[J]. 应用数学进展, 2026, 15(5): 511-522. https://doi.org/10.12677/aam.2026.155247

参考文献

[1] Steutel, F.W. and van Harn, K. (1979) Discrete Analogues of Self-Decomposability and Stability. The Annals of Probability, 7, 893-899.
[2] Al-Osh, M.A. and Alzaid, A.A. (1987) First-Order Integer-Valued Autoregressive (INAR(1)) Process. Journal of Time Series Analysis, 8, 261-275. [Google Scholar] [CrossRef
[3] Mckenzie, E. (1988) Some ARMA Models for Dependent Sequences of Poisson Counts. Advances in Applied Probability, 20, 822-835. [Google Scholar] [CrossRef
[4] Du, J.G. and Li, Y. (1991) The Integer-Valued Autoregressive (INAR(p)) Model. Journal of Time Series Analysis, 12, 129-142. [Google Scholar] [CrossRef
[5] Al-Osh, M. and Alzaid, A.A. (1988) Integer-Valued Moving Average (INMA) Process. Statistical Papers, 29, 281-300. [Google Scholar] [CrossRef
[6] Ferland, R., Latour, A. and Oraichi, D. (2006) Integer-Valued GARCH Process. Journal of Time Series Analysis, 27, 923-942. [Google Scholar] [CrossRef
[7] Fokianos, K., Rahbek, A. and Tjøstheim, D. (2009) Poisson Autoregression. Journal of the American Statistical Association, 104, 1430-1439. [Google Scholar] [CrossRef
[8] Weiß, C.H. (2018) An Introduction to Discrete-Valued Time Series. Wiley. [Google Scholar] [CrossRef
[9] Ristić, M.M., Bakouch, H.S. and Nastić, A.S. (2009) A New Geometric First-Order Integer-Valued Autoregressive (NGINAR(1)) Process. Journal of Statistical Planning and Inference, 139, 2218-2226. [Google Scholar] [CrossRef
[10] Nastić, A.S., Ristić, M.M. and Bakouch, H.S. (2012) A Combined Geometric INAR(p) Model Based on Negative Binomial Thinning. Mathematical and Computer Modelling, 55, 1665-1672. [Google Scholar] [CrossRef
[11] Bisaglia, L. and Canale, A. (2016) Bayesian Nonparametric Forecasting for INAR Models. Computational Statistics & Data Analysis, 100, 70-78. [Google Scholar] [CrossRef
[12] Canale, A. and Dunson, D.B. (2011) Bayesian Kernel Mixtures for Counts. Journal of the American Statistical Association, 106, 1528-1539. [Google Scholar] [CrossRef] [PubMed]
[13] Lo, A.Y. (1984) On a Class of Bayesian Nonparametric Estimates: I. Density Estimates. The Annals of Statistics, 12, 351-357. [Google Scholar] [CrossRef
[14] Escobar, M.D. and West, M. (1995) Bayesian Density Estimation and Inference Using Mixtures. Journal of the American Statistical Association, 90, 577-588. [Google Scholar] [CrossRef
[15] Johnson, N.L., Kemp, A.W. and Kotz, S. (2005) Univariate Discrete Distributions. Wiley. [Google Scholar] [CrossRef
[16] Fokianos, K. and Kedem, B. (2004) Partial Likelihood Inference for Time Series Following Generalized Linear Models. Journal of Time Series Analysis, 25, 173-197. [Google Scholar] [CrossRef
[17] Sethuraman, J. (1994) A Constructive Definition of Dirichlet Priors. Statistica Sinica, 4, 639-650.
[18] Metropolis, N., Rosenbluth, A.W., Rosenbluth, M.N., Teller, A.H. and Teller, E. (1953) Equation of State Calculations by Fast Computing Machines. The Journal of Chemical Physics, 21, 1087-1092. [Google Scholar] [CrossRef
[19] Hastings, W.K. (1970) Monte Carlo Sampling Methods Using Markov Chains and Their Applications. Biometrika, 57, 97-109.
[20] Poisson, S.D. (1837) Recherches sur la probabilité des jugements en matière criminelle et en matière civile: Précédées des règles générales du calcul des probabilités. Bachelier.
[21] Greenwood, M. and Yule, G.U. (1920) An Inquiry into the Nature of Frequency Distributions Representative of Multiple Happenings with Particular Reference to the Occurrence of Multiple Attacks of Disease or of Repeated Accidents. Journal of the Royal Statistical Society, 83, 255-279. [Google Scholar] [CrossRef
[22] Bernoulli, J. and Sheynin, O. (2005) On the Law of Large Numbers. NG Verlag.
[23] Brier, G.W. (1950) Verification of Forecasts Expressed in Terms of Probability. Monthly Weather Review, 78, 1-3. [Google Scholar] [CrossRef
[24] Gneiting, T. and Raftery, A.E. (2007) Strictly Proper Scoring Rules, Prediction, and Estimation. Journal of the American Statistical Association, 102, 359-378. [Google Scholar] [CrossRef
[25] Binder, K. and Heermann, D.W. (1992) Monte Carlo Simulation in Statistical Physics. Springer.
[26] Kroese, D.P., Taimre, T. and Botev, Z.I. (2013) Handbook of Monte Carlo Methods. Wiley.
[27] Hershey, J.R. and Olsen, P.A. (2007) Approximating the Kullback Leibler Divergence between Gaussian Mixture Models. 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, Honolulu, 15-20 April 2007, 317-320. [Google Scholar] [CrossRef