聚类复发事件加性模型的分段常数化估计
Estimation of an Additive Rate Model for Clustered Recurrent Events Based on the Piecewise Constant Method
DOI: 10.12677/AAM.2023.122057, PDF,   
作者: 郭 蒋:北京信息科技大学理学院,北京
关键词: 大型医疗数据聚类复发事件边际比率模型估计方程分段常数化
摘要: 研究大型的医疗数据具有重要意义,但大量数据会导致计算复杂度变高,尤其涉及到反复发生的疾病数据时。本文对大型数据的聚类复发事件提出了一个半参数边际加性比率函数模型。在进行参数估计时,随着数据量级的增加而导致未知参数增加,为了提高计算效率,对基准比率函数进行分段常数化。利用估计方程方法给出了模型参数的估计量的表达式。并证明估计量满足相合性和渐近正态性等大样本性质。通过数值模拟,验证了提出的估计方法,模拟结果表明对参数部分的估计结果较好。与对基准比率函数未做分段常数化的方法进行比较,发现分段常数化的方法偏差更小,而且计算时间显著缩短。最后将模型和方法应用到慢性肉芽肿病的数据中,将病人按照医院进行分类,找到影响各医院病人病情复发的显著因素和因素的影响方式。
Abstract: Studying large medical databases is important. However, the use of large databases may introduce computational difficulties, particularly when the event of interest is recurrent. A semiparametric marginal additive rate function model is proposed for large databases clustered recurrence events. In the parameter estimation, the number of baseline rate functions increases as the number of classes increases, leading to an increase in the parameters. The baseline rate functions were re-garded as piecewise constant during the estimation, which improved the computational efficiency. The expressions of the estimate of the model parameters were given using the estimating equation method. The estimators were proved consistent and asymptotic normality. The proposed estima-tion method is verified by numerical simulation, and the simulation results show that the estima-tion results of the parameter part are good, compared with the method without piecewise constant of the risk ratio function. It is found that the bias of the piecewise constant method is smaller, and the calculation time is significantly shortened. Finally, the model and method were applied to the data of chronic granulomatous disease, and the patients were classified by hospital to find out the significant factors affecting patients' disease recurrence in each hospital and the influencing ways of the factors.
文章引用:郭蒋. 聚类复发事件加性模型的分段常数化估计[J]. 应用数学进展, 2023, 12(2): 537-549. https://doi.org/10.12677/AAM.2023.122057

参考文献

[1] Adersen, P.K. and Gill, R.D. (1982) Cox’s Regression Model for Counting Processes: A Large Sample Study. Annals of Statistics, 10, 1100-1120. [Google Scholar] [CrossRef
[2] Zeng, D.L. and Lin, D.Y. (2006) Effi-cient Estimation of Semiparametric Transformation Models for Counting Processes. Biometrika, 93, 627-640. [Google Scholar] [CrossRef
[3] Lin, D.Y., Wei, L.J., Yang, I. and Ying, Z. (2000) Semiparametric Regression for the Mean and Rate Function of Recurrent Events. Journal of the Royal Statistical Society: Series B (Sta-tistical Methodology), 62, 711-730. [Google Scholar] [CrossRef
[4] Dai, J.J., Sun, L.Q. and Yang, Z.H. (2009) A General Addi-tive-Multiplicative Rates Model for Recurrent Event Data. Science in China, 52, 2257-2265. [Google Scholar] [CrossRef
[5] Lawless, J.F., Nadeau, C. and Cook, R.J. (1997) Analysis of Mean and Rate Functions for Recurrent Events. In: Lin, D.Y. and Fleming, T.R., Eds., Proceedings of the First Seattle Symposium in Biostatistics. Lecture Notes in Statistics, Vol. 123, Springer, New York, 37-49. [Google Scholar] [CrossRef
[6] Liu, Y.Y. and Wu, Y.S. (2011) Semiparametric Additive In-tensity Model with Frailty for Recurrent Events. Acta Mathematica Sinica, English Series, 27, 1831-1842. [Google Scholar] [CrossRef
[7] Cai, J.H., He, H.J., Song, X.Y. and Sun, L.Q. (2017) An Addi-tive-Multiplicative Mean Residual Life Model for Right- Censored Data. Biometrical Journal, 59, 579-592. [Google Scholar] [CrossRef] [PubMed]
[8] Sun, X.W. and Sun, L.Q. (2019) Semiparametric Estimation of Differences in Treatment-Specific Recurrent Event Means with a Terminal Event. Statistics and Its Interface, 12, 1-9. [Google Scholar] [CrossRef
[9] 马燕, 孙晓伟. 带终止事件的特定治疗复发事件均值比例的半参估计[J]. 应用数学学报, 2020, 43(6): 949-965.
[10] Xu, G.J, Chiou, S.H., Huang, C.Y. and Wang, M.C. (2017) Joint Scale-Change Models for Recurrent Events and Failure Time. Journal of the American Statistical Associa-tion, 112, 794-805. [Google Scholar] [CrossRef] [PubMed]
[11] Chan, K.C.G. and Wang, M.C. (2017) Semiparametric Modeling and Estimation of the Terminal Behavior of Recurrent Marker Processes Before Failure Events. Journal of the American Statistical Association, 112, 351-362. [Google Scholar] [CrossRef] [PubMed]
[12] Sun, Y.F., McCulloch, C.E., Marr, K.A. and Huang, C.-Y. (2021) Recurrent Events Analysis with Data Collected at Informative Clinical Visits in Electronic Health Records. Journal of the American Statistical Association, 116, 593-604. [Google Scholar] [CrossRef] [PubMed]
[13] Lyu, T., Luo, X.H. and Sun, Y.F. (2021) Addi-tive-Multiplicative Rates Model for Recurrent Event Data with Intermittently Observed Time-Dependent Covariates. Journal of Data Science, 19, 615-633. [Google Scholar] [CrossRef
[14] 杜彦斌, 戴家佳, 金君. 多类型复发事件数据下一类半参数转移模型[J]. 统计与信息论坛, 2018, 33(4): 20-24.
[15] 杨青龙, 江芹. 多类型复发间隔时间下广义半参数风险模型[J]. 应用数学学报, 2022, 45(1): 72-87.
[16] Schaubel, D.E. and Cai, J. (2005) Semiparametric Methods for Clustered Recurrent Event Data. Lifetime Data Analysis, 11, 405-425. [Google Scholar] [CrossRef] [PubMed]
[17] Liu, D.D., Schaubel, D.E. and Kalbfleisch, J.D. (2012) Compu-tationally Efficient Marginal Models for Clustered Recurrent Event Data. Biometrics, 68, 637-647. [Google Scholar] [CrossRef] [PubMed]
[18] Liu, D.D., Kalbfleisch, J.D. and Schaubel, D.E. (2014) Methods for Estimating Center Effects on Recurrent Events. Statistics in Biosciences, 6, 19-37. [Google Scholar] [CrossRef] [PubMed]
[19] He, H.J., Pan, D., Sun, L., et al. (2017) Analysis of a Fixed Cen-ter Effect Additive Rates Model for Recurrent Event Data. Computational Statistics & Data Analysis, 112, 186-197. [Google Scholar] [CrossRef
[20] Sun, X.W., Song, X.Y. and Sun, L.Q. (2021) Additive Hazard Regression of Event History Studies with Intermittently Measured Covariates. Canadian Journal of Statistics, 50, 511-531. [Google Scholar] [CrossRef