自激点过程在相依事件中的建模及其应用
Modeling and Application of Self-Exciting Point Process in Dependent Events
DOI: 10.12677/aam.2025.144202, PDF,    科研立项经费支持
作者: 戴 乐*, 高 犇, 李晨龙, 郭 平#:太原理工大学数学学院,山西 晋中
关键词: 点过程大数据建模分布式推断相依事件序列Point Process Big Data Modeling Distributed Inference Dependent Event Sequence
摘要: 相依事件序列广泛存在于金融、社交网络等诸多领域,其事件间具有强烈的依赖性,使得传统事件序列建模方法难以准确刻画其动态特征。为了有效地描述事件间的依赖关系,基于自激点过程构建了扩展模型,将自激函数推广到指数函数和形式,并提出了分布式统计推断方法。通过将数据划分至多个节点进行并行计算,再聚合估计结果,解决了传统极大似然估计在大数据处理中计算成本高和效率低的问题,为自激点过程在大数据中的应用提供了新的解决方案。仿真实验结果表明,分布式估计在有限样本下与传统全局估计表现一致,同时将运行时间缩短约70%。在实证分析中,自激点过程模型有效刻画了Boston犯罪数据和IPTV用户点播行为数据的趋势,并将计算时间分别提升了约95%和64%。
Abstract: The sequence of dependent events is widely present in various fields, such as finance and social networks, where strong dependencies between events make it difficult for traditional event sequence modeling methods to accurately capture their dynamic characteristics. To effectively describe these dependencies, an extended model based on the self-exciting point process is constructed, generalizing the self-exciting function to a sum of exponential functions. A distributed statistical inference method is proposed, which divides the data across multiple nodes for parallel computation and then aggregates the estimation results. This approach addresses the high computational cost and low efficiency of traditional maximum likelihood estimation in big data processing, providing a new solution for applying self-exciting point processes to big data. Simulation experiments show that the distributed estimation performs consistently with traditional global estimation on limited samples while reducing runtime by approximately 70%. In empirical analysis, the self-exciting point process model effectively captures trends in Boston crime data and IPTV user on-demand behavior data, improving computation time by approximately 95% and 64%, respectively.
文章引用:戴乐, 高犇, 李晨龙, 郭平. 自激点过程在相依事件中的建模及其应用[J]. 应用数学进展, 2025, 14(4): 744-754. https://doi.org/10.12677/aam.2025.144202

参考文献

[1] Baum, L.E. and Petrie, T. (1966) Statistical Inference for Probabilistic Functions of Finite State Markov Chains. The Annals of Mathematical Statistics, 37, 1554-1563. [Google Scholar] [CrossRef
[2] Rabiner, L.R. (1990) A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. In: Waibel, A. and Lee, K.-F., Eds., Readings in Speech Recognition, Elsevier, 267-296. [Google Scholar] [CrossRef
[3] Trinh, M. (2018) Non-Stationary Processes and Their Application to Financial High-Frequency Data. University of Sussex.
[4] Cifuentes-Amado, M.V. and Cepeda-Cuervo, E. (2015) Non-Homogeneous Poisson Process to Model Seasonal Events: Application to the Health Diseases. International Journal of Statistics in Medical Research, 4, 337-346. [Google Scholar] [CrossRef
[5] Hawkes, A.G. (1971) Spectra of Some Self-Exciting and Mutually Exciting Point Processes. Biometrika, 58, 83-90. [Google Scholar] [CrossRef
[6] Kwon, J., Zheng, Y. and Jun, M. (2023) Flexible Spatio-Temporal Hawkes Process Models for Earthquake Occurrences. Spatial Statistics, 54, Article ID: 100728. [Google Scholar] [CrossRef
[7] Shah, R., et al. (2020) Temporal Point Process Models for Nepal Earthquake Aftershocks. International Journal of Statistics and Reliability Engineering, 7, 275-285.
[8] Ding, X., Shi, J., Duan, J., Qin, B. and Liu, T. (2021) Quantifying the Effects of Long-Term News on Stock Markets on the Basis of the Multikernel Hawkes Process. Science China Information Sciences, 64, Article ID: 192102. [Google Scholar] [CrossRef
[9] Zhuo, J., Chen, Y., Zhou, B., et al. (2023) A Hawkes Process Analysis of High-Frequency Price Endogeneity and Market Efficiency. The European Journal of Finance, 30, 1-31.
[10] Kobayashi, R. and Lambiotte, R. (2021) Tideh: Time-Dependent Hawkes Process for Predicting Retweet Dynamics. Proceedings of the International AAAI Conference on Web and Social Media, 10, 191-200. [Google Scholar] [CrossRef
[11] Nie, H.R., Zhang, X., Li, M., Dolgun, A. and Baglin, J. (2020) Modelling User Influence and Rumor Propagation on Twitter Using Hawkes Processes. 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), Sydney, 6-9 October 2020, 637-656. [Google Scholar] [CrossRef
[12] Goda, M., Mizuno, T. and Yano, R. (2022) Hawkes Process Marked with Topics and Its Application to Twitter Data Analysis. Europhysics Letters, 140, 61001. [Google Scholar] [CrossRef
[13] Gao, Y., Liu, W., Wang, H., Wang, X., Yan, Y. and Zhang, R. (2021) A Review of Distributed Statistical Inference. Statistical Theory and Related Fields, 6, 89-99. [Google Scholar] [CrossRef
[14] Clinet, S. and Potiron, Y. (2018) Statistical Inference for the Doubly Stochastic Self-Exciting Process. Bernoulli, 24, 3469-3493. [Google Scholar] [CrossRef
[15] Ozaki, T. (1979) Maximum Likelihood Estimation of Hawkes’ Self-Exciting Point Processes. Annals of the Institute of Statistical Mathematics, 31, 145-155. [Google Scholar] [CrossRef
[16] Oakes, D. (1975) The Markovian Self-Exciting Process. Journal of Applied Probability, 12, 69-77. [Google Scholar] [CrossRef
[17] Hardiman, S.J., Bercot, N. and Bouchaud, J. (2013) Critical Reflexivity in Financial Markets: A Hawkes Process Analysis. The European Physical Journal B, 86, 1-9. [Google Scholar] [CrossRef
[18] Pratiwi, H., Slamet, I., Saputro, D.R.S. and Respatiwulan, (2017) Self-Exciting Point Process in Modeling Earthquake Occurrences. Journal of Physics: Conference Series, 855, Article ID: 012033. [Google Scholar] [CrossRef
[19] Zhang, R., Walder, C. and Rizoiu, M. (2020) Variational Inference for Sparse Gaussian Process Modulated Hawkes Process. Proceedings of the AAAI Conference on Artificial Intelligence, 34, 6803-6810. [Google Scholar] [CrossRef
[20] Clinet, S. and Yoshida, N. (2017) Statistical Inference for Ergodic Point Processes and Application to Limit Order Book. Stochastic Processes and Their Applications, 127, 1800-1839. [Google Scholar] [CrossRef