XGBoost与ARIMA在股票画像中的应用研究
Research on the Application of XGBoost and ARIMA in Stock Portrait
摘要: 本研究基于中国石油2023~2024年的实时数据,构建了ARIMA和XGBoost模型,以评估和比较它们在股票趋势预测中的性能。结果显示,XGBoost在训练集上具有强大的非线性拟合能力,但在测试集上错误波动性更高。ARIMA模型虽解释能力较弱,但在短期预测中更稳定,适合低延迟场景。建议在高频交易中使用ARIMA模型(响应时间小于50毫秒),而将XGBoost优先用于事件驱动策略。本研究的局限性在于仅使用单一资产和静态时间窗口数据。未来研究将引入多因素(如VIX指数、情绪数据)及动态滚动预测框架以提升混合模型的泛化能力。本研究为优化混合模型在特征工程、权重分配及评估系统方面的理论支持,并为金融机器学习从“模型堆砌”向“机制驱动型混合模型”转型提供参考。
Abstract: This study, based on PetroChina’s 2023~2024 real-time data, builds ARIMA and XGBoost models to evaluate and compare their performance in stock trend prediction. Results show XGBoost has strong non-linear fitting ability on the training set but higher error volatility on the test set. ARIMA, though less explanatory, is more stable in short-term forecasting, suiting low-latency scenarios. It’s recommended to use ARIMA (with a response time of less than 50 ms) for high-frequency trading and prioritize XGBoost for event-driven strategies. This study’s limitations lie in its single-asset and static time-window data. Future work will introduce multi-factors (e.g., VIX index, sentiment data) and dynamic rolling prediction frameworks to enhance the hybrid model’s generalization. The study offers theoretical support for optimizing hybrid models in feature engineering, weight allocation, and evaluation systems, and references for shifting financial machine learning from “model piling” to “mechanism-driven hybrids”.
文章引用:叶环宇, 王章琴, 蓝尉太, 贾礼平. XGBoost与ARIMA在股票画像中的应用研究[J]. 统计学与应用, 2025, 14(8): 74-84. https://doi.org/10.12677/sa.2025.148217

参考文献

[1] 罗露. 基于ARIMA模型的股票价格预测分析[J]. 应用数学进展, 2022, 11(12): 9096-9102. [Google Scholar] [CrossRef
[2] Ariyo, A.A., Adewumi, A.O. and Ayo, C.K. (2014) Stock Price Prediction Using the ARIMA Model. 2014 UK Sim-AMSS 16th International Conference on Computer Modelling and Simulation, Cambridge, 26-28 March 2014, 106-112. [Google Scholar] [CrossRef
[3] 张卫国, 高东, 刘宝民. 基于ARIMA-GARCH模型的沪深300指数波动率预测[J]. 财经研究, 2018, 44(5): 96-106.
[4] 陈伟. 基于ARIMA模型的股票价格预测[J]. 统计与决策, 2024, 40(3): 45-50.
[5] 贾雪, 吴芷婧, 孙佳萍, 等. 基于GARCH模型对股票市场进行分析预测[J]. 金融研究, 2021, 38(5): 112-120.
[6] 李斌, 龙真. 中国股票市场可预测性研究: 基于机器学习的视角[J]. 管理科学学报, 2023, 26(10): 138-158.
[7] 王燕, 郭元凯. 基于XGBoost模型的股票预测研究[D]: [硕士学位论文]. 合肥: 中国科学技术大学, 2023.
[8] 贾雨菲. 基于机器学习的股票价格预测研究[J]. 贵州大学学报(社会科学版), 2024, 42(2): 56-65.
[9] 张天瑞. 基于EEMD-GA-XGBoost模型的股票价格预测研究[D]: [硕士学位论文]. 武汉: 华中科技大学, 2023.
[10] 张毅, 田浩. XGBoost在量化选股中的应用研究[J]. 金融管理研究, 2020, 12(2): 45-56.
[11] 王燕. 基于网格搜索优化的XGBoost模型及其在股票预测中的应用[J]. 金融研究, 2019, 45(12): 127-139.
[12] Python注意力机制Attention下CNN-LSTM-ARIMA混合模型预测中国银行股票价格[EB/OL].
https://tecdat.cn/python%E6%B3%A8%E6%84%8F%E5%8A%9B%E6%9C%BA%E5%88%B6attention%E4%B8%8Bcnn-lstm-arima%E6%B7%B7%E5%90%88%E6%A8%A1%E5%9E%8B%E9%A2%84%E6%B5%8B%E4%B8%AD%E5%9B%BD%E9%93%B6%E8%A1%8C%E8%82%A1%E7%A5%A8%E4%BB%B7/, 2025-06-16.
[13] Li, T., Liu, Z.Y., Shen, Y.Y., et al. (2023) MASTER: Market-Guided Stock Transformer for Stock Price Forecasting. arXiv: 2312.15235v1.
https://arxiv.org/pdf/2312.15235
[14] Fan, J. and Shen, Y. (2024) StockMixer: A Simple Yet Strong MLP-Based Architecture for Stock Price Forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, 38, 8389-8397. [Google Scholar] [CrossRef
[15] Xia, H., Ao, H., Li, L., Liu, Y., Liu, S., Ye, G., et al. (2024) CI-STHPAN: Pre-Trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph. Proceedings of the AAAI Conference on Artificial Intelligence, 38, 9187-9195. [Google Scholar] [CrossRef