基于改进梯度下降算法的XGBoost模型研究
Research on XGBoost Model Based on Improved Gradient Descent Algorithm
DOI: 10.12677/orf.2024.146529, PDF,   
作者: 黄 丽:南京信息工程大学数学与统计学院,江苏 南京
关键词: XGBoost随机梯度下降房价预测分数阶微积分XGBoost Stochastic Gradient Descent Housing Price Forecast Fractional Calculus
摘要: 分数阶微积分的优良性质为基于梯度下降的优化方法提供了新的前景,相比于传统整数阶,分数阶微积分可以通过不断更新来调整阶数。XGBoost模型结合了正则化技术和梯度提升决策树,具有高效性和准确性。但在实际应用中,XGBoost模型仍存在训练速度较慢、易过拟合等局限性。为了解决这些问题,本文所做工作主要有如下几个方面:1) 基于Caputo定义引入了分数阶微积分的基本内容。作为一种经典的凸优化算法,梯度下降法在结合分数阶微积分理论后延伸出了分数阶梯度下降算法。2) 在简化目标函数的过程中,将分数阶梯度下降算法与XGBoost模型相结合,并选择合适的分数阶,得到新的分数阶XGBoost模型,在保证准确性的同时提高了训练速度。3) 利用改进后的XGBoost模型对房价预测数据集进行实证分析。与传统的整数阶XGBoost模型进行对比分析,改进后的模型具有更快的收敛速度和更好的预测性能。
Abstract: The excellent properties of fractional calculus provide a new prospect for optimization methods based on gradient descent, which can be continuously updated to adjust the order compared to the traditional integer order. The XGBoost model combines regularization techniques and gradient boosting decision trees, which is efficient and accurate. However, in practical applications, the XGBoost model still has limitations such as slow training speed and easy overfitting. In order to solve these problems, the work done in this paper is mainly as follows: 1) Based on the Caputo definition, the basic content of fractional calculus is introduced. As a classical convex optimization algorithm, the gradient descent method is extended to the fractional gradient descent algorithm after combining the fractional calculus theory. 2) In the process of simplifying the objective function, the fractional gradient descent algorithm is combined with the XGBoost model, and the appropriate fractional order is selected to obtain a new fractional XGBoost model, which improves the training speed while ensuring the accuracy. 3) The improved XGBoost model is used to conduct an empirical analysis of the housing price prediction dataset. Compared with the traditional integer order XGBoost model, the improved model has faster convergence speed and better prediction performance.
文章引用:黄丽. 基于改进梯度下降算法的XGBoost模型研究[J]. 运筹与模糊学, 2024, 14(6): 266-279. https://doi.org/10.12677/orf.2024.146529

参考文献

[1] 史加荣, 王丹, 尚凡华, 等. 随机梯度下降算法研究进展[J]. 自动化学报, 2021, 47(9): 2103-2119.
[2] 王书博. 基于改进梯度下降法求解多元线性回归方程[J]. 数学的实践与认识, 2022, 52(10): 167-172.
[3] 谢景伊. 基于时间分数阶梯度下降方法的神经网络训练方法[D]: [硕士学位论文]. 贵阳: 贵州大学, 2022.
[4] 许少华, 宋美玲, 许辰, 等. 一种基于混合误差梯度下降算法的过程神经网络训练[J]. 东北石油大学学报, 2014, 38(4): 92-96+11-12.
[5] Chen, Y., Gao, Q., Wei, Y. and Wang, Y. (2017) Study on Fractional Order Gradient Methods. Applied Mathematics and Computation, 314, 310-321. [Google Scholar] [CrossRef
[6] Pu, Y., Zhou, J., Zhang, Y., Zhang, N., Huang, G. and Siarry, P. (2015) Fractional Extreme Value Adaptive Training Method: Fractional Steepest Descent Approach. IEEE Transactions on Neural Networks and Learning Systems, 26, 653-662. [Google Scholar] [CrossRef] [PubMed]
[7] Chaudhary, N.I., Zubair, S. and Raja, M.A.Z. (2017) A New Computing Approach for Power Signal Modeling Using Fractional Adaptive Algorithms. ISA Transactions, 68, 189-202. [Google Scholar] [CrossRef] [PubMed]
[8] Wang, Y., He, Y. and Zhu, Z. (2022) Study on Fast Speed Fractional Order Gradient Descent Method and Its Application in Neural Networks. Neurocomputing, 489, 366-376. [Google Scholar] [CrossRef
[9] Wang, J., Wen, Y., Gou, Y., Ye, Z. and Chen, H. (2017) Fractional-Order Gradient Descent Learning of BP Neural Networks with Caputo Derivative. Neural Networks, 89, 19-30. [Google Scholar] [CrossRef] [PubMed]
[10] Khan, S., Naseem, I., Malik, M.A., Togneri, R. and Bennamoun, M. (2018) A Fractional Gradient Descent-Based RBF Neural Network. Circuits, Systems, and Signal Processing, 37, 5311-5332. [Google Scholar] [CrossRef
[11] Chen, M., Chen, B., Zeng, G., Lu, K. and Chu, P. (2020) An Adaptive Fractional-Order BP Neural Network Based on Extremal Optimization for Handwritten Digits Recognition. Neurocomputing, 391, 260-272. [Google Scholar] [CrossRef
[12] 阚涛, 高哲, 杨闯. 采用分数阶动量的卷积神经网络随机梯度下降法[J]. 模式识别与人工智能, 2020, 33(6): 5-9.
[13] 李兴怡, 岳洋. 梯度下降算法研究综述[J]. 软件工程, 2020, 23(2): 1-4.
[14] 刘伟. 基于改进的XGBoost的短期交通流预测[D]: [硕士学位论文]. 太原: 太原理工大学, 2020.
[15] Wang, Y., He, Y. and Zhu, Z. (2022) Study on Fast Speed Fractional Order Gradient Descent Method and Its Application in Neural Networks. Neurocomputing, 489, 366-376. [Google Scholar] [CrossRef
[16] 李梦宇, 张泽亚, 张知, 等. 基于CART算法的电能表故障概率决策树分析[J]. 电力大数据, 2017, 20(10): 7-10+60.
[17] 陈玉全. 分数阶梯度下降法基础理论研究[D]: [博士学位论文]. 合肥: 中国科学技术大学, 2020.
[18] 原琨. 基于XGBoost的移动优惠券使用预测及影响因素分析[J]. 广西质量监督导报, 2018(12): 99-100.