融合集成算法与宽度学习的商品需求量预测
Fusing Integrated Algorithms with Broad Learning System for Commodity Demand Forecasting
DOI: 10.12677/AAM.2023.1212516, PDF,    科研立项经费支持
作者: 曾诗懿*, 何青霞, 张 宇, 赵 锋, 张 彤:重庆理工大学理学院,重庆;苏理云:重庆理工大学理学院,重庆;重庆理工大学时空大数据研究中心,重庆
关键词: 需求量预测特征工程宽度学习(BLS)XGBoostLightGBMDemand Forecasting Feature Engineering Broad Learning System XGBoost LightGBM
摘要: 由于电商订单销售数据中全品类商品的种类繁多、分类多层化,且数据集还存在时序长度分布不均、地区差异、定性变量及非线性特征变量的处理等问题,导致需求量的预测任务较困难。为了解决上述问题,本研究提出一种宽度学习集成框架,将机器学习中的Random Forest、GBDT、XGBoost和LightGBM与宽度学习模型进行随机融合,并分别进行验证,对比模型效果。经实证分析结果表明:LightGBM-BLS模型具有最优的预测性能和计算性能,它在保持LightGBM模型计算优势的同时,大幅度地提升了模型本身的预测精度,使拟合优度达到0.99,评价指标RMSE、MSE降低90%以上,MAE降低85%以上。
Abstract: Due to the wide variety of full-category commodities in e-commerce order sales data, multi-layered categorization, and the dataset also has the problems of uneven distribution of time-series lengths, regional differences, and the treatment of qualitative variables and non-linear feature variables, which leads to a more difficult task of demand prediction. To solve the above problems, this study proposes a breadth learning integration framework, which stochastically fuses Random Forest, GBDT, XGBoost and LightGBM in machine learning with the breadth learning model, and validates and compares the model effects respectively. Empirical analysis results show that the LightGBM-BLS model has optimal prediction performance and computational performance, which maintains the computational advantages of the LightGBM model while substantially improving the prediction accuracy of the model itself, so that the goodness of fit reaches 0.99, and the evaluation indexes of RMSE and MSE are reduced by more than 90%, and MAE is reduced by more than 85%.
文章引用:曾诗懿, 苏理云, 何青霞, 张宇, 赵锋, 张彤. 融合集成算法与宽度学习的商品需求量预测[J]. 应用数学进展, 2023, 12(12): 5254-5266. https://doi.org/10.12677/AAM.2023.1212516

参考文献

[1] Yin, X.T. and Tao, X.S. (2021) Prediction of Merchandise Sales on E-Commerce Platforms Based on Data Mining and Deep Learning. Scientific Programming, 6, 1-9. [Google Scholar] [CrossRef
[2] Hu, W. and Zhang, X.C. (2020) Commodity Sales Forecast Based on ARIMA Model Residual Optimization. 2020 IEEE 5th International Conference on Communication, Image and Signal Processing (CCISP), Chengdu, 13-15 November 2020, 229-233. [Google Scholar] [CrossRef
[3] Singh, B., Kumar, P., Sharma, N., et al. (2020) Sales Forecast for Amazon Sales with Time Series Modeling. 2020 IEEE 1st International Conference on Power, Control and Computing Technologies (ICPC2T), Raipur, 3-5 January 2020, 38-43. [Google Scholar] [CrossRef
[4] 胡小芳. 基于电商数据的商品供需平衡研究[D]: [硕士学位论文]. 广州: 暨南大学, 2022.
[5] Ensafi, Y., Amin, S.H., Zhang, G.Q., et al. (2022) Time-Series Forecast-ing of Seasonal Items Sales Using Machine Learning—A Comparative Analysis. International Journal of Information Management Data Insights, 2, Article ID: 100058. [Google Scholar] [CrossRef
[6] 黄国兴, 曹先怀, 钱晓飞. 一种基于随机森林的备件预测模型研究[J]. 运筹与管理, 2021, 30(10): 165-168.
[7] 荆浩, 刘垭, 唐金环. 基于多变量支持向量机的供应链需求预测分析[J]. 系统工程, 2018, 36(11): 121-126.
[8] 贺毅岳, 韩进博, 高妮. 基于EEMD-SVR的沪深300指数预测建模[J]. 统计与决策, 2020, 36(17): 152-156.
[9] 李杰, 王玉霞, 赵旭东. 电商企业商品销量的预测方法[J]. 统计与决策, 2018, 34(22): 176-179. [Google Scholar] [CrossRef
[10] 刘辰阳. J社区团购平台基于XGBOOST的快消品销量预测方法[D]: [硕士学位论文]. 大连: 大连理工大学, 2022.
[11] 吴霆辉. 基于遗传算法优化LightGBM-XGBoost模型的电力负荷预测[J]. 科学技术创新, 2023(3): 71-75.
[12] 谢勇, 项薇, 季孟忠, 彭俊, 黄益槐. 基于Xgboost和LightGBM算法预测住房月租金的应用分析[J]. 计算机应用与软件, 2019, 36(9): 151-155+191.
[13] 王晓晖, 张亮, 李俊清, 等. 基于遗传算法与随机森林的XGBoost改进方法研究[J]. 计算机科学, 2020, 47(S2): 454-458+463.
[14] 霍佳震, 徐骏, 陈铭洲. 基于EEMD-Holt-Winters-GBDT模型的零售商品销量多步预测[J]. 工业工程与管理, 2023: 1-14.
[15] Wang, D.-N., Li, L. and Zhao, D. (2022) Corporate Finance Risk Prediction Based on LightGBM. Information Sciences, 602, 259-268. [Google Scholar] [CrossRef
[16] Wang, Y.Y., Chen, J., Chen, X.Q., et al. (2021) Short-Term Load Forecasting for Industrial Customers Based on TCN- LightGBM. IEEE Transactions on Power Systems, 36, 1984-1997. [Google Scholar] [CrossRef
[17] Chen, C.L. and Liu, Z.L. (2018) Broad Learning System: An Effective and Efficient Incremental Learning System without the Need for Deep Architecture. IEEE Transactions on Neural Networks and Learning Systems, 29, 10-24. [Google Scholar] [CrossRef
[18] Su, L.Y., Xiong, L. and Yang, J.L. (2023) Multi-Attn BLS: Multi-Head Attention Mechanism with Broad Learning System for Chaotic Time Series Prediction. Applied Soft Compu-ting, 132, Article ID: 109831. [Google Scholar] [CrossRef
[19] 褚菲, 苏嘉铭, 梁涛, 等. 基于lasso和elastic net的宽度学习系统网络结构稀疏方法[J]. 控制理论与应用, 2020, 37(12): 2543-2550.
[20] 杨光雨, 李晓航. 基于最大信息挖掘宽度学习系统短期电力负荷预测研究[J]. 电测与仪表, 2022, 59(3): 38-45.