面向智能配送的外卖订单需求预测模型研究——基于随机森林
Research on Takeaway Order Demand Forecasting Model for Intelligent Delivery—Based on Random Forest
摘要: 为提升智能配送系统中外卖订单需求的预测精度以优化运营效率,本研究针对其时序性、非线性及高噪声特性,提出了一种基于随机森林(RF)与直方图梯度提升树(GBDT)的两阶段Stacking集成学习预测模型(RF-GBDT)。本研究基于人工采集的数据进行了系统性特征工程,并创新性地利用GBDT学习RF初步预测的残差以修正系统偏差。实证结果表明,模型有效识别出“时刻”为最关键特征,且RF-GBDT组合模型性能显著优于单一模型,测试集RMSE、MAE和MAPE分别降低了13.9%、19.0%和19.8%,R²提升至0.9016,证实了该框架能为外卖平台提供更精准可靠的需求预测解决方案。
Abstract: To enhance the prediction accuracy of takeaway order demand in intelligent delivery systems and optimize operational efficiency, this study addresses its temporal, nonlinear, and high-noise characteristics by proposing a two-stage Stacking ensemble learning prediction model (RF-GBDT) based on Random Forest (RF) and Histogram-based Gradient Boosting Decision Tree (GBDT). The study systematically performs feature engineering based on manually collected data and innovatively employs GBDT to learn the residuals of preliminary RF predictions to correct systemic biases. Empirical results demonstrate that the model effectively identifies “time of day” as the most critical feature, and the RF-GBDT combined model significantly outperforms individual models, reducing RMSE, MAE, and MAPE on the test set by 13.9%, 19.0%, and 19.8%, respectively, while increasing R2 to 0.9016. This validates that the framework can provide more accurate and reliable demand prediction solutions for takeaway platforms.
文章引用:陈怡璇, 张峥. 面向智能配送的外卖订单需求预测模型研究——基于随机森林[J]. 运筹与模糊学, 2025, 15(5): 155-166. https://doi.org/10.12677/orf.2025.155239

参考文献

[1] 焦扬庆, 张世文, 颜芳, 等. 基于特征筛选与随机森林的土壤有机质空间预测[J/OL]. 农业环境科学学报, 1-14.
https://link.cnki.net/urlid/12.1347.S.20250826.1558.004, 2025-09-01.
[2] 曹岚, 周成功, 袁斌霞, 等. 基于梯度提升回归树-自训练贝叶斯优化支持向量机的光伏阵列故障诊断方法[J]. 太阳能学报, 2025, 46(6): 289-297.
[3] Chai, S.H., Lim, J.S., Yoon, H. and Wang, B. (2024) A Novel Methodology for Forecasting Business Cycles Using ARIMA and Neural Network with Weighted Fuzzy Membership Functions. Axioms, 13, Article 56. [Google Scholar] [CrossRef
[4] Pliakos, K., Geurts, P. and Vens, C. (2018) Global Multi-Output Decision Trees for Interaction Prediction. Machine Learning, 107, 1257-1281. [Google Scholar] [CrossRef
[5] Tanizaki, T., Hoshino, T., Shimmura, T. and Takenaka, T. (2019) Demand Forecasting in Restaurants Using Machine Learning and Statistical Analysis. Procedia CIRP, 79, 679-683. [Google Scholar] [CrossRef
[6] 彭龙, 韩国庆, 邬书豪, 等. 基于机器学习算法的CO2腐蚀速率预测[J]. 西安石油大学学报(自然科学版), 2023, 38(2): 113-121.
[7] 商文颖, 赵琳, 杨朔, 等. 基于多注意力融合与监督学习的城市能耗预测[J]. 北京邮电大学学报, 2025, 48(4): 33-38.
[8] 刘洋, 高燕琳, 史芸萍, 等. 基于ARIMA模型和ARIMA-SVM组合模型的流行性感冒的发病预测研究[J]. 首都公共卫生, 2024, 18(4):195-200.
[9] 成振华, 匡昭敏, 陶伟, 等. 基于随机森林的多蔗区甘蔗产量预测[J/OL]. 甘蔗糖业: 1-8.
https://link.cnki.net/urlid/44.1210.TS.20250815.1017.010, 2025-08-26.
[10] 张宵杰, 李根, 解建勋, 等. 基于随机森林和XGBoost算法的管内流动沸腾传热特性预测研究[J/OL]. 北京航空航天大学学报: 1-17.
https://link.cnki.net/urlid/11.2625.V.20250807.1500.005, 2025-08-21.
[11] 亢金涛, 许振浩, 谢辉辉, 等. 基于钻进数据定量解译的隧道围岩完整性评价[J/OL]. 中南大学学报(自然科学版): 1-15.
https://link.cnki.net/urlid/43.1426.N.20250819.2231.002, 2025-08-21.