电商运输准时性的可解释预测:基于BO与集成学习
Explainable On-Time Delivery Prediction for E-Commerce: A BO and Ensemble Learning Approach
摘要: 随着电子商务的快速发展,运输准时性已成为影响客户体验与企业竞争力的核心因素。然而,传统预测方法难以应对物流环节中的复杂不确定性。为此,本文构建了一个融合贝叶斯优化、Stacking集成学习与SHAP可解释性分析的运输准时性预测框架。首先,基于Kaggle公开电商数据集进行特征工程;随后,应用贝叶斯优化对LightGBM、RF等多种机器学习模型进行超参数调优,并以Stacking方法集成,以提升对延迟订单的识别能力。实验结果表明,经贝叶斯优化后,模型平均召回率提升约7%,而集成模型取得了最优的综合性能,召回率达到73.64%。SHAP可解释性分析进一步揭示,折扣率和商品重量是影响模型输出的两个最强有力的驱动因素。本研究不仅提供了一套现实可行的预测框架,更通过可解释分析为电商物流的主动时效管理与精准干预提供了决策依据。
Abstract: With the rapid development of e-commerce, on-time delivery has become a critical factor influencing customer experience and corporate competitiveness. However, traditional prediction methods struggle to address the complex uncertainties inherent in logistics processes. To tackle this, this paper constructs a transportation timeliness prediction framework that integrates Bayesian optimization, Stacking ensemble learning, and SHAP interpretability analysis. First, feature engineering is performed based on a publicly available e-commerce dataset from Kaggle. Subsequently, Bayesian optimization is applied to fine-tune the hyperparameters of multiple machine learning models, including LightGBM and Random Forest, and these models are integrated using the Stacking method to enhance the identification capability for delayed orders. Experimental results show that after Bayesian optimization, the average recall rate of the models improved by approximately 7%, and the ensemble model achieved the best overall performance, with a recall rate of 73.64%. SHAP interpretability analysis further reveals that the discount rate and product weight are the two most powerful driving factors influencing model output. This study not only provides a practical and feasible prediction framework but also offers decision-making support for proactive timeliness management and precise intervention in e-commerce logistics through interpretable analysis.
文章引用:赵天煜, 罗鄂湘, 贾泽如. 电商运输准时性的可解释预测:基于BO与集成学习[J]. 电子商务评论, 2026, 15(3): 175-184. https://doi.org/10.12677/ecl.2026.153261

参考文献

[1] 国家邮政局. 2026年全国邮政工作会议[R]. 北京: 国家邮政局, 2026.
[2] Archetti, C. and Bertazzi, L. (2020) Recent Challenges in Routing and Inventory Routing: E‐Commerce and Last‐Mile Delivery. Networks, 77, 255-268. [Google Scholar] [CrossRef
[3] 周永务, 李斐. 新零售运营管理面临的问题与挑战[J]. 系统管理学报, 2022, 31(6): 1041-1055.
[4] Kandula, S., Krishnamoorthy, S. and Roy, D. (2021) A Prescriptive Analytics Framework for Efficient E-Commerce Order Delivery. Decision Support Systems, 147, Article ID: 113584. [Google Scholar] [CrossRef
[5] Lin, C., Wu, H. and Chang, Y. (2011) The Critical Factors Impact on Online Customer Satisfaction. Procedia Computer Science, 3, 276-281. [Google Scholar] [CrossRef
[6] 曹允春, 朱俊洪. 跨境电商视角下机场航空物流发展对策探析[J]. 铁道运输与经济, 2021, 43(4): 61-65.
[7] Giuffrida, N., Fajardo-Calderin, J., Masegosa, A.D., Werner, F., Steudter, M. and Pilla, F. (2022) Optimization and Machine Learning Applied to Last-Mile Logistics: A Review. Sustainability, 14, Article 5329. [Google Scholar] [CrossRef
[8] Karim, M.A., Samaranayake, P., Smith, A.J.R. and Halgamuge, S.K. (2009) An On-Time Delivery Improvement Model for Manufacturing Organisations. International Journal of Production Research, 48, 2373-2394. [Google Scholar] [CrossRef
[9] Gheidari, S.K. (2025) Design and Training a Routing Model for Predicting E-Commerce Logistics Distribution Systems in Transportation and Delivery of Educational Goods for Students Using Existing Algorithms, Big Data Analysis, and Machine Learning Methods. Pegem Journal of Education and Instruction, 15, 330-360.
[10] Shawon, R.E.R., Supti, A.M., Jobaer, A.A., et al. (2025) Enhancing Supply Chain Resilience across U.S. Regions Using Machine Learning and Logistics Performance Analytics. International Journal of Applied Mathematics, 38, 182-205. [Google Scholar] [CrossRef
[11] Ye, B., Zuo, J., Zhao, X. and Luo, L. (2016) Research on the Express Delivery Delay Prediction Based on Neural Network in the Background of Big Data. Proceedings of the 6th International Conference on Electronic, Mechanical, Information and Management Society, Shenyang, 1-3 April 2016, 1449-1454. [Google Scholar] [CrossRef
[12] 江乾坤, 王成哲. 互联网企业海外并购财务风险大数据预警研究——基于Stacking集成学习[J]. 技术经济, 2023, 42(9): 147-160.
[13] Lundberg, S.M. and Lee, S.I. (2017) A Unified Approach to Interpreting Model Predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 4768-4777.
[14] Pratama, N., Anrahvi, R., Tambal, A. and Singh, A. (2025) Classification of E-Commerce Shipping Timeliness Using Supervised Learning Algorithm. Public Research Journal of Engineering, Data Technology and Computer Science, 3, 59-69. [Google Scholar] [CrossRef
[15] Al Rahib, M.A., Saha, N., Mia, R. and Sattar, A. (2024) Customer Data Prediction and Analysis in E-Commerce Using Machine Learning. Bulletin of Electrical Engineering and Informatics, 13, 2624-2633. [Google Scholar] [CrossRef