基于XGBoost算法的短期交通流预测
Short-Term Traffic Flow Prediction Based on XGBoost
摘要:
针对短期交通流预测问题,为完成实时精准预测,建立了一种基于Huber损失的极端梯度上升(Extreme Gradient Boosting, XGBoost)短时交通流预测模型。通过对交通流数据周期性、关联性的分析,提取时间特征,并进行时间特征重要性分析。利用该模型以及提取的特征进行交通流预测,实验结果表明:该模型优于基于均方误差损失的极端梯度上升模型以及基于平均绝对误差损失的极端梯度上升模型。同时,该模型较梯度提升回归模型、支持向量机回归模型具有更高的预测精度,各误差指标小,且模型训练时间短,符合短时交通流预测所要求的时效性。
Abstract:
For short-term traffic flow prediction, in order to complete real-time accurate prediction, an extreme gradient boosting (XGBoost) short-term traffic flow prediction model based on Huber loss is established. By analyzing the periodicity and relevance of traffic flow data, time features are extracted and feature importance analysis is performed. Using this model and the extracted features for traffic flow prediction, the experimental results show that the model is superior to the extreme gradient boosting model based on mean square error loss and the extreme gradient boosting model based on average absolute error loss. At the same time, the model has higher prediction accuracy than gradient boosting regression model and support vector machine regression model, each error index is small, and the model training time is short, which meets the timeliness required by short-term traffic flow prediction.
参考文献
|
[1]
|
李敏, 黄迟. 集成学习下的短期交通流预测[J]. 济南大学学报(自然科学版), 2019, 33(5): 390-395.
|
|
[2]
|
Ahmed, M.S. and Cook, A.R. (1979) Analysis of Freeway Traffic Time-Series Data by Using Box-Jenkins Technique. Transportation Research Board, 722, 1-9.
|
|
[3]
|
Li, Y., Xiao, J., et al. (2016) Multiple Measures-Based Chaotic Time Series for Traffic Flow Prediction Based on Bayesian Theory. Nonlinear Dynamics, 85, 179-194. [Google Scholar] [CrossRef]
|
|
[4]
|
Wei, H., Cheng, Z., Sotelo, M.A., et al. (2017) Short-Term Vessel Traffic Flow Forecasting by Using an Improved Kalman Model. Cluster Computing, No. 10, 1-10.
|
|
[5]
|
Guo, J.H. and Williams, B.M. (2010) Real-Time Short-Term Traffic Speed Level Forecasting and Uncertainty Quantification using Layered Kalman Filters. Transportation Research Record: Journal of the Transportation Research Board, 2175, 28-37. [Google Scholar] [CrossRef]
|
|
[6]
|
李晓磊, 肖进丽, 刘明俊. 基于SARIMA模型的船舶交通流量预测研究[J]. 武汉理工大学学报(交通科学与工程版), 2017, 41(2): 329-332.
|
|
[7]
|
陆化普, 孙智源, 屈闻聪. 基于时空模型的交通流故障数据修正方法[J]. 交通运输工程学报, 2015, 15(6): 92-100.
|
|
[8]
|
钟颖, 邵毅明, 吴文文. 基于XGBoost的短时交通流预测模型[J]. 科学技术与工程, 2019, 19(30): 338-342.
|
|
[9]
|
苏美红, 张海. 基于不同损失函数的模型选择和正则化学习方法[J]. 纺织高校基础科学学报, 2014, 27(4): 464.
|
|
[10]
|
叶景, 李丽娟, 唐臻旭. 基于CNN-XGBoost的短时交通流预测[J]. 计算机工程与设计, 2020, 41(4): 1081-1086.
|