基于孤立森林算法贝叶斯优化的Bi-LSTM交通流量预测
Traffic Flow Prediction Based on Isolated Forest Algorithm and Bayesian Optimization of Bi-LSTM
摘要: 精确的交通流预测是建设智慧城市的主要问题之一,能为城市发展、城市建设以及交通管控等提供更科学的依据和参考。本文针对短时交通流预测模型中存在的一些不足点进行改进,设计了一种基于孤立森林算法优化下的Bi-LSTM多步交通流预测模型,该模型充分考虑了在日常交通环境下存在某些不确定因素对于相邻天数中同样时间的交通流量影响,对于原始交通流数据集使用孤立森林算法进行异常值判别和清洗,提升了原始交通流数据集的可靠性,使用贝叶斯优化算法寻找网络模型的超参数,最后利用双向长短记忆网络(Bi-LSTM)对时间序列数据的连续性和周期性的特点进行多步交通流预测。采用江苏省某地级市的道路交通流数据进行试验,试验结果表明基于孤立森林算法优化下的Bi-LSTM具备精确的交通流预测能力,在预测T + 1时刻的交通流数据上,与BP、LSTM、Bi-LSTM、EMD-LSTM和PSO-BiLSTM相比,平均绝对误差(MEA)下降了16.3265、15.5185、12.6785、6.2758和0.44,均方根误差(RMES)下降了22.7283、22.4023、14.9303、7.6905和0.5509,判定系数R2提升了7.604%、7.391%、3.701%、1.182%和0.025%。在预测T + 3时刻的交通流数据上,平均绝对误差(MEA)下降了14.9669、17.3019、8.6179、1.4647和0.1997,均方根误差(RMES)下降了21.4897、23.9447、11.9737、1.7354和0.8882,判定系数R2提升了6.589%、7.893%、2.602%、0.211%和0.11%。
Abstract: Accurate traffic flow prediction remains a crucial concern in the advancement of smart urban settings, offering a more empirical foundation and guidance for urban development, city planning, and traffic management strategies. This research tackles some of the inadequacies within short- term traffic flow prediction models and introduces an enhancement through the formulation of a Bi-LSTM multi-step traffic flow prediction model, optimized using the Isolation Forest algorithm. This model takes into comprehensive consideration the uncertainties inherent in daily traffic conditions, which influence the traffic volumes during corresponding time intervals across consecutive days. By employing the Isolation Forest algorithm, the raw traffic flow dataset undergoes anomaly detection and cleansing, thereby augmenting the robustness of the original data. The Bayesian optimization algorithm is applied to identify optimal hyperparameters for the network model. Harnessing the bidirectional long short-term memory network (Bi-LSTM), this model capitalizes on the inherent continuity and periodicity attributes of time series data to forecast multi-step traffic flow patterns. Empirical trials are conducted utilizing road traffic flow data from a prefecture-level city in Jiangsu Province. Empirical results affirm that the Isolation Forest algorithm-enhanced Bi-LSTM model demonstrates noteworthy prowess in accurate traffic flow prediction. When forecasting traffic flow at time step T + 1, in contrast to conventional BP, LSTM, Bi-LSTM, EMD-LSTM, and PSO-BiLSTM models, there has been a reduction in Mean Absolute Error (MAE) by 16.3265, 15.5185, 12.6785, 6.2758, and 0.44, respectively, as well as a decrease in Root Mean Square Error (RMSE) by 16.3265, 15.5185, 12.6785, 6.2758, and 0.5509, respectively. The coefficient of determination, R-squared (R2), has shown an improvement of 7.604%, 7.391%, 3.701%, 1.182%, and 0.025%, respectively. For the prediction of traffic flow data at time step T + 3, the MAE has been lowered by 14.9669, 17.3019, 8.6179, 1.4647, and 0.1997, while the RMSE has decreased by 21.4897, 23.9447, 11.9737, 1.7354, and 0.8882, respectively. Additionally, the R-squared (R2) value has increased by 6.589%, 7.893%, 2.602%, 0.211%, and 0.11%, respectively.
文章引用:黄书林, 韩印. 基于孤立森林算法贝叶斯优化的Bi-LSTM交通流量预测[J]. 建模与仿真, 2024, 13(4): 4205-4216. https://doi.org/10.12677/mos.2024.134381

参考文献

[1] 杨凤满. 基于人工神经网络的交通流预测方法综述[J]. 公路交通科技, 2020(37): 130-135.
[2] 吴晋武, 张海峰, 冉旭东. 基于数据约减和支持向量机的非参数回归短时交通流预测算法[J]. 公路交通科技, 2020, 37(7): 129-134.
[3] 白伟华, 张传斌, 张塽旖, 周腾. 基于异常值识别卡尔曼滤波器的短期交通流预测[J]. 计算机应用研究, 2021, 38(3): 817-821.
[4] Williams, B.M. and Hoel, L.A. (2003) Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results. Journal of Transportation Engineering, 129, 664-672. [Google Scholar] [CrossRef
[5] Polson, N.G. and Sokolov, V.O. (2017) Deep Learning for Short-Term Traffic Flow Prediction. Transportation Research Part C: Emerging Technologies, 79, 1-17. [Google Scholar] [CrossRef
[6] 陈王勇, 胡尧. 基于PELT的交通流状态检测与短期预测研究[J]. 公路交通科技, 2022, 39(1): 120-129.
[7] Miglani, A. and Kumar, N. (2019) Deep Learning Models for Traffic Flow Prediction in Autonomous Vehicles: A Review, Solutions, and Challenges. Vehicular Communications, 20, Article ID: 100184. [Google Scholar] [CrossRef
[8] Wang, J., Deng, W. and Guo, Y. (2014) New Bayesian Combination Method for Short-Term Traffic Flow Forecasting. Transportation Research Part C: Emerging Technologies, 43, 79-94. [Google Scholar] [CrossRef
[9] Ma, X., Tao, Z., Wang, Y., Yu, H. and Wang, Y. (2015) Long Short-Term Memory Neural Network for Traffic Speed Prediction Using Remote Microwave Sensor Data. Transportation Research Part C: Emerging Technologies, 54, 187-197. [Google Scholar] [CrossRef
[10] 王博文, 王景升, 朱茵, 等. 基于ARMA-SVR的短时交通流量预测模型研究[J]. 公路交通科技, 2021, 38(11): 126-133.
[11] 郭佳丽, 邢双云, 栾昊, 等. 基于改进的LSTM算法的时间序列流量预测[J]. 南京信息工程大学学报(自然科学版), 2021, 13(5): 571-575.
[12] 邴其春, 张伟健, 沈富鑫, 等. 基于变分模态分解和LSTM的短时交通流预测[J]. 重庆理工大学学报(自然科学), 2023, 37(10): 169-177.
[13] 许宏科, 刘佳, 刘栩涛, 等. 基于PCA-LSTM的城市短时交通流预测研究[J]. 公路交通技术, 2021, 37(2): 115-118, 132.
[14] Liu, F.T., Ting, K.M. and Zhou, Z. (2012) Isolation-based Anomaly Detection. ACM Transactions on Knowledge Discovery from Data, 6, 1-39. [Google Scholar] [CrossRef
[15] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[16] Méndez, M., Merayo, M.G. and Núñez, M. (2023) Long-Term Traffic Flow Forecasting Using a Hybrid CNN-Bilstm Model. Engineering Applications of Artificial Intelligence, 121, Article ID: 106041. [Google Scholar] [CrossRef
[17] 牟振华, 李克鹏, 申栋夫. 基于小波降噪与贝叶斯神经网络联合模型的短时交通流量预测[J]. 科学技术与工程, 2020, 20(33): 13881-13886.