基于多尺度时空注意力机制的车辆换道意图预测
Vehicle Lane Change Intention Prediction Based on Multi-Scale Spatial-Temporal Attention Mechanism
摘要: 车辆换道行为预测是智能交通系统和自动驾驶技术的核心组成部分,对提升道路交通安全具有重要意义。针对现有方法在时空特征建模分离、多尺度信息融合不足和动态权重调整缺失等方面的技术挑战,本文提出了一种基于多尺度时空注意力机制的车辆换道意图预测模型(MSTAN)。该模型采用分治协同的设计思想,构建了三个并行处理分支:时序特征建模分支基于双向长短时记忆网络和自注意力机制捕获车辆运动状态的时间依赖关系;空间交互建模分支利用图卷积网络和边注意力机制建模车辆间的动态空间关系;多尺度特征融合分支通过并行多分辨率卷积和注意力机制提取多粒度上下文信息。模型采用自适应权重融合策略实现三个分支特征的最优整合。基于NGSIM数据集的实验结果表明,MSTAN在US-101、I-80和Lankershim三个路段的换道行为三分类任务中分别达到98.3%、97.6%和95.2%的预测准确率,优于所有基线方法。并通过分支贡献度分析和注意力机制有效性评估,定量验证了各功能模块对模型性能的独立贡献和协同效应。时间窗口敏感性分析和预测时间提前量分析确定了最优的时序建模参数配置,证明了模型在不同预测场景下的稳定性和鲁棒性。
Abstract: Vehicle lane change behavior prediction is a core component of intelligent transportation systems and autonomous driving technologies, playing a crucial role in enhancing road traffic safety. Addressing the technical challenges of existing methods in spatial-temporal feature modeling separation, insufficient multi-scale information fusion, and the absence of dynamic weight adjustment, this paper proposes a Multi-Scale Spatial-Temporal Attention Network (MSTAN) for vehicle lane change intention prediction. The model adopts a divide-and-conquer collaborative design philosophy, constructing three parallel processing branches: the temporal feature modeling branch captures temporal dependencies of vehicle motion states based on bidirectional long short-term memory networks and self-attention mechanisms; the spatial interaction modeling branch utilizes graph convolutional networks and edge attention mechanisms to model dynamic spatial relationships between vehicles; the multi-scale feature fusion branch extracts multi-granularity contextual information through parallel multi-resolution convolutions and attention mechanisms. The model employs an adaptive weight fusion strategy to achieve optimal integration of features from the three branches. Experimental results on the NGSIM dataset demonstrate that MSTAN achieves prediction accuracies of 98.3%, 97.6%, and 95.2% for lane change behavior three-class classification tasks on the US-101, I-80, and Lankershim road segments, respectively, outperforming all baseline methods. Through branch contribution analysis and attention mechanism effectiveness evaluation, the independent contributions and synergistic effects of each functional module on model performance are quantitatively validated. Time window sensitivity analysis and prediction lead time analysis determine the optimal temporal modeling parameter configuration, demonstrating the model’s stability and robustness across different prediction scenarios.
文章引用:吴林蓉. 基于多尺度时空注意力机制的车辆换道意图预测[J]. 建模与仿真, 2025, 14(10): 263-276. https://doi.org/10.12677/mos.2025.1410622

参考文献

[1] 王庆荣, 韩芳文, 朱昌锋, 等. 频域-时空协同框架下的车辆轨迹预测及意图识别模型[J/OL]. 计算机工程与应用, 1-14.
https://link.cnki.net/urlid/11.2127.tp.20250721.1155.010, 2025-09-07.
[2] Zamanpour, M., He, S., Levin, M.W. and Sun, Z. (2025) Incorporating Lane-Change Prediction into Energy-Efficient Speed Control of Connected Autonomous Vehicles at Intersections. Transportation Research Part C: Emerging Technologies, 171, Article ID: 104968. [Google Scholar] [CrossRef
[3] Fafoutellis, P., Plymenos-Papageorgas, J. and Vlahogianni, E.I. (2022) Enhancing Lane Change Prediction at Intersections with Spatio-Temporal Adequacy Information. Journal of Big Data Analytics in Transportation, 4, 73-84. [Google Scholar] [CrossRef
[4] Nejadhossein Qasemabadi, A., Mozaffari, S., Ahmadi, M. and Alirezaee, S. (2025) Privacy-Preserving Lane Change Prediction Using Recurrent Neural Network with Secure Multiparty Computation. International Journal of Intelligent Transportation Systems Research, 23, 893-907. [Google Scholar] [CrossRef
[5] De Cristofaro, F., Hofbaur, F., Yang, A., et al. (2025) Prediction of Lane Change Intentions of Human Drivers Using an LSTM, a CNN and a Transformer.
[6] Deo, N. and Trivedi, M.M. (2018) Convolutional Social Pooling for Vehicle Trajectory Prediction. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, 18-22 June 2018, 1468-1476. [Google Scholar] [CrossRef
[7] 郭瑞军, 范超冉, 付明迪. 基于PSO-GRU模型的车辆换道意图识别[J/OL]. 吉林大学学报(工学版), 1-12. 2025-09-07. [Google Scholar] [CrossRef
[8] Kipf, T.N. and Welling, M. (2016) Semi-Supervised Classification with Graph Convolutional Networks.
[9] Veličković, P., Cucurull, G., Casanova, A., et al. (2017) Graph Attention Networks. arXiv:1710.10903.
[10] Krajewski, R., Bock, J., Kloeker, L. and Eckstein, L. (2018) The highD Dataset: A Drone Dataset of Naturalistic Vehicle Trajectories on German Highways for Validation of Highly Automated Driving Systems. 2018 21st International Conference on Intelligent Transportation Systems (ITSC), Maui, 4-7 November 2018, 2118-2125. [Google Scholar] [CrossRef
[11] 邹加, 侯康宁, 郑芳芳, 等. 车辆换道的时空影响分析: 交通安全、效率和油耗视角[J]. 交通运输工程与信息学报, 2025, 23(3): 171-184.
[12] 任立海, 康鈺泽, 刘煜, 等. 融合逻辑判断机制的CNN-GRU换道意图识别方法[J]. 湖南大学学报(自然科学版), 2025, 52(6): 69-77.
[13] Chen, T. and Guestrin, C. (2016) XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, 13-17 August 2016, 785-794. [Google Scholar] [CrossRef
[14] Sainath, T.N., Vinyals, O., Senior, A. and Sak, H. (2015) Convolutional, Long Short-Term Memory, Fully Connected Deep Neural Networks. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, 19-24 April 2015, 4580-4584. [Google Scholar] [CrossRef
[15] Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 6000-6010.
[16] Alahi, A., Goel, K., Ramanathan, V., Robicquet, A., Fei-Fei, L. and Savarese, S. (2016) Social LSTM: Human Trajectory Prediction in Crowded Spaces. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 961-971. [Google Scholar] [CrossRef