基于骨架数据的列车司机异常行为检测
Detection of Abnormal Behavior of Train Drivers Based on Skeleton Data
摘要: 针对列车司机异常行为检测时存在准确率以及效率低的问题,在时空图卷积网络(Spatial Temporal Graph Convolutional Network, ST-GCN)行为检测模型的基础上,提出一种基于骨架数据的列车司机异常行为检测(Spatiotemporal Graph Attention and Multi-Time Scale Temporal Convolutional Network, ST-GAT)模型。通过利用骨架数据进行建模,并引入图注意力网络(Graph Attention Network, GAT)模块,利用动态关注机制以提升ST-GCN模型对空间特征的提取能力。通过提出的多时间尺度时域卷积网络(Multi-Scale Temporal Convolutional Network,MS-TCN)模块进行不同时间尺度下的时序提取特征,扩大时域卷积网络(Temporal Convolutional Network, TCN)模块感受野范围,克服原始TCN模块不灵活、无法检测不同时间尺度的问题,提高了模型的准确率,同时使用交叉熵损失函数克服了模型训练过程收敛速度慢的问题。实验结果表明,本文方法在测试集上比ST-GCN模型准确率提高8.8%,FLOPS提高2%。因此,所提出的方法在提高列车司机异常行为检测的准确率和效率方面表现出较好的性能。
Abstract: Aiming at the challenges of low detection accuracy and efficiency in train driver abnormal behavior detection, this study proposes a model based on skeleton data, named Spatiotemporal Graph Attention and Multi-Time Scale Temporal Convolutional Network (ST-GAT), built upon the Spatial Temporal Graph Convolutional Network (ST-GCN) behavior detection model. By leveraging skeleton data for modeling, the model introduces a Graph Attention Network (GAT) module with a dynamic attention mechanism to enhance the ST-GCN model’s extraction of spatial features. Additionally, the model incorporates a Multi-Scale Temporal Convolutional Network (MS-TCN) module to extract temporal features at different scales, addressing the flexibility and scale limitations of the Temporal Convolutional Network (TCN) module in ST-GCN. To improve training efficiency, the model adopts the cross-entropy loss function to expedite convergence. Experimental results demonstrate an 8.8% improvement in accuracy and a 2% increase in FLOPS on the test set compared to the ST-GCN model. Therefore, the proposed method exhibits favorable performance in enhancing both the accuracy and efficiency of abnormal behavior detection.
文章引用:朱高伟. 基于骨架数据的列车司机异常行为检测[J]. 计算机科学与应用, 2024, 14(7): 42-50. https://doi.org/10.12677/csa.2024.147162

参考文献

[1] 秦斌斌, 彭良康, 卢向明, 等. 司机分心驾驶检测研究进展[J]. 计算机应用, 2021, 41(8): 2330-2337.
[2] 肖致明, 刘文, 杨建芳, 等. 心理状态对火车司机安全绩效影响综述[J]. 中国安全科学学报, 2022, 32(S2): 13-18.
[3] 王洪涛, 殷浩钧, 陈创泉, 等. 基于脑电信号的驾驶疲劳检测综述[J]. 华中科技大学学报(自然科学版), 2022, 50(11): 54-65.
[4] Liu, L., Ji, Y.F., Gao, Y., et al. (2021) A Novel Fatigue Driving State Recognition and Warning Method Based on EEG and EOG Signals. Journal of Healthcare Engineering, 2021, Article ID: 7799793. [Google Scholar] [CrossRef] [PubMed]
[5] 惠飞, 郭静, 贾硕, 等. 基于双向长短记忆网络的异常驾驶行为检测[J]. 计算机工程与应用, 2020, 56(24): 116-122.
[6] 李小平, 白超. 一种基于多模态信息融合的火车司机疲劳驾驶检测方法[J]. 铁道学报, 2022, 44(6): 56-65.
[7] 栗志磊, 李俊, 施智平, 等. 用于视频行为识别的高效二维时序建模网络[J]. 计算机工程与应用, 2023, 59(3): 127-134.
[8] 张冰冰, 葛疏雨, 王旗龙, 等. 基于多阶信息融合的行为识别方法研究[J]. 自动化学报, 2021, 47(3): 609-619.
[9] Dai, C., Liu, X. and Lai, J. (2020) Human Action Recognition Using Two-Stream Attention Based LSTM Networks. Applied Soft Computing, 86, Article 105820. [Google Scholar] [CrossRef
[10] 甄昊宇, 张德. 结合自适应图卷积与时态建模的骨架动作识别[J]. 计算机工程与应用, 2023, 59(18): 137-144.
[11] 孙晓虎, 余阿祥, 申栩林, 等. 混合注意力机制的异常行为识别[J]. 计算机工程与应用, 2023, 59(5): 140-147.
[12] Zhao, Y., Man, K.L., Smith, J., et al. (2020) Improved Two-Stream Model for Human Action Recognition. EURASIP Journal on Image and Video Processing, 2020, Article No. 24. [Google Scholar] [CrossRef
[13] Zhang, H.B., Zhang, Y.X., Zhong, B., et al. (2019) A Comprehensive Survey of Vision-Based Human Action Recognition Methods. Sensors, 19, Article 1005. [Google Scholar] [CrossRef] [PubMed]
[14] 张晓平, 纪佳慧, 王力, 等. 基于视频的人体异常行为识别与检测方法综述[J]. 控制与决策, 2022, 37(1): 14-27.
[15] Yu, C., Xiao, B., Gao, C., et al. (2021) Lite-HRNet: A Lightweight High-Resolution Network. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 10440-10450. [Google Scholar] [CrossRef
[16] Li, C., Zhong, Q., Xie, D., et al. (2017) Skeleton-Based Action Recognition with Convolutional Neural Networks. 2017 IEEE International Conference on Multimedia & Expo Workshops(ICMEW), Hong Kong, 10-14 July 2017, 597-600.
[17] 尹智帅, 钟恕, 聂琳真, 等. 基于人体姿态估计的分心驾驶行为检测[J]. 中国公路学报, 2022, 35(6): 312-323.
[18] Morais, R., Le, V., Tran, T., et al. (2019) Learning Regularity in Skeleton Trajectories for Anomaly Detection in Videos. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 15-20 June 2019, 11996-12004. [Google Scholar] [CrossRef
[19] Song, S., Lan, C., Xing, J.L., et al. (2017) An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data. Proceedings of the AAAI Conference on Artificial Intelligence, San Francisco, 4-9 February 2017, 4263-4270. [Google Scholar] [CrossRef
[20] Yan, S.J., Xiong, Y.J. and Lin, D.H. (2018) Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, 2-7 February 2018, 7444-7452. [Google Scholar] [CrossRef
[21] 刘宝龙, 周森, 董建锋, 等. 基于骨架的人体动作识别技术研究进展[J]. 计算机辅助设计与图形学学报, 2023, 35(9): 1299-1322.
[22] 钱慧芳, 易剑平, 付云虎. 基于深度学习的人体动作识别综述[J]. 计算机科学与探索, 2021, 15(3): 438-455.
[23] Sun, K., Xiao, B., Liu, D., et al. (2019) Deep High-Resolution Representation Learning for Human Pose Estimation. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, 15-20 June 2019, 5693-5703. [Google Scholar] [CrossRef
[24] Velickovic, P., Cucurull, G., Casanova, A., et al. (2017) Graph Attention Networks. arXiv:1710.10903.
[25] 邓淼磊, 高振东, 李磊, 等. 基于深度学习的人体行为识别综述[J]. 计算机工程与应用, 2022, 58(13): 14-26.
[26] 肖振久, 孔祥旭, 宗佳旭, 等. 自适应聚焦损失的图像目标检测算法[J]. 计算机工程与应用, 2021, 57(23): 185-192.
[27] 孟宪法, 刘方, 李广, 等. 卷积神经网络压缩中的知识蒸馏技术综述[J]. 计算机科学与探索, 2021, 15(10): 1812-1829.
[28] 张硕, 叶勤, 史婧, 等. 改进RangeNet++损失函数的车载点云小目标语义分割方法[J]. 计算机辅助设计与图形学学报, 2021, 33(5): 704-711.
[29] 刘炎, 熊雨露, 褚菲, 等. 基于多源异构信息自适应融合的镁砂熔炼过程运行状态评价[J]. 仪器仪表学报, 2023, 44(9): 322-337.