基于红外热像的YOLOV8n-TOD列车障碍物检测算法
YOLOV8n-TOD Algorithm for Train Obstacle Detection Based on Infrared Thermal Imaging
DOI: 10.12677/mos.2025.141099, PDF,    科研立项经费支持
作者: 赵守俊*, 张轩雄:上海理工大学光电信息与计算机工程学院,上海;陈 嘉:上海申通地铁集团有限公司技术中心,上海;谢兰欣:苏州同睿兴科技有限公司,江苏 苏州
关键词: EIS Farneback Lucas-Kanade MobileNetV3 FasterBlockEIS Farneback Lucas-Kanade MobileNetV3 FasterBlock
摘要: 热成像可在低光环境下检测障碍物。针对列车颠簸影响图像质量的问题,基于ORB特征提取算法与Farneback、Lucas-Kanade光流法加权平均设计一种EIS算法,对采集的数据进行EIS及CLAHE预处理。同时,针对红外图像低分辨率、高噪声敏感性的问题,提出一种列车障碍物检测算法YOLOV8n-TOD,该算法从3个方面进行改进:在YOLOV8n算法中使用MobileNetV3网络替换原主干,通过轻量级结构和深度可分离卷积操作提高算法的计算效率和特征提取能力;在颈部网络中使用FasterBlock网络重构C2f模块,优化特征融合及增强信息传递,提高算法的稳定性与检测精度;优化CIOU损失函数,提高算法的泛化能力。测试结果显示:经预处理后YOLOV8n算法的mAP提高了2.4%;采用YOLOV8n-TOD算法后mAP又提升了7.2%,显著增强了障碍物检测能力。
Abstract: Thermal imaging can detect obstacles in low-light environments. To address the issue of image quality degradation caused by train vibrations, an EIS algorithm has been designed based on ORB feature extraction and weighted averaging using Farneback and Lucas-Kanade optical flow methods. The collected data undergoes EIS and preprocessing with CLAHE. To address the low resolution and high noise sensitivity of infrared images, a train obstacle detection algorithm, YOLOv8n-TOD, is proposed. The algorithm enhances YOLOv8n in three ways: replacing the original backbone with MobileNetV3 for efficient feature extraction using its lightweight structure and depthwise separable convolutions; by using FasterBlock networks to reconstruct the C2f module in the neck network, optimizing feature fusion and enhancing information transfer to improve model stability and detection Accuracy; and by refining the CIOU loss function to boost model generalization capability. Experimental results show that after preprocessing, the mAP of the YOLOV8n algorithm increased by 2.4%; with the YOLOV8n-TOD model, the mAP further improved by 7.2%, significantly enhancing obstacle detection performance.
文章引用:赵守俊, 陈嘉, 谢兰欣, 张轩雄. 基于红外热像的YOLOV8n-TOD列车障碍物检测算法[J]. 建模与仿真, 2025, 14(1): 1086-1099. https://doi.org/10.12677/mos.2025.141099

参考文献

[1] 沈拓, 钱沿佐, 谢兰欣, 等. 考虑反射强度的全自动运行列车障碍物检测算法研究[J]. 同济大学学报(自然科学版), 2022, 50(1): 6-12.
[2] Shen, T., Zhou, J., Yuan, T., Xie, Y. and Zhang, X. (2024) Lidar-Based Urban Three-Dimensional Rail Area Extraction for Improved Train Collision Warnings. Sensors, 24, Article 4963. [Google Scholar] [CrossRef] [PubMed]
[3] 陈钱, 隋修宝. 红外图像处理理论与技术[M]. 北京: 电子工业出版社, 2018: 7-11.
[4] 师帅. 轨道交通系统主动障碍物检测研究综述[J]. 机电工程技术, 2021, 50(6): 212-216.
[5] 崔晗. 基于热成像视频的铁路行人闯入检测系统[D]: [硕士学位论文]. 北京: 北京邮电大学, 2022.
[6] 许鑫龙. 基于目标增强融合的铁路异物侵限检测方法研究[D]: [硕士学位论文]. 北京: 北京交通大学, 2020.
[7] 孙永丽. 基于图像的铁路障碍物自动检测算法研究[D]: [硕士学位论文]. 西安: 西安电子科技大学, 2018.
[8] 杜开华, 许贵阳, 白堂博. 基于特征降冗余的Vanilla-YOLOv8铁路异物侵限检测方法[J/OL]. 北京交通大学学报, 1-12.
http://kns.cnki.net/kcms/detail/11.5258.U.20240903.1206.002.html, 2024-09-09.
[9] 吴浩楠, 史宏, 王瑞, 等. 基于改进YOLO v8的铁路人员入侵检测方法研究[J/OL]. 铁道科学与工程学报, 1-12. 2024-09-09.[CrossRef
[10] 王辉, 姜朱丰, 吴雨杰, 等. 基于深度学习的铁路异物侵限快速检测方法[J]. 铁道科学与工程学报, 2024, 21(5): 2086-2098.
[11] 李建国, 陈敬涛, 张伟, 等. 基于改进型SSD算法的铁路货场异物侵限小目标检测研究[J]. 铁道通信信号, 2024, 60(7): 57-62.
[12] 司全龙, 施婷. 基于超声波视频检测技术的列车运行异物检测系统研究[J]. 办公自动化, 2023, 28(4): 59-61+41.
[13] 何幸, 黄永明, 朱勇. 基于改进YOLOv5的路面坑洼检测方法[J]. 电子科技, 2024, 37(7): 53-59.
[14] 王小铸, 于莲芝. 基于卷积与自注意力聚合的小目标检测[J]. 电子科技, 2024, 37(2): 14-22.
[15] Rublee, E., Rabaud, V., Konolige, K. and Bradski, G. (2011) ORB: An Efficient Alternative to SIFT or Surf. 2011 Inter-national Conference on Computer Vision, Barcelona, 6-13 November 2011, 2564-2571. [Google Scholar] [CrossRef
[16] Farnebäck, G. (2003) Two-Frame Motion Estimation Based on Polynomial Expansion. In: Lecture Notes in Computer Science, Springer, 363-370. [Google Scholar] [CrossRef
[17] Baker, S. and Matthews, I. (2004) Lucas-Kanade 20 Years On: A Unifying Framework. International Journal of Computer Vision, 56, 221-255. [Google Scholar] [CrossRef
[18] Varghese, R. and M., S. (2024) YOLOv8: A Novel Object Detection Algorithm with Enhanced Performance and Robustness. 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS), Chennai, 18-19 April 2024, 1-6. [Google Scholar] [CrossRef
[19] Howard, A., Sandler, M., Chen, B., Wang, W., Chen, L., Tan, M., et al. (2019) Searching for MobileNetV3. 2019 IEEE/ CVF International Conference on Computer Vision (ICCV), Seoul, 27 October-2 November 2019, 1314-1324. [Google Scholar] [CrossRef
[20] Chen, J., Kao, S., He, H., Zhuo, W., Wen, S., Lee, C., et al. (2023) Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 12021-12031. [Google Scholar] [CrossRef
[21] Zheng, Z., Wang, P., Ren, D., Liu, W., Ye, R., Hu, Q., et al. (2022) Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation. IEEE Transactions on Cybernetics, 52, 8574-8586. [Google Scholar] [CrossRef] [PubMed]
[22] Tagiew, R., Klasek, P., Tilly, R., Köppel, M., Denzler, P., Neumaier, P., et al. (2023) OSDaR23: Open Sensor Data for Rail 2023. 2023 8th International Conference on Robotics and Automation Engineering (ICRAE), Singapore, 17-19 November 2023, 270-276. [Google Scholar] [CrossRef