基于改进YOLOv5用于无人机图像的实时车辆检测
Real-Time Vehicle Detection Based on Improved YOLOv5 for Drone Images
DOI: 10.12677/SEA.2023.125068, PDF,  被引量   
作者: 何佳桥, 葛晓东:上海理工大学光电信息与计算机工程学院,上海
关键词: 车辆检测YOLOv5DSConv自适应损失函数Vehicle Detection YOLOv5 DSConv Adaptive Loss Function
摘要: 如今,无人机(UAV)逐渐被用于各个领域,例如交通监控和智能停车,其中对车辆的实时监测和分类是关键任务之一。在车辆检测方面存在许多挑战,例如小型车辆目标和无人机工作时飞行角度的变化导致目标尺度变化从而给车辆检测网络模型的优化带来的负担。此外,由于高空飞行时航拍图像目标较小、可提取特征较少,而导致模型检测精度较低。为解决上述问题,本文旨在基于YOLOv5算法,提出了一种准确、高效、实时的车辆检测网络。首先,为了使模型更好地提取小目标特征,在Neck部分添加了一个新的连接层,将第一个C3层的高分辨率特征映射连接到Neck部分。其次,为使模型更加专注小目标,添加一个步幅为4的输出层作为新的头部。同时优化模型对较大车辆目标的检测较少,我们去除掉了Head中输出特征图为20 × 20的检测头。同时,考虑到模型的推理速度,将Neck部分的C3模块替换为更轻量化的DS_C3模块。最后,为了进一步提高基于IOU损失函数的性能,将CIOU替换为α-IOU。本文使用VisDrone2019数据集,并基于改进算法和原始算法分别进行了实验,结果表明,本文的算法能够对对小目标进行有效的实时检测。
Abstract: Today, unmanned aerial vehicles (UAVs) are gradually being used in various fields, such as traffic monitoring and smart parking, where real-time monitoring and classification of vehicles is one of the key tasks. There are many challenges in vehicle detection, such as changes in flight angle when small vehicle targets and drones are working, resulting in changes in target scale, which puts a burden on the optimization of vehicle detection network models. In addition, due to the small aerial image target and fewer extractable features during high-altitude flight, the model detection accuracy is low. To solve the above problems, this paper aims to propose an accurate, efficient and real-time vehicle detection network based on the YOLOv5 algorithm. First, in order to make the model better extract small target features, a new connection layer is added to the Neck part, connecting the high-resolution feature mapping of the first C3 layer to the Neck section. Second, to make the model more focused on small targets, an output layer with a stride length of 4 is added as a new head. At the same time, the optimization model has less detection of larger vehicle targets, and we remove the detection head with an output feature map of 20 × 20 in Head. At the same time, considering the inference speed of the model, the C3 module of the Neck part was replaced with a more lightweight DS_C3 module. Finally, to further improve the performance of the IOU-based loss function, the CIOU is replaced with a α-IOU. This paper uses the VisDrone2019 dataset and conducts experiments based on the improved algorithm and the original algorithm, and the results show that the algorithm in this paper can effectively detect small targets in real time.
文章引用:何佳桥, 葛晓东. 基于改进YOLOv5用于无人机图像的实时车辆检测[J]. 软件工程与应用, 2023, 12(5): 705-713. https://doi.org/10.12677/SEA.2023.125068

参考文献

[1] Srivastava, S., Narayan, S. and Mittal, S. (2021) A Survey of Deep Learning Techniques for Vehicle Detection from UAV Images. Journal of Systems Architecture, 117, Article ID: 102152. [Google Scholar] [CrossRef
[2] Zhu, X., Lyu, S., Wang, X. and Zhao, Q. (2021) Tph-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-Captured Scenarios. 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, 11-17 October 2021, 2778-2788. [Google Scholar] [CrossRef
[3] Mahaur, B. and Mishra, K. (2023) Small-Object Detection Based on YOLOv5 in Autonomous Driving Systems. Pattern Recognition Letters, 168, 115-122. [Google Scholar] [CrossRef
[4] Wang, T., Ma, Z., Yang, T. and Zou, S. (2023) PETNet: A Yolo-Based Prior Enhanced Transformer Network for Aerial Image Detection. Neurocomputing, 547, Article ID: 126384. [Google Scholar] [CrossRef
[5] Li, R. and Shen, Y. (2023) YOLOSR-IST: A Deep Learning Method for Small Target Detection in Infrared Remote Sensing Images Based on Super-Resolution and YOLO. Signal Processing, 208, Article ID: 108962. [Google Scholar] [CrossRef
[6] Huang, Y., He, J., Liu, G., Li, D., Hu, R., Hu, X. and Bian, D. (2023) YOLO-EP: A Detection Algorithm to Detect Eggs of Pomacea canaliculata in Rice Fields. Ecological Informatics, 77, Article ID: 102211. [Google Scholar] [CrossRef
[7] Cui, M., Lou, Y., Ge, Y. and Wang, K. (2023) LES-YOLO: A Lightweight Pinec-Onedetection Algorithm Based on Improved YOLOv4-Tiny Network. Computers and Electronics in Agriculture, 205, Article ID: 107613. [Google Scholar] [CrossRef
[8] Zhou, J., Jiang, P., Zou, A., et al. (2021) Ship Target Detection Algorithm Based on Improved YOLOv5. Journal of Marine Science and Engineering, 9, Article 908. [Google Scholar] [CrossRef
[9] Girshick, R.B., Donahue, J., Darrell, T. and Malik, J. (2013) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 580-587. [Google Scholar] [CrossRef
[10] Girshick, R.B. (2015) Fast R-CNN. Proceedings of 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, 7-13 December 2015, 1440-1448. [Google Scholar] [CrossRef
[11] Ren, S., He, K., Girshick, R.B. and Sun, J. (2015) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef
[12] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.E., Fu, C. and Berg, A.C. (2015) SSD: Single Shot MultiBox Detector. In: Leibe, B., Matas, J., Sebe, N. and Welling, M., Eds., ECCV 2016: Computer Vision—ECCV 2016, Springer, Cham, 21-37. [Google Scholar] [CrossRef
[13] Redmon, J., Divvala, S.K., Girshick, R.B. and Farhadi, A. (2015) You Only Look Once: Unified, Real-Time Object Detection. Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 779-788. [Google Scholar] [CrossRef
[14] Redmon, J. and Farhadi, A. (2018) YOLOv3: An Incremental Improvement.
https://arxiv.org/abs/1804.02767
[15] Bochkovskiy, A., Wang, C. and Liao, H.M. (2020) YOLOv4: Optimal Speed and Accuracy of Object Detection.
https://arxiv.org/abs/2004.10934
[16] Do Nascimento, M., Fawcett, R. and Prisacariu, V. (2019) DSConv: Efficient Convolution Operator. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, 27 October-2 November 2019, 5147-5156. [Google Scholar] [CrossRef