一种基于YOLOv3的道路多目标检测方法
A Road Multi-Object Detection Method Based on YOLOv3
DOI: 10.12677/CSA.2021.111021, PDF,    国家自然科学基金支持
作者: 傅景超, 苏庆华*, 张娣娣, 李俊韬:北京物资学院,北京;北京市智能物流实验室,北京
关键词: YOLOv3道路场景多目标检测迁移学习实时检测YOLOv3 Road Scene Multi-Object Detection Transfer Learning Real-Time Detection
摘要: 道路场景中多目标检测对车辆的自动驾驶和辅助驾驶的智能化有着重要意义。现有道路目标检测算法存在检测精度低、实时性较差和目标漏检等问题。针对这些问题,本文构建一种基于YOLOv3的高精度、低时延、低漏检的道路多目标检测方法。通过对YOLOv3目标检测原理进行深入分析,基于迁移学习(Transfer Learning)的思想,在经过预训练的模型上仅使用Pascal VOC 2007中道路场景常见类别数据对模型进行训练,通过调整学习策略,利用较小的训练集和较少的训练轮次可以获得实时性强、精度较高的目标检测模型,单张图片检测时间只需0.04秒,在测试集上mAP (mean Average Precision)达到了91.5%,实验证明本文方法的有效性,该方法在精度、时延和漏检方面取得了较好的效果。
Abstract: Multi-object detection in road scenes is of great significance for automatic driving and intelligent driving assistance of vehicles. The existing road object detection algorithms have some problems, such as low detection accuracy, poor real-time performance and missing target detection. To solve these problems, this paper constructs a road multi-object detection method based on YOLOv3 with high precision, low delay and low omission. Through indepth analysis of the YOLOv3 object detection principle, based on the idea of Transfer Learning, the YOLOv3 model was trained on the pretrained model using only the common category data of Pascal VOC 2007 (Pascal Visual Object Classes 2007) in the road scenes. By adjusting the learning strategy and using smaller training set and fewer training epochs, a target detection model with strong real-time performance and high accuracy can be obtained. The single image detection time is only 0.04 seconds, and the mean Average Precision on the test set reaches 91.5%. The experimental results show that the proposed method is effective and achieves good results in precision, delay and omission detection.
文章引用:傅景超, 苏庆华, 张娣娣, 李俊韬. 一种基于YOLOv3的道路多目标检测方法[J]. 计算机科学与应用, 2021, 11(1): 207-216. https://doi.org/10.12677/CSA.2021.111021

参考文献

[1] 徐建亮, 周明安, 毛建辉, 方坤礼. 基于一种卷积神经式类网络的实时人脸识别方法研究[J]. 计算机科学与应用, 2020, 10(1): 11-20.
[2] 朱波, 黄茂飞, 谈东奎, 等. 基于神经网络与数据融合的行人检测方法[J]. 汽车工程, 2020(11): 37-44.
[3] 彭育辉, 郑玮鸿, 张剑锋. 基于深度学习的道路障碍物检测方法[J]. 计算机应用, 2020, 40(8): 2428-2433.
[4] 李冬伟, 孙卓, 常书林. 基于低分辨率车牌识别系统的应用研究[J]. 计算机科学与应用, 2020, 10(4): 721-731.
[5] Andreas, N. and Hertzberg, J. (2008) Towards Semantic Maps for Mobile Robots. Robotics & Autonomous Systems, 56, 915-926. [Google Scholar] [CrossRef
[6] Cadena, C., Carlone, L., Carrillo, H., Latif, Y., Scaramuzza, D., et al. (2016) Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age. IEEE Transactions on Robotics, 32, 1309-1332. [Google Scholar] [CrossRef
[7] Lowe, D.G. (2004) Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60, 91-110. [Google Scholar] [CrossRef
[8] Dalal, N. and Triggs, B. (2005) Histograms of Oriented Gradients for Human Detection. IEEE Computer Society Conference on Computer Vision & Pattern Recognition, San Diego, 20-26 June 2005, 886-893. [Google Scholar] [CrossRef
[9] Lecun, Y., Bengio, Y. and Hinton, G. (2015) Deep Learning. Nature, 521, 436. [Google Scholar] [CrossRef] [PubMed]
[10] Zhao, Z.Q., Zheng, P., Xu, S.T. and Wu, X. (2018) Object Detection with Deep Learn-ing: A Review.
https://arxiv.org/abs/1807.05511
[11] Redmon, J., Divvala, S., Girshick, R. and Farhadi, A. (2016) You Only Look Once: Uni-fied, Real-Time Object Detection. IEEE Computer Vision & Pattern Recognition, Las Vegas, 26 June-1 July 2016, 779-788. [Google Scholar] [CrossRef
[12] Felzenszwalb, P., et al. (2010) Object Detection with Discriminatively Trained Part-Based Models. IEEE Transactions on Pattern Analysis & Machine Intelligence, 32, 1627-1645. [Google Scholar] [CrossRef
[13] Girshick, R., Donahue, J., Darrell, T. and Malik, J. (2014) Rich Feature Hierar-chies for Accurate Object Detection and Semantic Segmentation. IEEE CVPR, Columbus, 24-27 June 2014, 580-587. [Google Scholar] [CrossRef
[14] Girshick, R. (2015) Fast R-CNN. IEEE International Conference on Computer Vi-sion (ICCV), Santiago, 1440-1448. [Google Scholar] [CrossRef
[15] Ren, S., He, K., Girshick, R. and Sun, J. (2017) Faster r-cnn: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis & Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef
[16] Redmonand, J. and Farhadi, A. (2018) Yolov3: An Incremental Improvement.
https://arxiv.org/pdf/1804.02767
[17] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recogni-tion. IEEE Conference on Computer Vision & Pattern Recognition, Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[18] Pan, S.J. and Yang, Q. (2010) A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering, 22, 1345-1359. [Google Scholar] [CrossRef
[19] Kingma, D. and Ba, J. (2014) Adam: A Method for Stochastic Optimization.
https://arxiv.org/abs/1412.6980