基于改进YOLOv5的矿石目标检测算法
Ore Target Detection Algorithm Based on Improved YOLOv5
DOI: 10.12677/JSTA.2023.116054, PDF,   
作者: 唐 雨, 刘祚时, 陶树林:江西理工大学电气工程与自动化学院,江西 赣州
关键词: 深度学习目标检测YOLOv5轻量化网络MobileNetv3 Deep Learning Target Detection YOLOv5 Lightweight Network MobileNetv3
摘要: 针对工业矿山生产线上工作量大,在进行目标定位检测时需兼具较高实时性,提出一种基于改进YOLOv5的目标检测算法。该算法在主干特征提取网络引入更轻量化的MobileNetv3,同时将PANet网络3 × 3的卷积块改进,提高了检测精度和效率;使用DIoU-NMS替代原模型中NMS的方式,提高矿石重叠时检测精度。改进后的算法的参数量减少了24.95%,mAP值达到了95.7%,帧数达到了166.7 FPS,检测单张图片的时间缩短了4 ms,表明该算法在矿石传送带上具有更强实用性和更高的检测精度。
Abstract: Aiming at the heavy workload in the industrial mine production line and the high real-time performance in the target positioning detection, a target detection algorithm based on improved YOLOv5 is proposed. The algorithm replaces the backbone feature extraction network CSPDarknet with the lighter MobileNetv3, and improves the 3 × 3 convolutional block of PANet network to improve the detection accuracy and efficiency. Using DIoU-NMS to replace the weighted NMS in the original model further reduces the number of parameters, improves the convergence of the model, and improves the detection accuracy when the targets overlap. The parameter number of the improved algorithm is reduced by 24.95%, the mAP value is 95.7%, the frame number is 166.7 FPS, and the detection time of a single image is shortened by 4 ms, which shows that the algorithm has stronger practicability and higher detection accuracy on the ore conveyor belt.
文章引用:唐雨, 刘祚时, 陶树林. 基于改进YOLOv5的矿石目标检测算法[J]. 传感器技术与应用, 2023, 11(6): 477-484. https://doi.org/10.12677/JSTA.2023.116054

参考文献

[1] 王鹏, 曹现刚, 夏晶, 等. 基于机器视觉的多机械臂煤矸石分拣机器人系统研究[J]. 工矿自动化, 2019, 45(9): 47-53. [Google Scholar] [CrossRef
[2] Redmon, J., Divvala, S., Girshick, R., et al. (2016) You Only Look Once: Unified, Real-Time Object Detection(Conference Paper). Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 779-788. [Google Scholar] [CrossRef
[3] Girshick, R., Donahue, J., Darrell, T., et al. (2014) Rich Feature Hi-erarchies for Accurate Object Detection and Semantic Segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 580-587. [Google Scholar] [CrossRef
[4] 张艳, 孙晶雪, 孙叶美, 等. 基于分割注意力与线性变换的轻量化目标检测[J]. 浙江大学学报(工学版), 2023, 57(6): 1195-1204.
[5] 王建才, 李瑾, 李志军, 等. 基于改进YOLOv5的煤岩图像检测识别研究[J]. 煤矿机械, 2022, 43(9): 204-208. [Google Scholar] [CrossRef
[6] 冒国韬, 邓天民, 于楠晶. 基于多尺度分割注意力的无人机航拍图像目标检测算法[J]. 航空学报, 2023, 44(5): 273-283.
[7] Harjoseputro, Y., Yuda, P.I. and Danukusumo, P.K. (2020) MobileNets: Efficient Convolutional Neural Network for Identification of Protected Birds. International Journal on Advanced Science, Engineering and Information Technology, 10, 2290-2296. [Google Scholar] [CrossRef
[8] Sandler, M., Howard, A., Zhu, M., et al. (2018) MobileNetV2: Inverted Residuals and Linear Bottlenecks (Conference Paper). Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 4510-4520. [Google Scholar] [CrossRef