YOLO-AF:结合注意力机制和焦点卷积的垃圾堆检测模型
YOLO-AF: A Garbage Heap Detection Model Combining Attention Mechanism and Focus Convolution
DOI: 10.12677/CSA.2024.142047, PDF,   
作者: 宋 琦, 苑春苗, 曹立昆, 黄臣臣:天津工业大学软件学院,天津;杨清永*:天津中德应用技术大学软件与通信学院,天津
关键词: 垃圾堆检测YOLOv7算法注意力机制焦点模块特征提取Garbage Heap Detection YOLOv7 Algorithm Attention Mechanism Focus Module Feature Extraction
摘要: 随着城镇化进程的不断加速,垃圾堆成为城市管理的重要问题,智能化的垃圾堆检测算法,可以有效提高环境管理的水平和效率。在YOLOv7检测算法的基础上,通过引入注意力机制、改进MPConv模块以及优化损失函数对模型进行改进,提出了YOLO-AF垃圾堆检测模型。引入注意力机制和使用焦点模块增强特征信息解决垃圾堆和背景相似导致检测精度低问题,综合利用二元交叉熵损失函数和Kullback-Leibler散度损失函数解决垃圾分类不平衡问题。实验结果表明,在多类型垃圾堆检测任务中,YOLO-AF垃圾堆检测模型相较于YOLOv7表现出更高的精度、召回率和mAP,性能得到显著提升,训练300轮次的模型mAP值为92.45%。在同等条件下,检测效果优于当前主流的目标检测算法,可以较好地满足垃圾堆实时工业检测的需求。
Abstract: With the continuous acceleration of urbanization, garbage heaps have become an important problem in urban management, and intelligent garbage heap detection algorithms can effectively im-prove the level and efficiency of environmental management. On the basis of the YOLOv7 detection algorithm, the YOLO-AF garbage heap detection model was proposed by introducing the attention mechanism, improving the MPConv module and optimizing the loss function. The attention mechanism is introduced and the focus module is used to enhance the feature information to solve the problem of low detection accuracy caused by the similarity of garbage and background. The binary cross entropy loss function and Kullback-Leibler divergence loss function are comprehensively used to solve the unbalanced problem of garbage classification. Experimental results show that the YOLO-AF garbage heap detection model shows higher accuracy, recall rate and mAP than YOLOv7 in multi-type garbage heap detection tasks, and the performance is significantly improved, and the mAP value of the model after 300 rounds of training is 92.45%. Under the same conditions, the de-tection effect is better than the current mainstream object detection algorithm, which can better meet the needs of real-time industrial detection of garbage heaps.
文章引用:宋琦, 苑春苗, 曹立昆, 黄臣臣, 杨清永. YOLO-AF:结合注意力机制和焦点卷积的垃圾堆检测模型[J]. 计算机科学与应用, 2024, 14(2): 468-479. https://doi.org/10.12677/CSA.2024.142047

参考文献

[1] Wang, Y. and Zhang, X. (2018) Autonomous Garbage Detection for Intelligent Urban Management. MATEC Web of Conferences, 232, Article 01056. [Google Scholar] [CrossRef
[2] Zeng, D., Zhang, S., Chen, F., et al. (2019) Multi-Scale CNN Based Garbage Detection of Airborne Hyperspectral Data. IEEE Access, 7, 104514-104527. [Google Scholar] [CrossRef
[3] Patel, D., Patel, F., Patel, S., et al. (2021) Garbage Detection Using Advanced Object Detection Techniques. 2021 International Conference on Artificial Intelli-gence and Smart Systems (ICAIS), Coimbatore, 25-27 March 2021, 526-531. [Google Scholar] [CrossRef
[4] Zhao, H., Zhang, H. and Zhao, Y. (2023) Yolov7-Sea: Object Detection of Maritime UAV Images Based on Improved Yolov7. Proceedings of the IEEE/CVF Winter Confer-ence on Applications of Computer Vision Workshops, Waikoloa, HI, 3-7 January 2023, 233-238. [Google Scholar] [CrossRef
[5] Mittal, G., Yagnik, K.B., Garg, M., et al. (2016) Spot-garbage: Smartphone App to Detect Garbage Using Deep Learning. Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, Heidelberg, 12-16 September 2016, 940-945. [Google Scholar] [CrossRef
[6] Camero, A., Toutouh, J., Ferrer, J., et al. (2019) Waste Generation Prediction in Smart Cities through Deep Neuroevolution. Ibero-American Congress of Smart Cities, ICSC-CITIES 2018, Soria, 26-27 September 2018, 192-204. [Google Scholar] [CrossRef
[7] Ruiz, V., Sánchez, Á., Vélez, J.F., et al. (2019) Automatic Image-Based Waste Classification. International Work-Conference on the Interplay between Natural and Artificial Com-putation, Almería, 3-7 June 2019, 422-431. [Google Scholar] [CrossRef
[8] Anjum, M. and Umar, M.S. (2018) Garbage Localization Based on Weakly Supervised Learning in Deep Convolutional Neural Network. 2018 International Conference on Ad-vances in Computing, Communication Control and Networking (ICACCCN), Greater Noida, 12-13 October 2018, 1108-1113. [Google Scholar] [CrossRef
[9] Aleena, V.J., Balakrishnan, K., Rosmi, T.B., et al. (2016) Automatic Waste Segregator and Monitoring System. Journal of Microcontroller Engineering and Applica-tions, 3, 1-7.
[10] Singh, A., Aggarwal, P. and Arora, R. (2016) IoT Based Waste Collection System Using Infrared Sensors. 2016 5th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, 7-9 September 2016, 505-509. [Google Scholar] [CrossRef
[11] Mahankali, S., Kabbin, S.V., Nidagundi, S., et al. (2018) Identification of Illegal Garbage Dumping with Video Analytics. 2018 International Conference on Advances in Compu-ting, Communications and Informatics (ICACCI), Bangalore, 19-22 September 2018, 2403-2407. [Google Scholar] [CrossRef
[12] Redmon, J., Divvala, S., Girshick, R., et al. (2016) You Only Look Once: Unified, Real-Time Object Detection. IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 779-788. [Google Scholar] [CrossRef
[13] Ren, S.Q., He, K.M., Girshick, R., et al. (2017) Faster R-CNN: To-wards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Ma-chine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef
[14] Chao, X., Hu, X., Feng, J., et al. (2021) Construction of Apple Leaf Diseases Identification Networks Based on Xception Fused by SE Module. Applied Sciences, 11, Article 4614. [Google Scholar] [CrossRef
[15] Jiang, Y., Chen, L., Zhang, H., et al. (2019) Breast Cancer Histopatho-logical Image Classification Using Convolutional Neural Networks with Small SE-ResNet Module. PLOS ONE, 14, e0214587. [Google Scholar] [CrossRef] [PubMed]
[16] Zheng, Y., Cui, Y. and Gao, X. (2023) An Infrared Dim-Small Target Detection Method Based on Improved YOLOv7. Proceedings of the 2023 Asia Conference on Computer Vision, Image Processing and Pattern Recognition, Phuket, 28-30 April 2023, 1-5. [Google Scholar] [CrossRef
[17] Weng, Z., Hu, R. and Zheng, Z. (2023) Study on Individual Identi-fication Method of Cow Based on CD-YOLOv7. Proceedings of the 2023 3rd International Conference on Bioinfor-matics and Intelligent Computing, Sanya, 10-12 February 2023, 169-175. [Google Scholar] [CrossRef
[18] Da Cruz, L.B., Júnior, D.A.D., Diniz, J.O.B., et al. (2022) Kidney Tumor Segmentation from Computed Tomography Images Using DeepLabv3+ 2.5D Model. Expert Systems with Appli-cations, 192, Article 116270. [Google Scholar] [CrossRef
[19] Ruby, U. and Yendapalli, V. (2020) Binary Cross Entropy with Deep Learning Technique for Image Classification. International Journal of Advanced Trends in Computer Science and Engineering, 9, 5393-5397. [Google Scholar] [CrossRef
[20] Van Erven, T. and Harremos, P. (2014) Rényi Divergence and Kullback-Leibler Divergence. IEEE Transactions on Information Theory, 60, 3797-3820. [Google Scholar] [CrossRef
[21] Fang, J., et al. (2022) Single-Target Detection of Oncomelania hu-pensis Based on Improved YOLOv7s. Frontiers in Bioengineering and Biotechnology, 10, Article 861079. [Google Scholar] [CrossRef] [PubMed]