YOLO-SSDA——基于改进YOLOv9的吸烟行为检测方法
YOLO-SSDA—Smoking Behavior Detection Method Based on Improved YOLOv9
DOI: 10.12677/csa.2024.149187, PDF,   
作者: 何超勋, 陈智霖, 黄声勇, 彭道福:广东省汕尾市城区香洲头汕尾供电局,广东 汕尾
关键词: 吸烟检测YOLOv9NMS算法SimAM距离关联Smoking Detection YOLOv9 NMS Algorithm SimAM Distance Association
摘要: 针对复杂危险环境下传统吸烟行为检测方法成本较高、实时性较差、识别率低、容易漏检和小目标误判率高等问题,提出一种基于改进YOLOv9的吸烟行为检测算法YOLO-SSDA。YOLO-SSDA通过引入注意力机制模块SimAM,加强骨干网络对香烟特征的提取能力;利用Soft-NMS算法避免在过滤重叠定位框时出现漏检可能;利用距离关联算法计算香烟与人体之间的距离关系,减少香烟识别误判率。本文通过网络采样、AI生成与现场采集等方式构建了实验数据集并进行了大量实验测试,实验结果表明:YOLO-SSDA算法准确率为94%,召回率为89.6%,mAP50值为94.4%,处理单振图像输入的平均耗时为39 ms,与其他算法相比具有较高的检测准确率和实时性能。
Abstract: Aiming at the problems of high cost, poor real-time performance, low recognition rate, easy missed detection, and high misjudgment rate of small targets in traditional smoking behavior detection methods in complex and dangerous environments, a smoking behavior detection algorithm YOLO-SSDA based on improved YOLOv9 is proposed. By introducing the attention mechanism module SimAM, the backbone network’s ability to extract cigarette features is strengthened; using Soft NMS algorithm to avoid the possibility of missed detection when filtering overlapping positioning boxes; using distance correlation algorithm to calculate the distance relationship between cigarettes and the human body, in order to reduce the misjudgment rate of cigarette recognition. This article constructed an experimental dataset through network sampling, AI generation, and on-site collection, and conducted extensive testing based on it. The results show that the YOLO-SSDA algorithm proposed in this paper has an accuracy of 94%, a recall rate of 89.6%, an mAP50 value of 94.4%, and an average processing time of 39 ms for single vibration image input. Compared with other algorithms, it has higher detection accuracy and real-time performance.
文章引用:何超勋, 陈智霖, 黄声勇, 彭道福. YOLO-SSDA——基于改进YOLOv9的吸烟行为检测方法[J]. 计算机科学与应用, 2024, 14(9): 56-65. https://doi.org/10.12677/csa.2024.149187

参考文献

[1] 何嘉彬, 李雷孝, 林浩, 等. 面向计算机视觉的吸烟检测方法研究综述[J]. 计算机工程与应用, 2024, 60(1): 40-56.
[2] Sun, J. and Sun, Y. (2019) Research on Methods of Mine Fire Monitoring and Trend Prediction. Industry and Mine Automtion, 45, 1-4.
[3] Xang, X.P., Wang, X.Q., Lin, H., et al. (2022) Review on Improvement of Typical Object Detection Algorithms in Deep Learning. Computer Engineering and Applications, 58, 42-57.
[4] 梁绪, 李文新, 张航宁. 人体行为识别方法研究综述[J]. 计算机应用研究, 2022, 39(3): 651-660.
[5] Wang, C.-Y., Yeh, I.-H. and Liao, H.-Y.M. (2024) YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information. arXiv: 2402.13616. [Google Scholar] [CrossRef
[6] Ghuli, P., Shashank, B.N. and Rao, A.G. (2019) Development of Framework for Detecting Smoking Scene in Video Clips. Indonesian Journal of Electrical Engineering and Computer Science, 13, 22-26. [Google Scholar] [CrossRef
[7] Liao, J. and Zou, J. (2020) Smoking Target Detection Based on YOLOv3. 2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE), Harbin, 25-27 December 2020, 2241-2244. [Google Scholar] [CrossRef
[8] Zou, L., Shao, J. and Yu, Y. (2021) Smoking Behavior Monitoring System in Public Places Based on YOLOv5I+CBAM. 2021 Smart City Challenges & Outcomes for Urban Transformation (SCOUT), Bhubaneswar, 25-26 December 2021, 100-106. [Google Scholar] [CrossRef
[9] 杨国亮, 龚志鹏, 黄聪, 等. 基于深度学习的吸烟行为实时检测[J]. 安全与环境学报, 2023, 23(10): 3696-3705.
[10] Zhang, Z., Chen, H., Xiao, R. and Li, Q. (2021) Research on Smoking Detection Based on Deep Learning. Journal of Physics: Conference Series, 2024, Article 012042. [Google Scholar] [CrossRef
[11] Yang, T., Yang, J. and Meng, J. (2021) Driver’s Illegal Driving Behavior Detection with SSD Approach. 2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML), Chengdu, 16-18 July 2021, 109-114. [Google Scholar] [CrossRef
[12] Wang, C.-Y., Liao, H.-Y.M. and Yeh, I.-H. (2023) Designing Network Design Strategies through Gradient Path Analysis. Journal of Information Science and Engineering, 39, 975-995.
[13] Yang, L., Zhang, R.-Y., Li, L. and Xie, X. (2021) SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks. Proceedings of the 38th International Conference on Machine Learning, 18-24 July 2021, 11863-11874.
[14] Bodla, N., Singh, B., Chellappa, R. and Davis, L.S. (2017) Soft-NMS—Improving Object Detection with One Line of Code. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 5562-5570. [Google Scholar] [CrossRef
[15] 王彬, 赵作鹏. 基于YOLOv7的矿工吸烟识别方法研究[J]. 现代信息科技, 2024, 8(6): 66-69.
[16] 孙冰, 李好, 黄鑫凯, 等. 基于YOLOv7的人体关联实时吸烟目标检测方法[J]. 软件工程, 2024, 27(1): 64-67.
[17] 张洪硕. 基于YOLOv4的工地吸烟识别研究及系统实现[D]: [硕士学位论文]. 海口: 海南大学, 2024.
[18] 吴浩, 武晨. 基于深度学习的网络信息体系能力评估方法[J]. 中国电子科学研究院学报, 2021, 16(5): 524-532.