基于改进YOLO11的复杂场景带钢表面缺陷检测算法
Complex Scene Strip Steel Surface Defect Detection Algorithm Based on Improved YOLO11
摘要: 针对钢铁表面缺陷检测中存在的形态不规则、特征细微及背景干扰复杂等难题,本文提出一种改进YOLO11的钢铁表面缺陷检测算法。该算法在主干网络的C3k2模块中引入可变形注意力机制,增强模型对不规则缺陷如裂纹(Cr)的空间特征捕捉能力。采用风车形卷积替代部分标准卷积,提升模型对夹杂(Pa)、斑点(Ps)等低对比度细微目标的特征提取性能。同时在C2PSA模块中集成基于变分率削减的线性复杂度注意力机制(TSSA),有效实现全局上下文信息交互。在NEU-DET数据集上的实验表明,改进模型mAP@0.5达到80.76%,相比原始YOLO11提升3.16%,对比其他的目标检测主流方法,改进算法精度上都有了明显的提升,满足钢铁表面缺陷检测的精度要求。
Abstract: To address challenges in steel surface defect detection, such as irregular shapes, subtle features, and complex background interference, this paper proposes an improved YOLO11 algorithm for steel surface defect detection. The algorithm introduces a deformable attention mechanism in the C3k2 module of the backbone network, enhancing the model’s ability to capture spatial features of irregular defects such as cracks (Cr). Part of the standard convolution is replaced with pinwheel-shaped convolutions to improve the model’s feature extraction performance for low-contrast, fine targets such as inclusions (Pa) and spots (Ps). Additionally, the C2PSA module integrates a linear-complexity attention mechanism (TSSA) based on variance reduction, effectively enabling global context information interaction. Experiments on the NEU-DET dataset show that the improved model achieves an mAP@0.5 of 80.76%, an increase of 3.16% compared to the original YOLO11. Compared with other mainstream object detection methods, the improved algorithm demonstrates a significant improvement in accuracy, meeting the precision requirements for steel surface defect detection.
文章引用:王盼, 李莉. 基于改进YOLO11的复杂场景带钢表面缺陷检测算法[J]. 计算机科学与应用, 2026, 16(2): 240-250. https://doi.org/10.12677/csa.2026.162055

参考文献

[1] 米春风, 卢琨, 汪文艳, 等. 基于机器视觉的带钢表面缺陷检测研究进展[J]. 钢铁研究学报, 2022, 34(5): 401-414.
[2] 李跃, 李铁柱, 张宝, 等. 带钢表面缺陷检测方法研究进展[J]. 钢铁研究学报, 2023, 35(8): 950-962.
[3] 马冬梅, 朱佳浩. 面向热轧带钢表面缺陷检测的YOLOv5算法优化分析[J]. 计算机应用与软件, 2023, 40(2): 178-184.
[4] 樊嵘, 张雪峰, 罗鑫, 等. 面向带钢表面小目标缺陷检测的改进YOLOv7算法[J]. 合肥工业大学学报(自然科学版), 2024, 47(3): 367-376.
[5] 马燕婷, 郭晓峰, 高毅, 等. 改进YOLOv5网络的带钢表面缺陷检测方法[J]. 电子测量与仪器学报, 2023, 36(8): 150-159.
[6] 周亚罗, 李志强, 刘凤春, 等. 基于STCS-YOLO的带钢表面缺陷检测算法[J]. 中国冶金, 2023, 33(12): 128-138.
[7] 王春梅, 刘欢. YOLOv8-VSC: 一种轻量级的带钢表面缺陷检测算法[J]. 计算机科学与探索, 2024, 18(1): 151-160.
[8] 戴林华, 黎远松, 石睿. HSED-YOLO: 一种轻量化的带钢表面缺陷检测模型[J]. 广西师范大学学报(自然科学版), 2025, 43(2): 65-77.
[9] 刘凤春, 李志强, 周亚罗, 等. YOLO-VDCW: 一种新的轻量化带钢表面缺陷检测算法[J]. 中国冶金, 2024, 34(6): 125-135.
[10] Lv, B., Duan, B., Zhang, Y., et al. (2024) Research on Surface Defect Detection of Strip Steel Based on Improved YOLOv7. Sensors, 24, Article 2667. [Google Scholar] [CrossRef] [PubMed]
[11] Zhou, S., Ao, S., Yang, Z., et al. (2024) Surface Defect Detection of Steel Plate Based on SKS-YOLO. IEEE Access, 12, 76723-76737. [Google Scholar] [CrossRef
[12] Huang, F., Wen, H., Zhang, Z., et al. (2024) A Comparative Study of YOLOv9 and YOLOv10 for Steel Surfaced Effect Detection. 2024 30th International Conference on Mechatronics and Machine Vision in Practice (M2VIP), Leeds, 3-5 October 2024, 1-6. [Google Scholar] [CrossRef
[13] Xia, Z., Pan, X., Song, S., Li, L.E. and Huang, G. (2022) Vision Transformer with Deformable Attention. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 4784-4793. [Google Scholar] [CrossRef
[14] Yang, J., Liu, S., Wu, J., Su, X., Hai, N. and Huang, X. (2025) Pinwheel-Shaped Convolution and Scale-Based Dynamic Loss for Infrared Small Target Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 39, 9202-9210. [Google Scholar] [CrossRef
[15] Wu, Z., Ding, T., Lu, Y., Pai, D., Zhang, J., Wang, W., et al. (2024) Token Statistics Transformer: Linear-Time Attention via Variational Rate Reduction. arXiv: 2412.17810.
[16] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., et al. (2016) SSD: Single Shot MultiBox Detector. In: Leibe, B., Matas, J., Sebe, N. and Welling, M., Eds., Computer VisionECCV 2016, Springer, 21-37. [Google Scholar] [CrossRef
[17] Farhadi, A. and Redmon, J. (2018) YOLOv3: An Incremental Improvement. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-22 June 2018, 7794-7803.
[18] Tian, Y., Ye, Q. and Doermann, D. (2025) YOLOv12: Attention-Centric Real-Time Object Detectors. arXiv: 2502.12524.
[19] Feng, Y., Huang, J., Du, S., Ying, S., Yong, J.H., Li, Y., et al. (2024) Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation. arXiv: 2408.04804.
[20] Zhao, Y., Lv, W., Xu, S., Wei, J., Wang, G., Dang, Q., et al. (2024) DETRs Beat YOLOs on Real-Time Object Detection. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 16-22 June 2024, 16965-16974. [Google Scholar] [CrossRef