基于通道混洗和注意力机制的轻量缺陷检测算法
A Lightweight Defect Detection Algorithm Based on Channel Shuffling and Attention Mechanism
DOI: 10.12677/MOS.2024.131094, PDF,    科研立项经费支持
作者: 黄轶凡, 金 涛:上海理工大学光电信息与计算机工程学院,上海;黄之文:上海理工大学机械工程学院,上海
关键词: 表面缺陷检测图像分割U型卷积神经网络注意力机制混合损失计算Surface Defect Detection Image Segmentation U-Shaped Convolutional Neural Network Attention Mechanism Hybrid Loss Calculation
摘要: 缺陷检测在制造业中扮演着确保产品质量和效率的关键角色。由于缺陷通常没有固定形状,并且受光照影响较大,因此基于图像的高精度缺陷检测成为一项极具挑战性的任务。本文针对缺陷图像的特点,提出了一种结合了通道混洗和注意力机制的U型结构的卷积神经网络。首先在跳跃连接中加入注意力门机制来提高网络的泛化能力;其次,针对上采样中容易产生边缘轮廓失真的问题,采用了混合损失计算;最后,因使用了效率更高的编码解码器,在模型参数和浮点运算数较低的情况下,所提出的模型MIoU和F1指标达到了91.89,94.67和64.82,72.93。与FCN、U-Net、U-Net++进行了比较,结果表明所提出的方法在表面缺陷检测领域优于相关方法。
Abstract: Defect detection plays a crucial role in ensuring product quality and efficiency in manufacturing. Due to the irregular shapes of defects and their susceptibility to lighting variations, image-based high-precision defect detection becomes a highly challenging task. In response to the characteristics of defect images, this paper proposes a U-shaped convolutional neural network with a hybrid loss calculation and attention mechanism. Firstly, an attention gate mechanism is introduced in the skip connections to enhance the network’s generalization ability. Secondly, to address the issue of edge contour distortion during up-sampling, a hybrid loss calculation is employed. Lastly, leveraging more efficient encoding and decoding layers, the proposed model achieves MIoU and F1 scores of 91.89, 94.67 and 64.82, 72.93, respectively, with lower model parameters and floating-point opera-tions. Comparative analysis against FCN, U-Net, and U-Net++ demonstrates the superior perfor-mance of the proposed method in the field of surface defect detection.
文章引用:黄轶凡, 黄之文, 金涛. 基于通道混洗和注意力机制的轻量缺陷检测算法[J]. 建模与仿真, 2024, 13(1): 976-985. https://doi.org/10.12677/MOS.2024.131094

参考文献

[1] Koch, C., Georgieva, K., Kasireddy, V., et al. (2015) A Review on Computer Vision Based Defect Detection and Condition Assessment of Concrete and Asphalt Civil Infrastructure. Advanced Engineering Informatics, 29, 196-210. [Google Scholar] [CrossRef
[2] Chen, Y., Ding, Y., Zhao, F., et al. (2021) Surface Defect Detection Meth-ods for Industrial Products: A Review. Applied Sciences, 11, Article No. 7657. [Google Scholar] [CrossRef
[3] LeCun Y, Boser, B., Denker, J.S., et al. (1989) Backpropagation Applied to Handwritten Zip Code Recognition. Neural Computation, 1, 541-551. [Google Scholar] [CrossRef
[4] Simonyan, K. and Zisserman, A. (2014) Very Deep Convolutional Net-works for Large-Scale Image Recognition. Arxiv:1409.1556
[5] Ren, S., He, K., Girshick, R., et al. (2015) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149.
[6] Long, J., Shelhamer, E. and Darrell, T. (2015) Fully Convolutional Networks for Semantic Segmentation. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, 07-12 June 2015, 3431-3440. [Google Scholar] [CrossRef
[7] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolution-al Networks for Biomedical Image Segmentation. International Conference on Medical Image Computing and Comput-er-Assisted Intervention MICCAI 2015: Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015, Mu-nich, 5-9 October 2015, 234-241. [Google Scholar] [CrossRef
[8] Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., et al. (2019) Unet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation. IEEE Transactions on Medical Imaging, 39, 1856-1867. [Google Scholar] [CrossRef
[9] Badrinarayanan, V., Kendall, A. and SegNet, R.C. (2015) A Deep Con-volutional Encoder-Decoder Architecture for Image Segmentation. Arxiv:1511.00561
[10] Chen, L.C., Zhu, Y., Papandreou, G., et al. (2018) Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, 8-14 September 2018, 801-818. [Google Scholar] [CrossRef
[11] Yang, F., Zhang, L., Yu, S., et al. (2019) Feature Pyramid and Hier-archical Boosting Network for Pavement Crack Detection. IEEE Transactions on Intelligent Transportation Systems, 21, 1525-1535. [Google Scholar] [CrossRef
[12] Fei, Y., Wang, K.C.P., Zhang, A., et al. (2019) Pixel-Level Cracking Detection on 3D Asphalt Pavement Images through Deep-Learning-Based CrackNet-V. IEEE Transactions on Intelligent Transportation Systems, 21, 273-284. [Google Scholar] [CrossRef
[13] Zhang, X., Zhou, X., Lin, M., et al. (2018) Shufflenet: An Extremely Ef-ficient Convolutional Neural Network for Mobile Devices. Proceedings of the IEEE Conference on Computer Vision and Pat-tern Recognition, Salt Lake City, 18-23 June 2018, 6848-6856. [Google Scholar] [CrossRef
[14] Oktay, O., Schlemper, J., Folgoc, L.L., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas. Arxiv:1804.03999
[15] Tabernik, D., Šela, S., Skvarč, J., et al. (2020) Segmentation-Based Deep-Learning Approach for Sur-face-Defect Detection. Journal of Intelligent Manufacturing, 31, 759-776. [Google Scholar] [CrossRef
[16] Huang, Y., Qiu, C. and Yuan, K. (2020) Surface Defect Saliency of Magnetic Tile. The Visual Computer, 36, 85-96. [Google Scholar] [CrossRef