基于YOLOv8实例分割的桥梁裂纹识别
Bridge Crack Recognition Based on YOLOv8 Instance Segmentation
DOI: 10.12677/airr.2025.146144, PDF,    科研立项经费支持
作者: 杨 明:安顺开放大学,贵州 安顺
关键词: 桥梁裂纹检测Yolov8数据增强注意力机制Bridge Crack Detection Yolov8 Data Enhancement Attention Mechanism
摘要: 桥梁结构的安全性是工程领域的重要关注点,裂纹检测是保障桥梁安全的关键环节。本文提出了一种基于YOLOv8实例分割的桥梁裂纹识别方法,旨在解决传统检测方法在准确性上不足的问题。首先,我们改进了数据增强方法,增强了模型对不同环境下裂纹图像的鲁棒性和泛化能力,使得模型在复杂实际应用中表现更加稳定。其次,在C2F模块中引入了Coordinate Attention注意力机制。该机制通过对特征图的通道关系进行建模,提升了模型对关键信息的关注度,从而提高了裂纹检测的准确性。Coordinate Attention在空间和通道维度上融合信息,使模型能够更精准地捕捉到裂纹的细节特征。实验结果表明,本文提出的方法在本文构造的数据集上表现出色,在精度上提高了3.5个百分点,并且召回率和mAP均有提升,对实际工程应用具有重要指导意义。
Abstract: The safety of bridge structures is an important concern in the engineering field, and crack detection is a key link to ensure the safety of bridges. In this paper, we propose a bridge crack recognition method based on YOLOv8 instance segmentation, aiming at solving the problem of insufficient accuracy of traditional detection methods. First, we improve the data enhancement method to enhance the robustness and generalization ability of the model to crack images in different environments, which makes the model perform more stably in complex practical applications. Second, the Coordinate Attention attention mechanism is introduced in the C2F module. This mechanism improves the accuracy of crack detection by modeling the channel relationship of the feature map, which enhances the model’s attention to the key information. Coordinate Attention fuses the information in both spatial and channel dimensions, enabling the model to capture the detailed features of the cracks more accurately. The experimental results show that the method proposed in this paper performs well on several bridge crack datasets, with an improvement of 3.5 percentage points in precision and both recall and mAP, which is of great significance as a guide for practical engineering applications.
文章引用:杨明. 基于YOLOv8实例分割的桥梁裂纹识别[J]. 人工智能与机器人研究, 2025, 14(6): 1544-1550. https://doi.org/10.12677/airr.2025.146144

参考文献

[1] Hou, Q., Zhou, D. and Feng, J. (2021) Coordinate Attention for Efficient Mobile Network Design. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 13708-13717. [Google Scholar] [CrossRef
[2] He, K., Zhang, X., Ren, S. and Sun, J. (2015) Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 1904-1916. [Google Scholar] [CrossRef] [PubMed]
[3] Ren, S., He, K., Girshick, R. and Sun, J. (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef] [PubMed]
[4] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., et al. (2016) SSD: Single Shot MultiBox Detector. In: Leibe, B., Matas, J., Sebe, N. and Welling, M., Eds., Computer VisionECCV 2016, Springer, 21-37. [Google Scholar] [CrossRef
[5] Wang, J., Dai, H., Chen, T., Liu, H., Zhang, X., Zhong, Q., et al. (2023) Toward Surface Defect Detection in Electronics Manufacturing by an Accurate and Lightweight Yolo-Style Object Detector. Scientific Reports, 13, Article No. 7062. [Google Scholar] [CrossRef] [PubMed]
[6] Kim, I., Jeon, H., Baek, S., Hong, W. and Jung, H. (2018) Application of Crack Identification Techniques for an Aging Concrete Bridge Inspection Using an Unmanned Aerial Vehicle. Sensors, 18, Article 1881. [Google Scholar] [CrossRef] [PubMed]
[7] Terven, J. and Cordova-Esparza, D. (2023) A Comprehensive Review of YOLO: From YOLOv1 to YOLOv8 and Beyond. arXiv: 2304.00501.
[8] 雷源毅, 朱文球, 廖欢. 复杂场景下的改进YOLOv8n安全帽佩戴检测算法[J]. 软件工程, 2023, 26(12): 46-51.
[9] 曹士杰, 张竹林. 基于改进YOLOv5的矿山石块实例分割算法[J]. 兰州工业学院学报, 2023, 30(6): 19-25.
[10] Bochkovskiy, A., Wang, C.Y. and Liao, H.Y.M. (2020) Yolov4: Optimal Speed and Accuracy of Object Detection. arXiv: 2004.10934.
[11] 梁锦秀. 低光照条件下的图像增强和识别关键技术研究[D]: [博士学位论文]. 广州: 华南理工大学, 2021.
[12] Zheng, C., Jia, W., Wu, S. and Li, Z. (2023) Neural Augmented Exposure Interpolation for Two Large-Exposure-Ratio Images. IEEE Transactions on Consumer Electronics, 69, 87-97. [Google Scholar] [CrossRef
[13] 贾世娜. 基于改进YOLOv5的小目标检测算法研究[D]: [硕士学位论文]. 南昌: 南昌大学, 2022.
[14] Wang, C., Bochkovskiy, A. and Liao, H.M. (2023) YOLOv7: Trainable Bag-Of-Freebies Sets New State-Of-The-Art for Real-Time Object Detectors. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 7464-7475. [Google Scholar] [CrossRef