基于GS-MobileNet的轻量级道路异常检测算法
A Lightweight Road Anomaly Detection Algorithm Based on GS-MobileNet
DOI: 10.12677/csa.2026.164152, PDF,   
作者: 余见涛, 龙传煜, 罗 帆, 郑仁胜, 田雨雨:贵州乌江水电开发有限责任公司乌江渡发电厂,贵州 遵义
关键词: 路面缺陷检测深度学习轻量化GS-MobileNetRoad Defect Detection Deep Learning Lightweight GS-MobileNet
摘要: 针对传统路面缺陷检测方法在复杂场景下实时性差、误检率高,以及现有深度学习模型难以兼顾精度与边缘部署效率的问题,本文构建了轻量化的路面缺陷检测算法GS-MobileNet。基于MobileNetV4-Ghost的主干网络架构。通过引入Ghost模块的廉价线性变换操作,在保证特征表达能力的同时显著降低模型参数量。创新设计的局部–全局协同注意力机制,在浅层和中层网络嵌入BoT模块强化裂纹边缘连续性建模,深层采用SimAM无参注意力动态抑制背景噪声。进一步通过BiFPN动态加权多尺度融合策略,有效改善微小裂纹的检出能力,并结合分组查询解码器实现分类–定位任务的高效对齐。最终构建GS-MobileNet模型以3.12 M参数量取得91.3%的精确率与92.1%的mAP。
Abstract: Addressing the issues of poor real-time performance and high false positive rates of traditional road defect detection methods in complex scenarios, as well as the difficulty of existing deep learning models in balancing accuracy and edge deployment efficiency, this paper constructs a lightweight road defect detection algorithm named GS-MobileNet. The algorithm is based on the MobileNetV4-Ghost backbone network architecture. By introducing the Ghost module’s cheap linear transformation operations, it significantly reduces the number of model parameters while ensuring feature representation capability. The innovatively designed local-global cooperative attention mechanism embeds the BoT module in shallow and middle layers to enhance crack edge continuity modeling, while the deep layers adopt the SimAM parameter-free attention to dynamically suppress background noise. Furthermore, the BiFPN dynamic weighted multi-scale fusion strategy effectively improves the detection capability for tiny cracks, and combined with the group query decoder, it achieves efficient alignment of classification and localization tasks. The final GS-MobileNet model, with only 3.12 M parameters, achieves a precision of 91.3% and an mAP of 92.1%.
文章引用:余见涛, 龙传煜, 罗帆, 郑仁胜, 田雨雨. 基于GS-MobileNet的轻量级道路异常检测算法[J]. 计算机科学与应用, 2026, 16(4): 549-562. https://doi.org/10.12677/csa.2026.164152

参考文献

[1] Kirschke, K.R. and Velinsky, S.A. (1992) Histogram‐Based Approach for Automated Pavement‐Crack Sensing. Journal of Transportation Engineering, 118, 700-710. [Google Scholar] [CrossRef
[2] Li, P., Wang, C., Li, S. and Feng, B. (2015) Research on Crack Detection Method of Airport Runway Based on Twice-Threshold Segmentation. 2015 Fifth International Conference on Instrumentation and Measurement, Computer, Communication and Control (IMCCC), Qinhuangdao, 18-20 September 2015, 1716-1720. [Google Scholar] [CrossRef
[3] Oliveira, H. and Correia, P.L. (2009) Automatic Road Crack Segmentation Using Entropy and Image Dynamic Thresholding. European Signal Processing Conference (EUSIPCO), Glasgow, 24-28 August 2009, 622-626.
[4] 周强, 脱羚, 王莹, 等. 基于多特征信息融合的陶瓷墙地缺陷在线检测[J]. 硅酸盐通报, 2019, 38(1): 259-264.
[5] Qin, D., Leichner, C., Delakis, M., Fornoni, M., Luo, S., Yang, F., et al. (2024) Mobilenetv4: Universal Models for the Mobile Ecosystem. In: Leonardis, A., et al., Eds., Computer VisionECCV 2024, Springer, 78-96. [Google Scholar] [CrossRef
[6] Tang, Y., Han, K., Guo, J., et al. (2022) GhostNetV2: Enhance Cheap Operation with Long-Range Attention. Proceedings of the 36th International Conference on Neural Information Processing Systems, New Orleans, 28 November-9 December 2022, 9969-9982.
[7] Srinivas, A., Lin, T.Y., Parmar, N., et al. (2021) Bottleneck Transformers for Visual Recognition.
http://arxiv.org/abs/2101.11605
[8] Yang, L., Zhang, R.Y., Li, L., et al. (2021) SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks. Proceedings of the 38th International Conference on Machine Learning, PMLR, Vol. 139, 11863-11874.
[9] Tan, M., Pang, R. and Le, Q.V. (2020) Efficientdet: Scalable and Efficient Object Detection. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 10778-10787. [Google Scholar] [CrossRef
[10] Gennari, M., Fawcett, R. and Prisacariu, V.A. (2019) DSConv: Efficient Convolution Operator. 2019 IEEE/CVF International Conference on Computer Vision Workshops, ICCV Workshops 2019, Seoul, 27-28 October 2019, 5148-5157.
http://arxiv.org/abs/1901.01928
[11] Zhao, Y., Lv, W., Xu, S., Wei, J., Wang, G., Dang, Q., et al. (2024) DETRs Beat YOLOs on Real-Time Object Detection. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 16-22 June 2024, 16965-16974. [Google Scholar] [CrossRef
[12] Jocher, G. (2020) YOLOv5 by Ultralytics.
https://github.com/ultralytics/yolov5
[13] Wang, C., Bochkovskiy, A. and Liao, H.M. (2023) Yolov7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 7464-7475. [Google Scholar] [CrossRef