基于改进YOLO11-Seg的轻量化端子接线分割模型
Lightweight Terminal Wiring Segmentation Model Based on Improved YOLO11-Seg
DOI: 10.12677/ojtt.2026.152017, PDF,   
作者: 王永伟, 潘为刚:山东交通学院轨道交通学院,山东 济南
关键词: YOLOv11-SegShuffleNetV2SimSPPF接线端子分割YOLOv11-Seg ShuffleNetV2 SimSPPF Terminal Wiring Segmentation
摘要: 为了解决复杂背景下端子接线分割模型准确率不足与边缘设备部署困难的问题,本文提出了一种改进的YOLOv11n网络分割模型。该模型引入洗牌网络版本2 (ShuffleNetv2)作为主干网络,结合点群卷积与通道混洗以降低参数量与运算量;同时,采用简化空间金字塔池化模块(Simplified Spatial Pyramid Pooling-Fast, SimSPPF)提取并融合多尺度上下文特征,从而增强模型对不同尺度目标区域的表征能力,提高分割结果的准确性。为了验证改进模型的准确性和快速性,本文构建了包含1290张图像的端子数据集,覆盖多同光照条件、拍摄角度和线缆颜色。实验结果表明,改进后的分割模型在参数量和计算量分别较YOLOv11n网络分割模型降低了18%和16.7%,在此基础上,其检测精度和mAP50-95分别达到98.4%和91.8%。同时,模型在嵌入式平台上的推理速度提升了29.9%,具备良好的实时性表现。在分割精度和轻量化性能上,该模型相较于现有轻量化算法均展现出显著优势,该研究可为线缆智能化检测提供技术支持。
Abstract: To address the issues of insufficient segmentation accuracy and deployment difficulty on edge devices in terminal block segmentation under complex backgrounds, this paper proposes an improved YOLOv11n-based segmentation network. ShuffleNetV2 is introduced as a lightweight backbone to reduce model parameters and computational cost through efficient pointwise convolutions and channel shuffling. Meanwhile, a Simplified Spatial Pyramid Pooling-Fast (SimSPPF) module is employed to enhance multi-scale contextual feature fusion, improving the representation of targets with varying scales. A terminal block dataset containing 1,290 images with diverse lighting conditions, viewpoints, and cable colors is constructed for evaluation. Experimental results demonstrate that the proposed segmentation model reduces the number of parameters and computational complexity by 18% and 16.7%, respectively, compared with the YOLOv11n segmentation model. On this basis, the model achieves a detection accuracy of 98.4% and an mAP@0.5-0.95 of 91.8%. Moreover, the inference speed on an embedded platform is improved by 29.9%, indicating favorable real-time performance. Overall, the proposed model exhibits clear advantages over existing lightweight methods in terms of segmentation accuracy and model efficiency, providing effective technical support for intelligent cable inspection.
文章引用:王永伟, 潘为刚. 基于改进YOLO11-Seg的轻量化端子接线分割模型[J]. 交通技术, 2026, 15(2): 181-192. https://doi.org/10.12677/ojtt.2026.152017

参考文献

[1] Harrod, S.S. (2025) Railway Signal Digitalization with the European Rail Traffic Management System and Positive Train Control: Industry 4.0 Expectations and Reality. Transportation Research Record, 2679, 1052-1065. [Google Scholar] [CrossRef
[2] 王松, 蒋潇, 吴丹. 面向机器人操作的柔性线缆视觉分割与抓取位姿生成算法[J]. 机械工程学报, 2025, 61(9): 231-240.
[3] 柏亚鑫. 基于视觉的双臂机器人柔性线缆检测与操作方法研究[D]: [硕士学位论文]. 哈尔滨: 哈尔滨工业大学, 2025.
[4] 王军文. 面向客机机舱的线缆语义分割方法研究[D]: [硕士学位论文]. 天津: 天津理工大学, 2025.
[5] 曹国群, 刘桂雄. 基于双模态融合的线缆图像语义分割方法研究[J]. 电子测量技术, 2023, 46(10): 184-188.
[6] Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. and Chen, L. (2018) MobileNetV2: Inverted Residuals and Linear Bottlenecks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 4510-4520. [Google Scholar] [CrossRef
[7] Zhang, X., Zhou, X., Lin, M. and Sun, J. (2018) ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 6848-6856. [Google Scholar] [CrossRef
[8] Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C. and Xu, C. (2020) GhostNet: More Features from Cheap Operations. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 1580-1589. [Google Scholar] [CrossRef
[9] Tan, M. and Le, Q.V. (2021) EfficientNetV2: Smaller Models and Faster Training.
[10] Ma, N., Zhang, X., Zheng, H. and Sun, J. (2018) ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design. In: Lecture Notes in Computer Science, Springer, 122-138. [Google Scholar] [CrossRef
[11] He, K., Zhang, X., Ren, S. and Sun, J. (2015) Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 1904-1916. [Google Scholar] [CrossRef] [PubMed]
[12] Jooshin, H.K., Nangir, M. and Seyedarabi, H. (2024) Inception-YOLO: Computational Cost and Accuracy Improvement of the YOLOv5 Model Based on Employing Modified CSP, SPPF, and Inception Modules. IET Image Processing, 18, 1985-1999. [Google Scholar] [CrossRef
[13] Dong, X., Li, S. and Zhang, J. (2024) YOLOv5s Object Detection Based on Sim SPPF Hybrid Pooling. Optoelectronics Letters, 20, 367-371. [Google Scholar] [CrossRef
[14] Khanam, R. and Hussain, M. (2024) YOLOv11: An Overview of the Key Architectural Enhancements.
[15] Tian, Y., Ye, Q. and Doermann, D. (2025) YOLOv12: Attention-Centric Real-Time Object Detectors.