YOLOv5s-gse——基于YOLOv5s-gse的目标检测和机械臂吸附研究
YOLOv5s-gse—Study on Object Detection and Robotic Arm Suction Based on YOLOv5s-gse
DOI: 10.12677/csa.2025.1512343, PDF,   
作者: 高 凝, 王宝库:沈阳航空航天大学电子信息工程学院,辽宁 沈阳;李 鹤*:沈阳工学院信息与控制学院,辽宁 抚顺
关键词: 目标检测C3GSConvECA注意力机制目标吸取逆运动学Object Detection C3GSConv ECA Attention Mechanism Object Suction Inverse Kinematics
摘要: 针对传统目标检测算法在杂乱工业环境中存在的识别精度不足和定位不稳定等问题,本文提出了一种面向机械臂抓取任务的轻量化改进模型YOLOv5s-gse,以提升机械臂在复杂场景下的目标感知与操作能力。该模型以YOLOv5s为基础,在主干网络中引入由轻量卷积GSConv、Shuffle通道重排以及ECA注意力机制构成的C3GSConv模块,以增强特征提取能力;同时降低模型参数量和计算开销。在检测完成后,通过外参标定实现相机与机械臂坐标的转换,并基于逆运动学求解各关节角度,构建Episode1机械臂抓取实验平台进行验证。实验结果表明,YOLOv5s-gse模型在保持较高前向推理速度的同时,其mAP较原YOLOv5s提升0.03%,漏检率与误检率均略有降低;结合机械臂的吸取实验成功率极高,充分验证了所提方法在机器人抓取场景中的有效性。该方案兼具检测精度与轻量化优势,成功将多种轻量化技术整合并应用于一个功能完备的机器人抓取系统。
Abstract: To address the limitations of traditional object detection algorithms in cluttered industrial environments—specifically insufficient recognition accuracy and unstable localization—this paper proposes a lightweight improved model, YOLOv5s-gse, designed for robotic arm grasping tasks. Building upon YOLOv5s, the model integrates a C3GSConv module into the backbone network, which is composed of lightweight GSConv, Shuffle-based channel reordering, and an ECA attention mechanism, thereby enhancing feature extraction while reducing model parameters and computational cost. After object detection, extrinsic calibration is performed to map coordinates between the camera and the robotic arm, and inverse kinematics is applied to compute joint angles. An Episode1 robotic grasping experimental platform is constructed for evaluation. Experimental results show that YOLOv5s-gse achieves higher mAP—an improvement of 0.03% over the original YOLOv5s—while maintaining high inference speed, with slight reductions in both missed detections and false detections. When combined with the robotic arm’s suction-based grasping mechanism, the system demonstrates a very high success rate, validating the effectiveness of the proposed method in robotic grasping scenarios. The approach offers both accuracy and lightweight advantages, successfully integrating multiple lightweight techniques into a fully functional robotic grasping system.
文章引用:高凝, 王宝库, 李鹤. YOLOv5s-gse——基于YOLOv5s-gse的目标检测和机械臂吸附研究[J]. 计算机科学与应用, 2025, 15(12): 274-287. https://doi.org/10.12677/csa.2025.1512343

参考文献

[1] Sun, R., Wu, C., Zhao, X., Zhao, B. and Jiang, Y. (2024) Object Recognition and Grasping for Collaborative Robots Based on Vision. Sensors, 24, Article 195. [Google Scholar] [CrossRef] [PubMed]
[2] Chen, Y., Cai, Y. and Cheng, M. (2023) Vision-Based Robotic Object Grasping—A Deep Reinforcement Learning Approach. Machines, 11, Article 275. [Google Scholar] [CrossRef
[3] Song, Q., Li, S., Bai, Q., Yang, J., Zhang, X., Li, Z., et al. (2021) Object Detection Method for Grasping Robot Based on Improved YOLOv5. Micromachines, 12, Article 1273. [Google Scholar] [CrossRef] [PubMed]
[4] 丁昊, 李小光, 王术波. 基于YOLOv5目标检测的机械臂精准授粉系统设计与试验[J]. 山东农业大学学报(自然科学版), 2024, 55(3): 347-355.
[5] Wang, Z., Ge, Z., Liu, S., Li, Z. and Sun, J. (2021) YOLOv5-Lite: A Lightweight Object Detection Network for Edge Devices. arXiv: 2110.06191.
[6] Zhang, X., Zhou, X., Lin, M. and Sun, J. (2018) ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 6848-6856.
[7] Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W. and Hu, Q. (2020) ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 11531-11539. [Google Scholar] [CrossRef
[8] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł. and Polosukhin, I. (2017) Attention Is All You Need. Advances in Neural Information Processing Systems (NeurIPS), 30, 5998-6008.
[9] Hartley, R. and Zisserman, A. (2004) Multiple View Geometry in Computer Vision. 2nd Edition, Cambridge University Press. [Google Scholar] [CrossRef
[10] 孙超, 胡志明, 彭麟谊, 等. 六自由度机械臂快速路径规划与手眼标定研究[J]. 制造技术与机床, 2025(2): 48-56.
[11] 张晓枫, 钟颖媛. 基于三维视觉的机械臂抓取策略研究[J]. 造纸装备及材料, 2023, 52(7): 21-23.
[12] Karhan, H. and Bingül, Z. (2025) Automatic Determination of the Denavit-Hartenberg Parameters for the Forward Kinematics of All Serial Robots: Novel Kinematics Toolbox. Machines, 13, Article 944. [Google Scholar] [CrossRef
[13] Danaci, H., Nguyen, L.A., Harman, T.L. and Pagan, M. (2023) Inverse Kinematics for Serial Robot Manipulators by Particle Swarm Optimization and POSIX Threads Implementation. Applied Sciences, 13, Article 4515. [Google Scholar] [CrossRef
[14] 李万艳, 阮观强, 张振东. 基于YOLOv5与迁移学习的目标检测和机械臂抓取位姿估计[J]. 机械传动, 2024, 48(3): 170-176.