基于近端策略优化算法及视觉感知的机械臂导纳控制研究
Research on Admittance Control of Manipulator Based on Proximal Policy Optimization Algorithm and Visual Perception
DOI: 10.12677/mos.2024.136595, PDF,    国家自然科学基金支持
作者: 黄一超, 赵宇涵, 卢 洋:上海理工大学健康科学与工程学院,上海;上海健康医学院协同科研中心,上海;张嘉琪, 赵文龙*, 周 亮*:上海健康医学院协同科研中心,上海
关键词: 深度强化学习近端策略优化导纳控制视觉伺服控制MuJoCo仿真Deep Reinforcement Learning Proximal Policy Optimization Admittance Control Visual Servo Control MuJoCo Simulation
摘要: 现代机械臂交互任务中,由于环境的复杂性和不确定性,精确的物体表面建模常常难以实现。因此,如何在不依赖精确模型的情况下,提高机械臂与环境交互时的适应性和稳定性成为机械臂与环境交互任务的研究重点之一。本文针对机械臂交互任务,旨在实现视觉引导下的精细力控。研究工作首先基于MuJoCo (Multi-Joint Dynamics with Contact)物理引擎搭建了机械臂交互仿真环境,并创新性的融合了基于位置的视觉伺服(Position-Based Visual Servo, PBVS)控制和导纳控制。通过深度强化学习(Deep Reinforcement Learning, DRL)中的近端策略优化(Proximal Policy Optimization, PPO)算法,有效整合了视觉信息和力信息,从而提出了一种结合了视觉感知的导纳控制策略。通过对比实验验证,结合视觉感知的导纳控制相较于视觉伺服控制,力控整体性能提升68.75%;相较于经典的导纳控制,峰值力控制精度提高15%。实验结果表明,结合视觉感知的导纳控制在平面和不规则凹面环境中均表现出色:不仅能精确执行视觉引导下的力控任务,还能在多样化的接触面上保持稳定的交互力并迅速适应环境变化。在精密装配、医疗辅助和服务机械臂等领域,能够提高机械臂在复杂、不确定环境中的适应性和稳定性,从而推动智能机械臂自主操作的进一步发展。
Abstract: In modern manipulator interaction tasks, due to the complexity and uncertainty of the environment, accurate object surface modeling is often difficult to achieve. Therefore, improving the adaptability and stability of the interaction between the manipulator and the environment has become one of the research focuses of the interaction task. Aiming at the interactive task of the manipulator, this paper aims to realize the fine force control under visual guidance. Therefore, based on the MuJoCo (Multi-Joint Dynamics with Contact) physics engine, we built an interactive simulation environment for the manipulator, and innovatively integrated the position-based visual servo control and admittance control. Through the Proximal Policy Optimization (PPO) algorithm in Deep Reinforcement Learning (DRL), the visual information and force information are effectively integrated, and an admittance control strategy combining visual perception is proposed. Through comparative experiments, the admittance control combined with visual perception is compared with visual servo control, and the overall performance of force control is improved by 68.75%. Compared with the classical admittance control, the peak force control accuracy is improved by 15%. The experimental results showed that the admittance control combined with visual perception performs well in both flat and irregular concave environments: it can not only accurately perform visual-guided force control tasks, but also maintain stable interaction forces on a variety of contact surfaces and quickly adapt to environmental changes. In the fields of precision assembly, medical assistance and service manipulator, it can improve the adaptability and stability of manipulator in complex and uncertain environments, thus promoting the further development of autonomous operation of intelligent manipulator.
文章引用:黄一超, 张嘉琪, 赵宇涵, 卢洋, 赵文龙, 周亮. 基于近端策略优化算法及视觉感知的机械臂导纳控制研究[J]. 建模与仿真, 2024, 13(6): 6512-6524. https://doi.org/10.12677/mos.2024.136595

参考文献

[1] Prasanth, N., Shrivastava, K., Sharma, A., Basu, A., Sinha, R.A. and Raja, S.P. (2023) Gesture-Based Mouse Control System Based on MPU6050 and Kalman Filter Technique. International Journal of Intelligent Systems Technologies and Applications, 21, 56-71. [Google Scholar] [CrossRef
[2] Duan, J., Gan, Y., Chen, M. and Dai, X. (2018) Adaptive Variable Impedance Control for Dynamic Contact Force Tracking in Uncertain Environment. Robotics and Autonomous Systems, 102, 54-65. [Google Scholar] [CrossRef
[3] 孟奥, 吴志宇, 周州, 张付超, 胡冰山, 喻洪流. 变刚度关节机构设计与变阻抗增益控制仿真研究[J]. 建模与仿真, 2023, 12(6): 5165-5176.
[4] Ferraguti, F., Talignani Landi, C., Sabattini, L., Bonfè, M., Fantuzzi, C. and Secchi, C. (2019) A Variable Admittance Control Strategy for Stable Physical Human-Robot Interaction. The International Journal of Robotics Research, 38, 747-765. [Google Scholar] [CrossRef
[5] Wen, K., Necsulescu, D. and Sasiadek, J. (2007) Haptic Force Control Based on Impedance/Admittance Control Aided by Visual Feedback. Multimedia Tools and Applications, 37, 39-52. [Google Scholar] [CrossRef
[6] Qi, J., Xu, Z., Chu, J., Zhu, M. and Teng, Y. (2023) Visual-Admittance-Based Model Predictive Control for Nuclear Collaborative Robots. Frontiers in Energy Research, 11, Article ID: 1203904. [Google Scholar] [CrossRef
[7] Seraji, H. (1994) Adaptive Admittance Control: An Approach to Explicit Force Control in Compliant Motion. Proceedings of the 1994 IEEE International Conference on Robotics and Automation, San Diego, 8-13 May 1994, 2705-2712. [Google Scholar] [CrossRef
[8] Alshbatat, A.I.N. (2021) Adaptive Vision-Based System for Landing an Autonomous Hexacopter Drone on a Specific Landing Platform. International Journal of Intelligent Systems Technologies and Applications, 20, 245-270. [Google Scholar] [CrossRef
[9] Ott, C., Mukherjee, R. and Nakamura, Y. (2014) A Hybrid System Framework for Unified Impedance and Admittance Control. Journal of Intelligent & Robotic Systems, 78, 359-375. [Google Scholar] [CrossRef
[10] Xu, M., Hu, A. and Wang, H. (2023) Visual-Impedance-Based Human-Robot Cotransportation with a Tethered Aerial Vehicle. IEEE Transactions on Industrial Informatics, 19, 10356-10365. [Google Scholar] [CrossRef
[11] Zimmermann, S., Poranne, R. and Coros, S. (2021) Go Fetch!—Dynamic Grasps Using Boston Dynamics Spot with External Robotic Arm. 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, 30 May-5 June 2021, 4488-4494. [Google Scholar] [CrossRef
[12] De Schutter, J. and Baeten, J. (2003) Integrated Visual Servoing and Force Control: The Task Frame Approach. Springer Science & Business Media.
[13] Wu, M., He, Y. and Liu, S. (2019) Shared Impedance Control Based on Reinforcement Learning in a Human-Robot Collaboration Task. In: Berns, K. and Görges, D., Eds., Advances in Intelligent Systems and Computing, Springer International Publishing, 95-103. [Google Scholar] [CrossRef
[14] Kozlovsky, S., Newman, E. and Zacksenhouse, M. (2022) Reinforcement Learning of Impedance Policies for Peg-In-Hole Tasks: Role of Asymmetric Matrices. IEEE Robotics and Automation Letters, 7, 10898-10905. [Google Scholar] [CrossRef
[15] Bogdanovic, M., Khadiv, M. and Righetti, L. (2020) Learning Variable Impedance Control for Contact Sensitive Tasks. IEEE Robotics and Automation Letters, 5, 6129-6136. [Google Scholar] [CrossRef
[16] Joglekar, O., Kozlovsky, S., Lancewicki, T., et al. (2024) Towards Natural Language-Driven Industrial Assembly Using Foundation Models. ICLR 2024 Workshop on Large Language Model (LLM) Agents, Vienna, 7-11 May 2024.
[17] Khader, S.A., Yin, H., Falco, P. and Kragic, D. (2021) Stability-Guaranteed Reinforcement Learning for Contact-Rich Manipulation. IEEE Robotics and Automation Letters, 6, 1-8. [Google Scholar] [CrossRef
[18] Feng, X., Shi, T., Li, W., Lu, P. and Pan, Y. (2022) Reinforcement Learning-Based Impedance Learning for Robot Admittance Control in Industrial Assembly. 2022 International Conference on Advanced Robotics and Mechatronics (ICARM), Guilin, 9-11 July 2022, 1092-1097. [Google Scholar] [CrossRef