Variable Stiffness Compliant Control Technology for Space Dual-arm Robots Based on DDPG

Wang Siyu; Niu Hairui; Wang Kai; Gu Haiyu*; You Bindi; Zhai Fujun; Tang Jiarui

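Wang Siyu¹, Niu Hairui², Wang Kai², Gu Haiyu*³, You Bindi¹, Zhai Fujun², Tang Jiarui²
1. School of Ocean Engineering, Harbin Institute of Technology (Weihai), Weihai 264209; 2. Beijing Spacecraft Manufacturing Co., Ltd., Beijing 100094; 3. School of Astronautics, Harbin Institute of Technology, Harbin 150001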

Abstract:

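In dual-arm cooperative assembly tasks performed by space robots, the contact state between the manipulators and the assembly components is complex and nonlinear. Existing control techniques do not resolve the excessive dynamic contact disturbances that arise during assembly, which reduce assembly accuracy and can cause task failure or even equipment damage. To address this problem, a cooperative control method based on the Deep Deterministic Policy Gradient (DDPG) algorithm is proposed. A dual-arm cooperative mechanism that adapts to complex changes in contact state is constructed, a dynamic model covering the manipulators and the assembly components is derived, and the dynamics of the space dual-arm robot are analyzed under fast, medium-speed, and slow docking conditions. The DDPG reinforcement learning algorithm is then used to reduce the contact force on the robot during docking. Simulation and experimental validation show that the method significantly improves assembly accuracy (by 20%), raises the assembly success rate (to 80.25%), and lowers the instantaneous contact force (by 30%), and that it effectively regulates the stability and reliability of the cooperative assembly process of the space dual-arm robot.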

Keywords: reinforcement learning; space robot; dual-arm cooperative control; contact collision model; robot dynamics

Funding: National Natural Science Foundation of China (52475009; 52175082).

Variable Stiffness Compliant Control Technology for Space Dual-arm Robots Based on DDPG

Wang Siyu¹, Niu Hairui², Wang Kai², Gu Haiyu*³, You Bindi¹, Zhai Fujun², Tang Jiarui²

1. School of Ocean Engineering, Harbin Institute of Technology (Weihai), Weihai 264209; 2. Beijing Spacecraft Manufacturing Co., Ltd., Beijing 100094; 3. School of Astronautics, Harbin Institute of Technology, Harbin 150001

Abstract:

In dual-arm cooperative assembly tasks for space robots, the contact state between the robotic arms and the assembly components is complex and nonlinear. Existing control techniques have not resolved the excessive dynamic contact disturbances that occur during assembly, which lead to reduced assembly accuracy, task failure, and even equipment damage. To resolve this issue, this paper proposes a cooperative control method based on the Deep Deterministic Policy Gradient (DDPG) algorithm. A dual-arm cooperative mechanism that adapts to complex contact state changes is constructed, and a dynamic model of the robotic arms and assembly components is derived. A dynamic analysis of the space dual-arm robot is carried out under rapid, medium, and slow docking conditions. The DDPG reinforcement learning algorithm is applied to reduce the contact force during docking. Simulation and experimental validation show that the proposed method significantly improves assembly accuracy (by 20%), raises the assembly success rate (to 80.25%), and reduces the instantaneous contact force (by 30%). The approach effectively regulates the stability and reliability of the cooperative assembly process of the space dual-arm robot.

Key words: reinforcement learning; space robot; dual-arm cooperative control; contact collision model; robot dynamics
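
As a rough illustration of why adjusting stiffness at the moment of contact can lower the peak contact force, the following toy sketch simulates a single point mass pressing against a spring-like surface under an impedance law. It is not the paper's method: the one-dimensional model, all parameter values, and the hand-written `switching_policy` (a stand-in for a trained DDPG actor that would output the stiffness) are assumptions made purely for illustration.

```python
"""Toy 1-DOF contact model: fixed vs. contact-dependent stiffness (illustrative only)."""

# All values below are hypothetical and chosen only to make the effect visible.
MASS = 1.0              # kg, end-effector mass
ENV_STIFFNESS = 5000.0  # N/m, stiffness of the contacted surface (located at x = 0)
DAMPING = 89.44         # N*s/m, constant controller damping
K_HIGH = 2000.0         # N/m, stiff setting used in free motion
K_LOW = 200.0           # N/m, compliant setting used while in contact
X_START = -0.05         # m, start 5 cm in front of the surface
X_TARGET = 0.01         # m, commanded position slightly beyond the surface
DT = 1e-4               # s, integration step
STEPS = 20000           # 2 s of simulated time


def fixed_policy(contact_force):
    """Baseline: always use the stiff setting."""
    return K_HIGH


def switching_policy(contact_force):
    """Stand-in for a learned actor: become compliant once contact is sensed."""
    return K_LOW if contact_force > 0.0 else K_HIGH


def peak_contact_force(policy):
    """Simulate the approach and contact; return the largest contact force seen."""
    x, v = X_START, 0.0
    peak = 0.0
    for _ in range(STEPS):
        # Spring-like surface: pushes back only when the mass penetrates x > 0.
        contact_force = ENV_STIFFNESS * max(0.0, x)
        stiffness = policy(contact_force)
        # Impedance law: virtual spring toward the target plus viscous damping.
        command = stiffness * (X_TARGET - x) - DAMPING * v
        accel = (command - contact_force) / MASS
        # Semi-implicit Euler integration.
        v += accel * DT
        x += v * DT
        peak = max(peak, contact_force)
    return peak


peak_fixed = peak_contact_force(fixed_policy)
peak_variable = peak_contact_force(switching_policy)
print(f"fixed stiffness peak force:    {peak_fixed:.2f} N")
print(f"variable stiffness peak force: {peak_variable:.2f} N")
```

Under these assumed parameters, the compliant-on-contact rule produces a lower peak force than the fixed stiff setting. In a learning-based controller, a trained policy would take the place of `switching_policy` and map the observed state to a continuous stiffness value.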