Technical Papers
Aug 11, 2021

Coordinated Control Based on Reinforcement Learning for Dual-Arm Continuum Manipulators in Space Capture Missions

Publication: Journal of Aerospace Engineering
Volume 34, Issue 6

Abstract

The increasing number of defunct and fragmented spacecraft poses a growing hazard to operational on-orbit assets. Highly flexible, redundant continuum manipulators give dual-arm robotic systems clear advantages in active debris removal missions. Existing autonomously coordinated control approaches for dual-arm continuum manipulators require a real-time inverse kinematic solution and a safety mechanism against possible collisions, both of which are difficult to scale to space debris capture systems with high-speed maneuverability. In this paper, collision avoidance and input saturation are considered in a multiagent reinforcement learning approach, the multiagent twin delayed deep deterministic policy gradient (MATD3), which generates real-time inverse kinematic solutions for the coordinated manipulators. During training, MATD3 exhibits less value overestimation than the multiagent deep deterministic policy gradient (MADDPG) algorithm. A feedback dynamics controller is then designed for the continuum manipulators. Guided by the policy networks, each agent plans its joint trajectory online from the collaborator's state and the target debris information. During the capture operation, a competitive anticollision mechanism is built into the reward functions to keep the two arms at a safe distance. Simulation results show that the average accuracy of the proposed approach in inverse kinematic trajectory planning is 42% higher than that of MADDPG, and the designed integrated tracking controller performs capture missions effectively in the simulation environment. Multiagent reinforcement learning thus shows promise for future on-orbit servicing missions.
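
The algorithmic distinction the abstract draws between MATD3 and MADDPG is the TD3-style twin-critic (clipped double-Q) target: two centralized critics are trained toward the minimum of their target estimates, with clipped smoothing noise on the target actions, which curbs the value overestimation mentioned above. The Python sketch below illustrates only that target computation under assumed toy dimensions and a single shared actor; all names and sizes are illustrative assumptions, not the authors' implementation.

# Minimal illustrative sketch (assumed, not the article's code) of the clipped
# double-Q target that distinguishes MATD3 (twin critics) from MADDPG.
import torch
import torch.nn as nn

OBS_DIM, ACT_DIM, N_AGENTS = 8, 2, 2            # assumed toy dimensions (two arms)
JOINT_DIM = N_AGENTS * (OBS_DIM + ACT_DIM)      # centralized critic input size

def mlp(in_dim, out_dim):
    return nn.Sequential(nn.Linear(in_dim, 64), nn.ReLU(), nn.Linear(64, out_dim))

# One shared actor stands in for the per-agent actors to keep the sketch short;
# the critics are centralized and see both agents' observations and actions.
target_actor = mlp(OBS_DIM, ACT_DIM)
target_critic1 = mlp(JOINT_DIM, 1)
target_critic2 = mlp(JOINT_DIM, 1)

def clipped_double_q_target(next_joint_obs, reward, done,
                            gamma=0.99, noise_std=0.2, noise_clip=0.5):
    """Compute y = r + gamma * (1 - done) * min(Q1', Q2') with target-policy smoothing."""
    with torch.no_grad():
        per_agent_obs = next_joint_obs.view(-1, N_AGENTS, OBS_DIM)
        next_actions = target_actor(per_agent_obs)
        noise = (torch.randn_like(next_actions) * noise_std).clamp(-noise_clip, noise_clip)
        next_actions = (next_actions + noise).clamp(-1.0, 1.0)   # bounded (saturated) actions
        critic_in = torch.cat([next_joint_obs, next_actions.flatten(1)], dim=-1)
        q_next = torch.min(target_critic1(critic_in), target_critic2(critic_in))
        return reward + gamma * (1.0 - done) * q_next

# Each critic is then regressed toward this shared target, and actor updates are
# delayed relative to critic updates, as in TD3; MADDPG instead uses a single critic.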

Data Availability Statement

All data, models, and code generated or used during the study appear in the published article.

Acknowledgments

This work was supported by the Key Program of the National Natural Science Foundation of China (No. 91748203) and the Qian Xuesen Laboratory of Space Technology Seed Fund (No. QXSZZJJ03-07).


Information & Authors

Information

Published In

Journal of Aerospace Engineering
Volume 34, Issue 6, November 2021

History

Received: Feb 23, 2021
Accepted: Jun 3, 2021
Published online: Aug 11, 2021
Published in print: Nov 1, 2021
Discussion open until: Jan 11, 2022

Authors

Affiliations

Ph.D. Student, Dept. of Engineering Mechanics, Dalian Univ. of Technology, Dalian 116023, PR China. Email: [email protected]
Haijun Peng [email protected]
Professor, Dept. of Engineering Mechanics, Dalian Univ. of Technology, Dalian 116023, PR China (corresponding author). Email: [email protected]
Professor, Dept. of Engineering Mechanics, Dalian Univ. of Technology, Dalian 116023, PR China. Email: [email protected]
Professor, State Key Laboratory of Structural Analysis for Industrial Equipment, Dalian Univ. of Technology, Dalian 116023, PR China. Email: [email protected]

Cited by

  • An enhanced deep deterministic policy gradient algorithm for intelligent control of robotic arms, Frontiers in Neuroinformatics, 10.3389/fninf.2023.1096053, 17, (2023).
  • Path Planning of a Continuum Robot's End-effector for Assembly Missions in Unstructured Environments, 2022 IEEE 5th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), 10.1109/IMCEC55388.2022.10019843, (539-543), (2022).
  • An Integrated Tracking Control Approach Based on Reinforcement Learning for a Continuum Robot in Space Capture Missions, Journal of Aerospace Engineering, 10.1061/(ASCE)AS.1943-5525.0001426, 35, 5, (2022).
  • Safe reward‐based deep reinforcement learning control for an electro‐hydraulic servo system, International Journal of Robust and Nonlinear Control, 10.1002/rnc.6235, 32, 13, (7646-7662), (2022).
