TD3-Based Model Predictive Control for Satellite Formation-Keeping

Hu, Xing; Zhai, Zhi; Liu, Jinxin; Wang, Chenxi; Liu, Naijin; Chen, Xuefeng

doi:10.1061/JAEEEZ.ASENG-5646

Technical Papers

Aug 5, 2024

Authors: Xing Hu, Zhi Zhai (https://orcid.org/0000-0003-1746-260X), Jinxin Liu, Chenxi Wang, Naijin Liu, and Xuefeng Chen (https://orcid.org/0000-0002-0130-3172)

Publication: Journal of Aerospace Engineering

Volume 37, Issue 6

https://doi.org/10.1061/JAEEEZ.ASENG-5646

Abstract

The growing prevalence of formation flying in space missions has led researchers to intensify their focus on designing optimal control systems for satellite formation motion along reference orbits, with the aim of reducing both tracking error and energy consumption. However, conventional controllers typically optimize only one of these objectives well, and tuning their parameters manually is difficult. In this paper, we introduce the twin delayed deep deterministic policy gradient-based model predictive control (TD3-MPC) method. To address the multiobjective formation-keeping problem, a linear model predictive controller based on the satellite's dynamics is developed, and a cost function is formulated to optimize multiple objectives, specifically tracking error and fuel consumption. To address controller parameter tuning, we employ reinforcement learning and design a reward function for the TD3 algorithm that reflects controller performance. Simulation results show that the proposed TD3-MPC algorithm outperforms the linear model predictive controller: under the large-error condition it achieves a 27.83% reduction in tracking error and a 48.30% decrease in fuel consumption, and under the small-error condition it achieves a 3.67% reduction in tracking error and a 22.27% decrease in fuel consumption. By combining the strengths of reinforcement learning and model predictive control, TD3-MPC enables the satellite to follow its intended trajectory more precisely, thereby supporting the stability and desired operational performance of the satellite formation.
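The abstract does not give the cost or reward formulas, so the following is only a minimal sketch of how such a trade-off is commonly expressed, not the authors' formulation. It assumes a quadratic MPC cost with a scalar tracking weight `q` and a scalar control weight `r` (the kind of parameters a TD3 agent could tune), and a hypothetical reward equal to the negative weighted sum of accumulated tracking error and a fuel proxy. All function names, weights, and the fuel proxy are illustrative assumptions.

```python
from typing import Sequence

Vector = Sequence[float]


def stage_cost(error: Vector, control: Vector, q: float, r: float) -> float:
    """Assumed quadratic stage cost: q * ||e||^2 + r * ||u||^2."""
    tracking = sum(e * e for e in error)
    effort = sum(u * u for u in control)
    return q * tracking + r * effort


def horizon_cost(errors: Sequence[Vector], controls: Sequence[Vector],
                 q: float, r: float) -> float:
    """Sum of the stage costs over a prediction horizon."""
    return sum(stage_cost(e, u, q, r) for e, u in zip(errors, controls))


def tuning_reward(errors: Sequence[Vector], controls: Sequence[Vector],
                  w_err: float = 1.0, w_fuel: float = 1.0) -> float:
    """Hypothetical reward for a weight-tuning agent.

    Penalizes the summed Euclidean norm of the tracking error and a fuel
    proxy (summed absolute control input) over a closed-loop rollout.
    """
    err = sum(sum(x * x for x in e) ** 0.5 for e in errors)
    fuel = sum(sum(abs(x) for x in u) for u in controls)
    return -(w_err * err + w_fuel * fuel)
```

In a scheme of this kind, the agent's action would be the pair `(q, r)` passed to the MPC solver, and `tuning_reward` would be evaluated on the resulting closed-loop trajectory; raising `w_fuel` relative to `w_err` would push the learned weights toward lower fuel use at the expense of tracking accuracy.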

Data Availability Statement

Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

This work is supported by the National Natural Science Foundation of China (No. U22B2013) and the China Postdoctoral Science Foundation (No. 2021M692589).
