Attitude Control of a Moving Mass–Actuated UAV Based on Deep Reinforcement Learning

Qiu, Xiaoqi; Gao, Changsheng; Wang, Kefan; Jing, Wuxing

doi:10.1061/(ASCE)AS.1943-5525.0001381

Technical Papers

Dec 1, 2021

Attitude Control of a Moving Mass–Actuated UAV Based on Deep Reinforcement Learning

Authors: Xiaoqi Qiu https://orcid.org/0000-0002-6912-316X [email protected], Changsheng Gao [email protected], Kefan Wang [email protected], and Wuxing Jing [email protected]Author Affiliations

Publication: Journal of Aerospace Engineering

Volume 35, Issue 2

https://doi.org/10.1061/(ASCE)AS.1943-5525.0001381

Get Access

Abstract

A moving mass–actuated unmanned aerial vehicle (MAUAV) is controlled by mass sliders installed inside the airframe and has the advantages of high aerodynamic efficiency and good stealth performance. However, designing a controller for it faces severe challenges due to the strong nonlinearity and coupling of its dynamics. To this end, we proposed an attitude controller based on deep reinforcement learning for the MAUAV. It directly maps the states to the needed deflection of the actuators and is an end-to-end controller. For the sparse reward problem, the reward function required for training is reasonably designed through reward shaping to hasten the algorithm’s training speed. In training, random initialization and parameter perturbation are used to strengthen the final policy’s robustness further. The simulation results tentatively demonstrate that the proposed controller is not only robust but suboptimal. Compared with an active disturbance rejection controller (ADRC) optimized by the particle swarm algorithm, our controller still guarantees a 100% success rate in multiple unlearned scenarios, meaning it has good generalization ability.

Get full access to this article

View all available purchase options and get full access to this article.

Get Access

Data Availability Statement

Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

This study was supported by the National Natural Science Foundation of China (No. 11572097).

References

Abbeel, P., A. Coates, M. Quigley, and A. Y. Ng. 2006. “An application of reinforcement learning to aerobatic helicopter flight.” In Proc., 19th Int. Conf. on Neural Information Processing Systems, 1–8. Cambridge, MA: MIT Press.

Abstract

Get full access to this article

Data Availability Statement

Acknowledgments

References

Information

Published In

Copyright

History

Permissions

Authors

Affiliations

Metrics

Citations

Download citation

Cited by

Get Access

Access content

Purchase

ASCE Library Card (5 downloads)

ASCE Library Card (5 downloads)

ASCE Library Card (20 downloads)

ASCE Library Card (20 downloads)

Buy Single Article

Buy Single Article

Get Access

Access content

Purchase

ASCE Library Card (5 downloads)

ASCE Library Card (5 downloads)

ASCE Library Card (20 downloads)

ASCE Library Card (20 downloads)

Buy Single Article

Buy Single Article

Figures

Other

Share

Copy the content Link

Share with email

Share

Request Username

Create a new account

Change Password

Password Changed Successfully

Verify Phone

Congrats!