Cargando…

Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing

Planetary soft landing has been studied extensively due to its promising application prospects. In this paper, a soft landing control algorithm based on deep reinforcement learning (DRL) with good convergence property is proposed. First, the soft landing problem of the powered descent phase is formu...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Xibao, Chen, Yushen, Bai, Chengchao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662435/
https://www.ncbi.nlm.nih.gov/pubmed/34884162
http://dx.doi.org/10.3390/s21238161
_version_ 1784613435867136000
author Xu, Xibao
Chen, Yushen
Bai, Chengchao
author_facet Xu, Xibao
Chen, Yushen
Bai, Chengchao
author_sort Xu, Xibao
collection PubMed
description Planetary soft landing has been studied extensively due to its promising application prospects. In this paper, a soft landing control algorithm based on deep reinforcement learning (DRL) with good convergence property is proposed. First, the soft landing problem of the powered descent phase is formulated and the theoretical basis of Reinforcement Learning (RL) used in this paper is introduced. Second, to make it easier to converge, a reward function is designed to include process rewards like velocity tracking reward, solving the problem of sparse reward. Then, by including the fuel consumption penalty and constraints violation penalty, the lander can learn to achieve velocity tracking goal while saving fuel and keeping attitude angle within safe ranges. Then, simulations of training are carried out under the frameworks of Deep deterministic policy gradient (DDPG), Twin Delayed DDPG (TD3), and Soft Actor Critic (SAC), respectively, which are of the classical RL frameworks, and all converged. Finally, the trained policy is deployed into velocity tracking and soft landing experiments, results of which demonstrate the validity of the algorithm proposed.
format Online
Article
Text
id pubmed-8662435
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86624352021-12-11 Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing Xu, Xibao Chen, Yushen Bai, Chengchao Sensors (Basel) Article Planetary soft landing has been studied extensively due to its promising application prospects. In this paper, a soft landing control algorithm based on deep reinforcement learning (DRL) with good convergence property is proposed. First, the soft landing problem of the powered descent phase is formulated and the theoretical basis of Reinforcement Learning (RL) used in this paper is introduced. Second, to make it easier to converge, a reward function is designed to include process rewards like velocity tracking reward, solving the problem of sparse reward. Then, by including the fuel consumption penalty and constraints violation penalty, the lander can learn to achieve velocity tracking goal while saving fuel and keeping attitude angle within safe ranges. Then, simulations of training are carried out under the frameworks of Deep deterministic policy gradient (DDPG), Twin Delayed DDPG (TD3), and Soft Actor Critic (SAC), respectively, which are of the classical RL frameworks, and all converged. Finally, the trained policy is deployed into velocity tracking and soft landing experiments, results of which demonstrate the validity of the algorithm proposed. MDPI 2021-12-06 /pmc/articles/PMC8662435/ /pubmed/34884162 http://dx.doi.org/10.3390/s21238161 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xu, Xibao
Chen, Yushen
Bai, Chengchao
Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing
title Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing
title_full Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing
title_fullStr Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing
title_full_unstemmed Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing
title_short Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing
title_sort deep reinforcement learning-based accurate control of planetary soft landing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662435/
https://www.ncbi.nlm.nih.gov/pubmed/34884162
http://dx.doi.org/10.3390/s21238161
work_keys_str_mv AT xuxibao deepreinforcementlearningbasedaccuratecontrolofplanetarysoftlanding
AT chenyushen deepreinforcementlearningbasedaccuratecontrolofplanetarysoftlanding
AT baichengchao deepreinforcementlearningbasedaccuratecontrolofplanetarysoftlanding