Cargando…

Path Following Control for Underactuated Airships with Magnitude and Rate Saturation

This paper proposes a reinforcement learning (RL) based path following strategy for underactuated airships with magnitude and rate saturation. The Markov decision process (MDP) model for the control problem is established. Then an error bounded line-of-sight (LOS) guidance law is investigated to res...

Descripción completa

Detalles Bibliográficos
Autores principales: Gou, Huabei, Guo, Xiao, Lou, Wenjie, Ou, Jiajun, Yuan, Jiace
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7765289/
https://www.ncbi.nlm.nih.gov/pubmed/33333882
http://dx.doi.org/10.3390/s20247176
_version_ 1783628456949448704
author Gou, Huabei
Guo, Xiao
Lou, Wenjie
Ou, Jiajun
Yuan, Jiace
author_facet Gou, Huabei
Guo, Xiao
Lou, Wenjie
Ou, Jiajun
Yuan, Jiace
author_sort Gou, Huabei
collection PubMed
description This paper proposes a reinforcement learning (RL) based path following strategy for underactuated airships with magnitude and rate saturation. The Markov decision process (MDP) model for the control problem is established. Then an error bounded line-of-sight (LOS) guidance law is investigated to restrain the state space. Subsequently, a proximal policy optimization (PPO) algorithm is employed to approximate the optimal action policy through trial and error. Since the optimal action policy is generated from the action space, the magnitude and rate saturation can be avoided. The simulation results, involving circular, general, broken-line, and anti-wind path following tasks, demonstrate that the proposed control scheme can transfer to new tasks without adaptation, and possesses satisfying real-time performance and robustness.
format Online
Article
Text
id pubmed-7765289
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77652892020-12-27 Path Following Control for Underactuated Airships with Magnitude and Rate Saturation Gou, Huabei Guo, Xiao Lou, Wenjie Ou, Jiajun Yuan, Jiace Sensors (Basel) Article This paper proposes a reinforcement learning (RL) based path following strategy for underactuated airships with magnitude and rate saturation. The Markov decision process (MDP) model for the control problem is established. Then an error bounded line-of-sight (LOS) guidance law is investigated to restrain the state space. Subsequently, a proximal policy optimization (PPO) algorithm is employed to approximate the optimal action policy through trial and error. Since the optimal action policy is generated from the action space, the magnitude and rate saturation can be avoided. The simulation results, involving circular, general, broken-line, and anti-wind path following tasks, demonstrate that the proposed control scheme can transfer to new tasks without adaptation, and possesses satisfying real-time performance and robustness. MDPI 2020-12-15 /pmc/articles/PMC7765289/ /pubmed/33333882 http://dx.doi.org/10.3390/s20247176 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gou, Huabei
Guo, Xiao
Lou, Wenjie
Ou, Jiajun
Yuan, Jiace
Path Following Control for Underactuated Airships with Magnitude and Rate Saturation
title Path Following Control for Underactuated Airships with Magnitude and Rate Saturation
title_full Path Following Control for Underactuated Airships with Magnitude and Rate Saturation
title_fullStr Path Following Control for Underactuated Airships with Magnitude and Rate Saturation
title_full_unstemmed Path Following Control for Underactuated Airships with Magnitude and Rate Saturation
title_short Path Following Control for Underactuated Airships with Magnitude and Rate Saturation
title_sort path following control for underactuated airships with magnitude and rate saturation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7765289/
https://www.ncbi.nlm.nih.gov/pubmed/33333882
http://dx.doi.org/10.3390/s20247176
work_keys_str_mv AT gouhuabei pathfollowingcontrolforunderactuatedairshipswithmagnitudeandratesaturation
AT guoxiao pathfollowingcontrolforunderactuatedairshipswithmagnitudeandratesaturation
AT louwenjie pathfollowingcontrolforunderactuatedairshipswithmagnitudeandratesaturation
AT oujiajun pathfollowingcontrolforunderactuatedairshipswithmagnitudeandratesaturation
AT yuanjiace pathfollowingcontrolforunderactuatedairshipswithmagnitudeandratesaturation