Cargando…

UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy

Unmanned aerial vehicle (UAV) autonomous tracking and landing is playing an increasingly important role in military and civil applications. In particular, machine learning has been successfully introduced to robotics-related tasks. A novel UAV autonomous tracking and landing approach based on a deep...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xie, Jingyi, Peng, Xiaodong, Wang, Haijiao, Niu, Wenlong, Zheng, Xiao
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582896/ https://www.ncbi.nlm.nih.gov/pubmed/33019747 http://dx.doi.org/10.3390/s20195630

_version_	1783599296434667520
author	Xie, Jingyi Peng, Xiaodong Wang, Haijiao Niu, Wenlong Zheng, Xiao
author_facet	Xie, Jingyi Peng, Xiaodong Wang, Haijiao Niu, Wenlong Zheng, Xiao
author_sort	Xie, Jingyi
collection	PubMed
description	Unmanned aerial vehicle (UAV) autonomous tracking and landing is playing an increasingly important role in military and civil applications. In particular, machine learning has been successfully introduced to robotics-related tasks. A novel UAV autonomous tracking and landing approach based on a deep reinforcement learning strategy is presented in this paper, with the aim of dealing with the UAV motion control problem in an unpredictable and harsh environment. Instead of building a prior model and inferring the landing actions based on heuristic rules, a model-free method based on a partially observable Markov decision process (POMDP) is proposed. In the POMDP model, the UAV automatically learns the landing maneuver by an end-to-end neural network, which combines the Deep Deterministic Policy Gradients (DDPG) algorithm and heuristic rules. A Modular Open Robots Simulation Engine (MORSE)-based reinforcement learning framework is designed and validated with a continuous UAV tracking and landing task on a randomly moving platform in high sensor noise and intermittent measurements. The simulation results show that when the moving platform is moving in different trajectories, the average landing success rate of the proposed algorithm is about 10% higher than that of the Proportional-Integral-Derivative (PID) method. As an indirect result, a state-of-the-art deep reinforcement learning-based UAV control method is validated, where the UAV can learn the optimal strategy of a continuously autonomous landing and perform properly in a simulation environment.
format	Online Article Text
id	pubmed-7582896
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-75828962020-10-28 UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy Xie, Jingyi Peng, Xiaodong Wang, Haijiao Niu, Wenlong Zheng, Xiao Sensors (Basel) Article Unmanned aerial vehicle (UAV) autonomous tracking and landing is playing an increasingly important role in military and civil applications. In particular, machine learning has been successfully introduced to robotics-related tasks. A novel UAV autonomous tracking and landing approach based on a deep reinforcement learning strategy is presented in this paper, with the aim of dealing with the UAV motion control problem in an unpredictable and harsh environment. Instead of building a prior model and inferring the landing actions based on heuristic rules, a model-free method based on a partially observable Markov decision process (POMDP) is proposed. In the POMDP model, the UAV automatically learns the landing maneuver by an end-to-end neural network, which combines the Deep Deterministic Policy Gradients (DDPG) algorithm and heuristic rules. A Modular Open Robots Simulation Engine (MORSE)-based reinforcement learning framework is designed and validated with a continuous UAV tracking and landing task on a randomly moving platform in high sensor noise and intermittent measurements. The simulation results show that when the moving platform is moving in different trajectories, the average landing success rate of the proposed algorithm is about 10% higher than that of the Proportional-Integral-Derivative (PID) method. As an indirect result, a state-of-the-art deep reinforcement learning-based UAV control method is validated, where the UAV can learn the optimal strategy of a continuously autonomous landing and perform properly in a simulation environment. MDPI 2020-10-01 /pmc/articles/PMC7582896/ /pubmed/33019747 http://dx.doi.org/10.3390/s20195630 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Xie, Jingyi Peng, Xiaodong Wang, Haijiao Niu, Wenlong Zheng, Xiao UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy
title	UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy
title_full	UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy
title_fullStr	UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy
title_full_unstemmed	UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy
title_short	UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy
title_sort	uav autonomous tracking and landing based on deep reinforcement learning strategy
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582896/ https://www.ncbi.nlm.nih.gov/pubmed/33019747 http://dx.doi.org/10.3390/s20195630
work_keys_str_mv	AT xiejingyi uavautonomoustrackingandlandingbasedondeepreinforcementlearningstrategy AT pengxiaodong uavautonomoustrackingandlandingbasedondeepreinforcementlearningstrategy AT wanghaijiao uavautonomoustrackingandlandingbasedondeepreinforcementlearningstrategy AT niuwenlong uavautonomoustrackingandlandingbasedondeepreinforcementlearningstrategy AT zhengxiao uavautonomoustrackingandlandingbasedondeepreinforcementlearningstrategy

UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy

Ejemplares similares