Cargando…

Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking

Uncertainty of target motion, limited perception ability of onboard cameras, and constrained control have brought new challenges to unmanned aerial vehicle (UAV) dynamic target tracking control. In virtue of the powerful fitting ability and learning ability of the neural network, this paper proposes...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhao, Jiang, Liu, Han, Sun, Jiaming, Wu, Kun, Cai, Zhihao, Ma, Yan, Wang, Yingxun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9680462/ https://www.ncbi.nlm.nih.gov/pubmed/36412725 http://dx.doi.org/10.3390/biomimetics7040197

_version_	1784834423823269888
author	Zhao, Jiang Liu, Han Sun, Jiaming Wu, Kun Cai, Zhihao Ma, Yan Wang, Yingxun
author_facet	Zhao, Jiang Liu, Han Sun, Jiaming Wu, Kun Cai, Zhihao Ma, Yan Wang, Yingxun
author_sort	Zhao, Jiang
collection	PubMed
description	Uncertainty of target motion, limited perception ability of onboard cameras, and constrained control have brought new challenges to unmanned aerial vehicle (UAV) dynamic target tracking control. In virtue of the powerful fitting ability and learning ability of the neural network, this paper proposes a new deep reinforcement learning (DRL)-based end-to-end control method for UAV dynamic target tracking. Firstly, a DRL-based framework using onboard camera image is established, which simplifies the traditional modularization paradigm. Secondly, neural network architecture, reward functions, and soft actor-critic (SAC)-based speed command perception algorithm are designed to train the policy network. The output of the policy network is denormalized and directly used as speed control command, which realizes the UAV dynamic target tracking. Finally, the feasibility of the proposed end-to-end control method is demonstrated by numerical simulation. The results show that the proposed DRL-based framework is feasible to simplify the traditional modularization paradigm. The UAV can track the dynamic target with rapidly changing of speed and direction.
format	Online Article Text
id	pubmed-9680462
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-96804622022-11-23 Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking Zhao, Jiang Liu, Han Sun, Jiaming Wu, Kun Cai, Zhihao Ma, Yan Wang, Yingxun Biomimetics (Basel) Article Uncertainty of target motion, limited perception ability of onboard cameras, and constrained control have brought new challenges to unmanned aerial vehicle (UAV) dynamic target tracking control. In virtue of the powerful fitting ability and learning ability of the neural network, this paper proposes a new deep reinforcement learning (DRL)-based end-to-end control method for UAV dynamic target tracking. Firstly, a DRL-based framework using onboard camera image is established, which simplifies the traditional modularization paradigm. Secondly, neural network architecture, reward functions, and soft actor-critic (SAC)-based speed command perception algorithm are designed to train the policy network. The output of the policy network is denormalized and directly used as speed control command, which realizes the UAV dynamic target tracking. Finally, the feasibility of the proposed end-to-end control method is demonstrated by numerical simulation. The results show that the proposed DRL-based framework is feasible to simplify the traditional modularization paradigm. The UAV can track the dynamic target with rapidly changing of speed and direction. MDPI 2022-11-11 /pmc/articles/PMC9680462/ /pubmed/36412725 http://dx.doi.org/10.3390/biomimetics7040197 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhao, Jiang Liu, Han Sun, Jiaming Wu, Kun Cai, Zhihao Ma, Yan Wang, Yingxun Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking
title	Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking
title_full	Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking
title_fullStr	Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking
title_full_unstemmed	Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking
title_short	Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking
title_sort	deep reinforcement learning-based end-to-end control for uav dynamic target tracking
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9680462/ https://www.ncbi.nlm.nih.gov/pubmed/36412725 http://dx.doi.org/10.3390/biomimetics7040197
work_keys_str_mv	AT zhaojiang deepreinforcementlearningbasedendtoendcontrolforuavdynamictargettracking AT liuhan deepreinforcementlearningbasedendtoendcontrolforuavdynamictargettracking AT sunjiaming deepreinforcementlearningbasedendtoendcontrolforuavdynamictargettracking AT wukun deepreinforcementlearningbasedendtoendcontrolforuavdynamictargettracking AT caizhihao deepreinforcementlearningbasedendtoendcontrolforuavdynamictargettracking AT mayan deepreinforcementlearningbasedendtoendcontrolforuavdynamictargettracking AT wangyingxun deepreinforcementlearningbasedendtoendcontrolforuavdynamictargettracking

Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking

Ejemplares similares