Cargando…

Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning

In this paper, a cooperative search method for multiple UAVs is proposed to solve the problem of low efficiency of multi-UAV task execution by using a cooperative game with incomplete information. To improve search efficiency, CBBA (Consensus-Based Bundle Algorithm) is applied to designate the tasks...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Mingsheng, Zhang, Xiaoxuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9501099/
https://www.ncbi.nlm.nih.gov/pubmed/36146083
http://dx.doi.org/10.3390/s22186737
_version_ 1784795390404460544
author Gao, Mingsheng
Zhang, Xiaoxuan
author_facet Gao, Mingsheng
Zhang, Xiaoxuan
author_sort Gao, Mingsheng
collection PubMed
description In this paper, a cooperative search method for multiple UAVs is proposed to solve the problem of low efficiency of multi-UAV task execution by using a cooperative game with incomplete information. To improve search efficiency, CBBA (Consensus-Based Bundle Algorithm) is applied to designate the tasks area for each UAV. Then, Independent Deep Reinforcement Learning (IDRL) is used to solve Nash equilibrium to improve UAVs’ collaborations. The proposed reward function is smartly developed to guide UAVs to fly along the path with higher reward value while avoiding the collisions between UAVs during flights. Finally, extensive experiments are carried out to compare our proposed method with other algorithms. Simulation results show that the proposed method can obtain more rewards in the same period of time as other algorithms.
format Online
Article
Text
id pubmed-9501099
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95010992022-09-24 Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning Gao, Mingsheng Zhang, Xiaoxuan Sensors (Basel) Article In this paper, a cooperative search method for multiple UAVs is proposed to solve the problem of low efficiency of multi-UAV task execution by using a cooperative game with incomplete information. To improve search efficiency, CBBA (Consensus-Based Bundle Algorithm) is applied to designate the tasks area for each UAV. Then, Independent Deep Reinforcement Learning (IDRL) is used to solve Nash equilibrium to improve UAVs’ collaborations. The proposed reward function is smartly developed to guide UAVs to fly along the path with higher reward value while avoiding the collisions between UAVs during flights. Finally, extensive experiments are carried out to compare our proposed method with other algorithms. Simulation results show that the proposed method can obtain more rewards in the same period of time as other algorithms. MDPI 2022-09-06 /pmc/articles/PMC9501099/ /pubmed/36146083 http://dx.doi.org/10.3390/s22186737 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gao, Mingsheng
Zhang, Xiaoxuan
Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning
title Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning
title_full Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning
title_fullStr Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning
title_full_unstemmed Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning
title_short Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning
title_sort cooperative search method for multiple uavs based on deep reinforcement learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9501099/
https://www.ncbi.nlm.nih.gov/pubmed/36146083
http://dx.doi.org/10.3390/s22186737
work_keys_str_mv AT gaomingsheng cooperativesearchmethodformultipleuavsbasedondeepreinforcementlearning
AT zhangxiaoxuan cooperativesearchmethodformultipleuavsbasedondeepreinforcementlearning