Cargando…
A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning
The unmanned aerial vehicle (UAV) swarm is regarded as having a significant role in modern warfare. The demand for UAV swarms with the capability of attack-defense confrontation is urgent. The existing decision-making methods of UAV swarm confrontation, such as multi-agent reinforcement learning (MA...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10296010/ https://www.ncbi.nlm.nih.gov/pubmed/37366817 http://dx.doi.org/10.3390/biomimetics8020222 |
_version_ | 1785063556807393280 |
---|---|
author | Chi, Pei Wei, Jiahong Wu, Kun Di, Bin Wang, Yingxun |
author_facet | Chi, Pei Wei, Jiahong Wu, Kun Di, Bin Wang, Yingxun |
author_sort | Chi, Pei |
collection | PubMed |
description | The unmanned aerial vehicle (UAV) swarm is regarded as having a significant role in modern warfare. The demand for UAV swarms with the capability of attack-defense confrontation is urgent. The existing decision-making methods of UAV swarm confrontation, such as multi-agent reinforcement learning (MARL), suffer from an exponential increase in training time as the size of the swarm increases. Inspired by group hunting behavior in nature, this paper presents a new bio-inspired decision-making method for UAV swarms for attack-defense confrontation via MARL. Firstly, a UAV swarm decision-making framework for confrontation based on grouping mechanisms is established. Secondly, a bio-inspired action space is designed, and a dense reward is added to the reward function to accelerate the convergence speed of training. Finally, numerical experiments are conducted to evaluate the performance of our method. The experiment results show that the proposed method can be applied to a swarm of 12 UAVs, and when the maximum acceleration of the enemy UAV is within 2.5 times ours, the swarm can well intercept the enemy, and the success rate is above 91%. |
format | Online Article Text |
id | pubmed-10296010 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102960102023-06-28 A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning Chi, Pei Wei, Jiahong Wu, Kun Di, Bin Wang, Yingxun Biomimetics (Basel) Article The unmanned aerial vehicle (UAV) swarm is regarded as having a significant role in modern warfare. The demand for UAV swarms with the capability of attack-defense confrontation is urgent. The existing decision-making methods of UAV swarm confrontation, such as multi-agent reinforcement learning (MARL), suffer from an exponential increase in training time as the size of the swarm increases. Inspired by group hunting behavior in nature, this paper presents a new bio-inspired decision-making method for UAV swarms for attack-defense confrontation via MARL. Firstly, a UAV swarm decision-making framework for confrontation based on grouping mechanisms is established. Secondly, a bio-inspired action space is designed, and a dense reward is added to the reward function to accelerate the convergence speed of training. Finally, numerical experiments are conducted to evaluate the performance of our method. The experiment results show that the proposed method can be applied to a swarm of 12 UAVs, and when the maximum acceleration of the enemy UAV is within 2.5 times ours, the swarm can well intercept the enemy, and the success rate is above 91%. MDPI 2023-05-25 /pmc/articles/PMC10296010/ /pubmed/37366817 http://dx.doi.org/10.3390/biomimetics8020222 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Chi, Pei Wei, Jiahong Wu, Kun Di, Bin Wang, Yingxun A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning |
title | A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning |
title_full | A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning |
title_fullStr | A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning |
title_full_unstemmed | A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning |
title_short | A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning |
title_sort | bio-inspired decision-making method of uav swarm for attack-defense confrontation via multi-agent reinforcement learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10296010/ https://www.ncbi.nlm.nih.gov/pubmed/37366817 http://dx.doi.org/10.3390/biomimetics8020222 |
work_keys_str_mv | AT chipei abioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT weijiahong abioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT wukun abioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT dibin abioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT wangyingxun abioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT chipei bioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT weijiahong bioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT wukun bioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT dibin bioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning AT wangyingxun bioinspireddecisionmakingmethodofuavswarmforattackdefenseconfrontationviamultiagentreinforcementlearning |