Cargando…

A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning

In this paper, we propose a multiphase semistatic training method for swarm confrontation using multi-agent deep reinforcement learning. In particular, we build a swarm confrontation game, the 3V3 tank fight, based on the Unity platform and train the agents by a MDRL algorithm called MA-POCA, coming...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, He, Luo, Yaoguo, Gao, Huanli, Chi, Jiale, Wang, Shuozhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10348848/
https://www.ncbi.nlm.nih.gov/pubmed/37455769
http://dx.doi.org/10.1155/2023/2955442
_version_ 1785073750037757952
author Cai, He
Luo, Yaoguo
Gao, Huanli
Chi, Jiale
Wang, Shuozhe
author_facet Cai, He
Luo, Yaoguo
Gao, Huanli
Chi, Jiale
Wang, Shuozhe
author_sort Cai, He
collection PubMed
description In this paper, we propose a multiphase semistatic training method for swarm confrontation using multi-agent deep reinforcement learning. In particular, we build a swarm confrontation game, the 3V3 tank fight, based on the Unity platform and train the agents by a MDRL algorithm called MA-POCA, coming with the ML-Agent toolkit. By multiphase learning, we split the traditional single training phase into multiple consecutive training phases, where the performance level of the strong team for each phase increases in an incremental way. On the other hand, by semistatic learning, the strong team in all phases will stop learning when fighting against the weak team, which reduces the possibility that the weak team keeps being defeated and learns nothing at all. Comprehensive experiments prove that, in contrast to the traditional single-phase training method, the multiphase semistatic training method proposed in this paper can significantly increase the training efficiency, shedding lights on how the weak could learn from the strong with less time and computational cost.
format Online
Article
Text
id pubmed-10348848
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-103488482023-07-15 A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning Cai, He Luo, Yaoguo Gao, Huanli Chi, Jiale Wang, Shuozhe Comput Intell Neurosci Research Article In this paper, we propose a multiphase semistatic training method for swarm confrontation using multi-agent deep reinforcement learning. In particular, we build a swarm confrontation game, the 3V3 tank fight, based on the Unity platform and train the agents by a MDRL algorithm called MA-POCA, coming with the ML-Agent toolkit. By multiphase learning, we split the traditional single training phase into multiple consecutive training phases, where the performance level of the strong team for each phase increases in an incremental way. On the other hand, by semistatic learning, the strong team in all phases will stop learning when fighting against the weak team, which reduces the possibility that the weak team keeps being defeated and learns nothing at all. Comprehensive experiments prove that, in contrast to the traditional single-phase training method, the multiphase semistatic training method proposed in this paper can significantly increase the training efficiency, shedding lights on how the weak could learn from the strong with less time and computational cost. Hindawi 2023-07-07 /pmc/articles/PMC10348848/ /pubmed/37455769 http://dx.doi.org/10.1155/2023/2955442 Text en Copyright © 2023 He Cai et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Cai, He
Luo, Yaoguo
Gao, Huanli
Chi, Jiale
Wang, Shuozhe
A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning
title A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning
title_full A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning
title_fullStr A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning
title_full_unstemmed A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning
title_short A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning
title_sort multiphase semistatic training method for swarm confrontation using multiagent deep reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10348848/
https://www.ncbi.nlm.nih.gov/pubmed/37455769
http://dx.doi.org/10.1155/2023/2955442
work_keys_str_mv AT caihe amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT luoyaoguo amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT gaohuanli amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT chijiale amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT wangshuozhe amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT caihe multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT luoyaoguo multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT gaohuanli multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT chijiale multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning
AT wangshuozhe multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning