Cargando…
A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning
In this paper, we propose a multiphase semistatic training method for swarm confrontation using multi-agent deep reinforcement learning. In particular, we build a swarm confrontation game, the 3V3 tank fight, based on the Unity platform and train the agents by a MDRL algorithm called MA-POCA, coming...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10348848/ https://www.ncbi.nlm.nih.gov/pubmed/37455769 http://dx.doi.org/10.1155/2023/2955442 |
_version_ | 1785073750037757952 |
---|---|
author | Cai, He Luo, Yaoguo Gao, Huanli Chi, Jiale Wang, Shuozhe |
author_facet | Cai, He Luo, Yaoguo Gao, Huanli Chi, Jiale Wang, Shuozhe |
author_sort | Cai, He |
collection | PubMed |
description | In this paper, we propose a multiphase semistatic training method for swarm confrontation using multi-agent deep reinforcement learning. In particular, we build a swarm confrontation game, the 3V3 tank fight, based on the Unity platform and train the agents by a MDRL algorithm called MA-POCA, coming with the ML-Agent toolkit. By multiphase learning, we split the traditional single training phase into multiple consecutive training phases, where the performance level of the strong team for each phase increases in an incremental way. On the other hand, by semistatic learning, the strong team in all phases will stop learning when fighting against the weak team, which reduces the possibility that the weak team keeps being defeated and learns nothing at all. Comprehensive experiments prove that, in contrast to the traditional single-phase training method, the multiphase semistatic training method proposed in this paper can significantly increase the training efficiency, shedding lights on how the weak could learn from the strong with less time and computational cost. |
format | Online Article Text |
id | pubmed-10348848 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-103488482023-07-15 A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning Cai, He Luo, Yaoguo Gao, Huanli Chi, Jiale Wang, Shuozhe Comput Intell Neurosci Research Article In this paper, we propose a multiphase semistatic training method for swarm confrontation using multi-agent deep reinforcement learning. In particular, we build a swarm confrontation game, the 3V3 tank fight, based on the Unity platform and train the agents by a MDRL algorithm called MA-POCA, coming with the ML-Agent toolkit. By multiphase learning, we split the traditional single training phase into multiple consecutive training phases, where the performance level of the strong team for each phase increases in an incremental way. On the other hand, by semistatic learning, the strong team in all phases will stop learning when fighting against the weak team, which reduces the possibility that the weak team keeps being defeated and learns nothing at all. Comprehensive experiments prove that, in contrast to the traditional single-phase training method, the multiphase semistatic training method proposed in this paper can significantly increase the training efficiency, shedding lights on how the weak could learn from the strong with less time and computational cost. Hindawi 2023-07-07 /pmc/articles/PMC10348848/ /pubmed/37455769 http://dx.doi.org/10.1155/2023/2955442 Text en Copyright © 2023 He Cai et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Cai, He Luo, Yaoguo Gao, Huanli Chi, Jiale Wang, Shuozhe A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning |
title | A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning |
title_full | A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning |
title_fullStr | A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning |
title_full_unstemmed | A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning |
title_short | A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning |
title_sort | multiphase semistatic training method for swarm confrontation using multiagent deep reinforcement learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10348848/ https://www.ncbi.nlm.nih.gov/pubmed/37455769 http://dx.doi.org/10.1155/2023/2955442 |
work_keys_str_mv | AT caihe amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT luoyaoguo amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT gaohuanli amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT chijiale amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT wangshuozhe amultiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT caihe multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT luoyaoguo multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT gaohuanli multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT chijiale multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning AT wangshuozhe multiphasesemistatictrainingmethodforswarmconfrontationusingmultiagentdeepreinforcementlearning |