Cargando…

An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning

A pursuit–evasion game is a classical maneuver confrontation problem in the multi-agent systems (MASs) domain. An online decision technique based on deep reinforcement learning (DRL) was developed in this paper to address the problem of environment sensing and decision-making in pursuit–evasion game...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wan, Kaifang, Wu, Dingwei, Zhai, Yiwei, Li, Bo, Gao, Xiaoguang, Hu, Zijian
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8625563/ https://www.ncbi.nlm.nih.gov/pubmed/34828131 http://dx.doi.org/10.3390/e23111433

_version_	1784606452107706368
author	Wan, Kaifang Wu, Dingwei Zhai, Yiwei Li, Bo Gao, Xiaoguang Hu, Zijian
author_facet	Wan, Kaifang Wu, Dingwei Zhai, Yiwei Li, Bo Gao, Xiaoguang Hu, Zijian
author_sort	Wan, Kaifang
collection	PubMed
description	A pursuit–evasion game is a classical maneuver confrontation problem in the multi-agent systems (MASs) domain. An online decision technique based on deep reinforcement learning (DRL) was developed in this paper to address the problem of environment sensing and decision-making in pursuit–evasion games. A control-oriented framework developed from the DRL-based multi-agent deep deterministic policy gradient (MADDPG) algorithm was built to implement multi-agent cooperative decision-making to overcome the limitation of the tedious state variables required for the traditionally complicated modeling process. To address the effects of errors between a model and a real scenario, this paper introduces adversarial disturbances. It also proposes a novel adversarial attack trick and adversarial learning MADDPG (A2-MADDPG) algorithm. By introducing an adversarial attack trick for the agents themselves, uncertainties of the real world are modeled, thereby optimizing robust training. During the training process, adversarial learning was incorporated into our algorithm to preprocess the actions of multiple agents, which enabled them to properly respond to uncertain dynamic changes in MASs. Experimental results verified that the proposed approach provides superior performance and effectiveness for pursuers and evaders, and both can learn the corresponding confrontational strategy during training.
format	Online Article Text
id	pubmed-8625563
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-86255632021-11-27 An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning Wan, Kaifang Wu, Dingwei Zhai, Yiwei Li, Bo Gao, Xiaoguang Hu, Zijian Entropy (Basel) Article A pursuit–evasion game is a classical maneuver confrontation problem in the multi-agent systems (MASs) domain. An online decision technique based on deep reinforcement learning (DRL) was developed in this paper to address the problem of environment sensing and decision-making in pursuit–evasion games. A control-oriented framework developed from the DRL-based multi-agent deep deterministic policy gradient (MADDPG) algorithm was built to implement multi-agent cooperative decision-making to overcome the limitation of the tedious state variables required for the traditionally complicated modeling process. To address the effects of errors between a model and a real scenario, this paper introduces adversarial disturbances. It also proposes a novel adversarial attack trick and adversarial learning MADDPG (A2-MADDPG) algorithm. By introducing an adversarial attack trick for the agents themselves, uncertainties of the real world are modeled, thereby optimizing robust training. During the training process, adversarial learning was incorporated into our algorithm to preprocess the actions of multiple agents, which enabled them to properly respond to uncertain dynamic changes in MASs. Experimental results verified that the proposed approach provides superior performance and effectiveness for pursuers and evaders, and both can learn the corresponding confrontational strategy during training. MDPI 2021-10-29 /pmc/articles/PMC8625563/ /pubmed/34828131 http://dx.doi.org/10.3390/e23111433 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wan, Kaifang Wu, Dingwei Zhai, Yiwei Li, Bo Gao, Xiaoguang Hu, Zijian An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning
title	An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning
title_full	An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning
title_fullStr	An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning
title_full_unstemmed	An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning
title_short	An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning
title_sort	improved approach towards multi-agent pursuit–evasion game decision-making using deep reinforcement learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8625563/ https://www.ncbi.nlm.nih.gov/pubmed/34828131 http://dx.doi.org/10.3390/e23111433
work_keys_str_mv	AT wankaifang animprovedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT wudingwei animprovedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT zhaiyiwei animprovedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT libo animprovedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT gaoxiaoguang animprovedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT huzijian animprovedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT wankaifang improvedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT wudingwei improvedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT zhaiyiwei improvedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT libo improvedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT gaoxiaoguang improvedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning AT huzijian improvedapproachtowardsmultiagentpursuitevasiongamedecisionmakingusingdeepreinforcementlearning

An Improved Approach towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning

Ejemplares similares