Cargando…

Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems

The aim of multi-agent reinforcement learning systems is to provide interacting agents with the ability to collaboratively learn and adapt to the behavior of other agents. Typically, an agent receives its private observations providing a partial view of the true state of the environment. However, in...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gu, Shanzhi, Geng, Mingyang, Lan, Long
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8469175/ https://www.ncbi.nlm.nih.gov/pubmed/34573757 http://dx.doi.org/10.3390/e23091133

_version_	1784573863248527360
author	Gu, Shanzhi Geng, Mingyang Lan, Long
author_facet	Gu, Shanzhi Geng, Mingyang Lan, Long
author_sort	Gu, Shanzhi
collection	PubMed
description	The aim of multi-agent reinforcement learning systems is to provide interacting agents with the ability to collaboratively learn and adapt to the behavior of other agents. Typically, an agent receives its private observations providing a partial view of the true state of the environment. However, in realistic settings, the harsh environment might cause one or more agents to show arbitrarily faulty or malicious behavior, which may suffice to allow the current coordination mechanisms fail. In this paper, we study a practical scenario of multi-agent reinforcement learning systems considering the security issues in the presence of agents with arbitrarily faulty or malicious behavior. The previous state-of-the-art work that coped with extremely noisy environments was designed on the basis that the noise intensity in the environment was known in advance. However, when the noise intensity changes, the existing method has to adjust the configuration of the model to learn in new environments, which limits the practical applications. To overcome these difficulties, we present an Attention-based Fault-Tolerant (FT-Attn) model, which can select not only correct, but also relevant information for each agent at every time step in noisy environments. The multihead attention mechanism enables the agents to learn effective communication policies through experience concurrent with the action policies. Empirical results showed that FT-Attn beats previous state-of-the-art methods in some extremely noisy environments in both cooperative and competitive scenarios, much closer to the upper-bound performance. Furthermore, FT-Attn maintains a more general fault tolerance ability and does not rely on the prior knowledge about the noise intensity of the environment.
format	Online Article Text
id	pubmed-8469175
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-84691752021-09-27 Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems Gu, Shanzhi Geng, Mingyang Lan, Long Entropy (Basel) Article The aim of multi-agent reinforcement learning systems is to provide interacting agents with the ability to collaboratively learn and adapt to the behavior of other agents. Typically, an agent receives its private observations providing a partial view of the true state of the environment. However, in realistic settings, the harsh environment might cause one or more agents to show arbitrarily faulty or malicious behavior, which may suffice to allow the current coordination mechanisms fail. In this paper, we study a practical scenario of multi-agent reinforcement learning systems considering the security issues in the presence of agents with arbitrarily faulty or malicious behavior. The previous state-of-the-art work that coped with extremely noisy environments was designed on the basis that the noise intensity in the environment was known in advance. However, when the noise intensity changes, the existing method has to adjust the configuration of the model to learn in new environments, which limits the practical applications. To overcome these difficulties, we present an Attention-based Fault-Tolerant (FT-Attn) model, which can select not only correct, but also relevant information for each agent at every time step in noisy environments. The multihead attention mechanism enables the agents to learn effective communication policies through experience concurrent with the action policies. Empirical results showed that FT-Attn beats previous state-of-the-art methods in some extremely noisy environments in both cooperative and competitive scenarios, much closer to the upper-bound performance. Furthermore, FT-Attn maintains a more general fault tolerance ability and does not rely on the prior knowledge about the noise intensity of the environment. MDPI 2021-08-31 /pmc/articles/PMC8469175/ /pubmed/34573757 http://dx.doi.org/10.3390/e23091133 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Gu, Shanzhi Geng, Mingyang Lan, Long Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
title	Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
title_full	Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
title_fullStr	Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
title_full_unstemmed	Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
title_short	Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
title_sort	attention-based fault-tolerant approach for multi-agent reinforcement learning systems
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8469175/ https://www.ncbi.nlm.nih.gov/pubmed/34573757 http://dx.doi.org/10.3390/e23091133
work_keys_str_mv	AT gushanzhi attentionbasedfaulttolerantapproachformultiagentreinforcementlearningsystems AT gengmingyang attentionbasedfaulttolerantapproachformultiagentreinforcementlearningsystems AT lanlong attentionbasedfaulttolerantapproachformultiagentreinforcementlearningsystems

Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems

Ejemplares similares