Cargando…
Learning Attentional and Gated Communication via Curiosity
Due to the partial observability in decentralized multi-agent systems, communication is critical for cooperation. Furthermore, the ability to decide when and whom to communicate is important to achieve efficient communication. However, the existing methods are typically driven by extrinsic rewards....
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9064529/ https://www.ncbi.nlm.nih.gov/pubmed/35515506 http://dx.doi.org/10.1155/2022/2951193 |
_version_ | 1784699398549143552 |
---|---|
author | Sun, Chuxiong Zhou, Kaijie Cong, Cong Li, Kai Wang, Rui Hu, Xiaohui |
author_facet | Sun, Chuxiong Zhou, Kaijie Cong, Cong Li, Kai Wang, Rui Hu, Xiaohui |
author_sort | Sun, Chuxiong |
collection | PubMed |
description | Due to the partial observability in decentralized multi-agent systems, communication is critical for cooperation. Furthermore, the ability to decide when and whom to communicate is important to achieve efficient communication. However, the existing methods are typically driven by extrinsic rewards. Hence, when the reward from environment is sparse, delayed, or noisy, the communication performance of these methods would be restricted. Furthermore, it would introduce additional difficulty named credit assignment when using extrinsic reward to train communication and sample policies together. To tackle these difficulties, we introduce the mechanism of intrinsic motivation from psychology to multi-agent communication. We hold the view that the observations with more uncertainty and curiosity are more valuable for communication. It can help agent find useful information from observations. It is a good complement to existing extrinsic driven methods. Concretely, at sending end, we learn a curiosity from local observations to model the communication importance. Then, we design a heuristic mechanism to prune unnecessary messages. It can solve the problem of when to communicate. Then, the ability to gate unnecessary message can reduce the cost and improve the efficiency of communication, which is important to apply to real-world scenarios. Furthermore, at receiving end, we utilize the intrinsic importance to differentiate information, which can be helpful for local decisions. It could solve the problem of whom to communicate. The ability to pay attention to useful information can efficiently improve the performance of communication behaviors. At last, we evaluate our method on a variety of multi-agent scenarios. The experiments of full communication demonstrate that the curiosity is capable to model the communication importance, and the results of gated communication further prove the conclusion. |
format | Online Article Text |
id | pubmed-9064529 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-90645292022-05-04 Learning Attentional and Gated Communication via Curiosity Sun, Chuxiong Zhou, Kaijie Cong, Cong Li, Kai Wang, Rui Hu, Xiaohui Comput Intell Neurosci Research Article Due to the partial observability in decentralized multi-agent systems, communication is critical for cooperation. Furthermore, the ability to decide when and whom to communicate is important to achieve efficient communication. However, the existing methods are typically driven by extrinsic rewards. Hence, when the reward from environment is sparse, delayed, or noisy, the communication performance of these methods would be restricted. Furthermore, it would introduce additional difficulty named credit assignment when using extrinsic reward to train communication and sample policies together. To tackle these difficulties, we introduce the mechanism of intrinsic motivation from psychology to multi-agent communication. We hold the view that the observations with more uncertainty and curiosity are more valuable for communication. It can help agent find useful information from observations. It is a good complement to existing extrinsic driven methods. Concretely, at sending end, we learn a curiosity from local observations to model the communication importance. Then, we design a heuristic mechanism to prune unnecessary messages. It can solve the problem of when to communicate. Then, the ability to gate unnecessary message can reduce the cost and improve the efficiency of communication, which is important to apply to real-world scenarios. Furthermore, at receiving end, we utilize the intrinsic importance to differentiate information, which can be helpful for local decisions. It could solve the problem of whom to communicate. The ability to pay attention to useful information can efficiently improve the performance of communication behaviors. At last, we evaluate our method on a variety of multi-agent scenarios. The experiments of full communication demonstrate that the curiosity is capable to model the communication importance, and the results of gated communication further prove the conclusion. Hindawi 2022-04-26 /pmc/articles/PMC9064529/ /pubmed/35515506 http://dx.doi.org/10.1155/2022/2951193 Text en Copyright © 2022 Chuxiong Sun et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Sun, Chuxiong Zhou, Kaijie Cong, Cong Li, Kai Wang, Rui Hu, Xiaohui Learning Attentional and Gated Communication via Curiosity |
title | Learning Attentional and Gated Communication via Curiosity |
title_full | Learning Attentional and Gated Communication via Curiosity |
title_fullStr | Learning Attentional and Gated Communication via Curiosity |
title_full_unstemmed | Learning Attentional and Gated Communication via Curiosity |
title_short | Learning Attentional and Gated Communication via Curiosity |
title_sort | learning attentional and gated communication via curiosity |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9064529/ https://www.ncbi.nlm.nih.gov/pubmed/35515506 http://dx.doi.org/10.1155/2022/2951193 |
work_keys_str_mv | AT sunchuxiong learningattentionalandgatedcommunicationviacuriosity AT zhoukaijie learningattentionalandgatedcommunicationviacuriosity AT congcong learningattentionalandgatedcommunicationviacuriosity AT likai learningattentionalandgatedcommunicationviacuriosity AT wangrui learningattentionalandgatedcommunicationviacuriosity AT huxiaohui learningattentionalandgatedcommunicationviacuriosity |