Cargando…

Multi-agent reinforcement learning with approximate model learning for competitive games

We propose a method for learning multi-agent policies to compete against multiple opponents. The method consists of recurrent neural network-based actor-critic networks and deterministic policy gradients that promote cooperation between agents by communication. The learning process does not require...

Descripción completa

Detalles Bibliográficos
Autores principales:	Park, Young Joon, Cho, Yoon Sang, Kim, Seoung Bum
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6739057/ https://www.ncbi.nlm.nih.gov/pubmed/31509568 http://dx.doi.org/10.1371/journal.pone.0222215

_version_	1783450892405571584
author	Park, Young Joon Cho, Yoon Sang Kim, Seoung Bum
author_facet	Park, Young Joon Cho, Yoon Sang Kim, Seoung Bum
author_sort	Park, Young Joon
collection	PubMed
description	We propose a method for learning multi-agent policies to compete against multiple opponents. The method consists of recurrent neural network-based actor-critic networks and deterministic policy gradients that promote cooperation between agents by communication. The learning process does not require access to opponents’ parameters or observations because the agents are trained separately from the opponents. The actor networks enable the agents to communicate using forward and backward paths while the critic network helps to train the actors by delivering them gradient signals based on their contribution to the global reward. Moreover, to address nonstationarity due to the evolving of other agents, we propose approximate model learning using auxiliary prediction networks for modeling the state transitions, reward function, and opponent behavior. In the test phase, we use competitive multi-agent environments to demonstrate by comparison the usefulness and superiority of the proposed method in terms of learning efficiency and goal achievements. The comparison results show that the proposed method outperforms the alternatives.
format	Online Article Text
id	pubmed-6739057
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-67390572019-09-20 Multi-agent reinforcement learning with approximate model learning for competitive games Park, Young Joon Cho, Yoon Sang Kim, Seoung Bum PLoS One Research Article We propose a method for learning multi-agent policies to compete against multiple opponents. The method consists of recurrent neural network-based actor-critic networks and deterministic policy gradients that promote cooperation between agents by communication. The learning process does not require access to opponents’ parameters or observations because the agents are trained separately from the opponents. The actor networks enable the agents to communicate using forward and backward paths while the critic network helps to train the actors by delivering them gradient signals based on their contribution to the global reward. Moreover, to address nonstationarity due to the evolving of other agents, we propose approximate model learning using auxiliary prediction networks for modeling the state transitions, reward function, and opponent behavior. In the test phase, we use competitive multi-agent environments to demonstrate by comparison the usefulness and superiority of the proposed method in terms of learning efficiency and goal achievements. The comparison results show that the proposed method outperforms the alternatives. Public Library of Science 2019-09-11 /pmc/articles/PMC6739057/ /pubmed/31509568 http://dx.doi.org/10.1371/journal.pone.0222215 Text en © 2019 Park et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Park, Young Joon Cho, Yoon Sang Kim, Seoung Bum Multi-agent reinforcement learning with approximate model learning for competitive games
title	Multi-agent reinforcement learning with approximate model learning for competitive games
title_full	Multi-agent reinforcement learning with approximate model learning for competitive games
title_fullStr	Multi-agent reinforcement learning with approximate model learning for competitive games
title_full_unstemmed	Multi-agent reinforcement learning with approximate model learning for competitive games
title_short	Multi-agent reinforcement learning with approximate model learning for competitive games
title_sort	multi-agent reinforcement learning with approximate model learning for competitive games
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6739057/ https://www.ncbi.nlm.nih.gov/pubmed/31509568 http://dx.doi.org/10.1371/journal.pone.0222215
work_keys_str_mv	AT parkyoungjoon multiagentreinforcementlearningwithapproximatemodellearningforcompetitivegames AT choyoonsang multiagentreinforcementlearningwithapproximatemodellearningforcompetitivegames AT kimseoungbum multiagentreinforcementlearningwithapproximatemodellearningforcompetitivegames

Multi-agent reinforcement learning with approximate model learning for competitive games

Ejemplares similares