Cargando…
Multiagent cooperation and competition with deep reinforcement learning
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as t...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5381785/ https://www.ncbi.nlm.nih.gov/pubmed/28380078 http://dx.doi.org/10.1371/journal.pone.0172395 |
_version_ | 1782519991842635776 |
---|---|
author | Tampuu, Ardi Matiisen, Tambet Kodelja, Dorian Kuzovkin, Ilya Korjus, Kristjan Aru, Juhan Aru, Jaan Vicente, Raul |
author_facet | Tampuu, Ardi Matiisen, Tambet Kodelja, Dorian Kuzovkin, Ilya Korjus, Kristjan Aru, Juhan Aru, Jaan Vicente, Raul |
author_sort | Tampuu, Ardi |
collection | PubMed |
description | Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as the state representation. In particular, we extend the Deep Q-Learning framework to multiagent environments to investigate the interaction between two learning agents in the well-known video game Pong. By manipulating the classical rewarding scheme of Pong we show how competitive and collaborative behaviors emerge. We also describe the progression from competitive to collaborative behavior when the incentive to cooperate is increased. Finally we show how learning by playing against another adaptive agent, instead of against a hard-wired algorithm, results in more robust strategies. The present work shows that Deep Q-Networks can become a useful tool for studying decentralized learning of multiagent systems coping with high-dimensional environments. |
format | Online Article Text |
id | pubmed-5381785 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53817852017-04-19 Multiagent cooperation and competition with deep reinforcement learning Tampuu, Ardi Matiisen, Tambet Kodelja, Dorian Kuzovkin, Ilya Korjus, Kristjan Aru, Juhan Aru, Jaan Vicente, Raul PLoS One Research Article Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as the state representation. In particular, we extend the Deep Q-Learning framework to multiagent environments to investigate the interaction between two learning agents in the well-known video game Pong. By manipulating the classical rewarding scheme of Pong we show how competitive and collaborative behaviors emerge. We also describe the progression from competitive to collaborative behavior when the incentive to cooperate is increased. Finally we show how learning by playing against another adaptive agent, instead of against a hard-wired algorithm, results in more robust strategies. The present work shows that Deep Q-Networks can become a useful tool for studying decentralized learning of multiagent systems coping with high-dimensional environments. Public Library of Science 2017-04-05 /pmc/articles/PMC5381785/ /pubmed/28380078 http://dx.doi.org/10.1371/journal.pone.0172395 Text en © 2017 Tampuu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Tampuu, Ardi Matiisen, Tambet Kodelja, Dorian Kuzovkin, Ilya Korjus, Kristjan Aru, Juhan Aru, Jaan Vicente, Raul Multiagent cooperation and competition with deep reinforcement learning |
title | Multiagent cooperation and competition with deep reinforcement learning |
title_full | Multiagent cooperation and competition with deep reinforcement learning |
title_fullStr | Multiagent cooperation and competition with deep reinforcement learning |
title_full_unstemmed | Multiagent cooperation and competition with deep reinforcement learning |
title_short | Multiagent cooperation and competition with deep reinforcement learning |
title_sort | multiagent cooperation and competition with deep reinforcement learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5381785/ https://www.ncbi.nlm.nih.gov/pubmed/28380078 http://dx.doi.org/10.1371/journal.pone.0172395 |
work_keys_str_mv | AT tampuuardi multiagentcooperationandcompetitionwithdeepreinforcementlearning AT matiisentambet multiagentcooperationandcompetitionwithdeepreinforcementlearning AT kodeljadorian multiagentcooperationandcompetitionwithdeepreinforcementlearning AT kuzovkinilya multiagentcooperationandcompetitionwithdeepreinforcementlearning AT korjuskristjan multiagentcooperationandcompetitionwithdeepreinforcementlearning AT arujuhan multiagentcooperationandcompetitionwithdeepreinforcementlearning AT arujaan multiagentcooperationandcompetitionwithdeepreinforcementlearning AT vicenteraul multiagentcooperationandcompetitionwithdeepreinforcementlearning |