Cargando…

Multiagent cooperation and competition with deep reinforcement learning

Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as t...

Descripción completa

Detalles Bibliográficos
Autores principales: Tampuu, Ardi, Matiisen, Tambet, Kodelja, Dorian, Kuzovkin, Ilya, Korjus, Kristjan, Aru, Juhan, Aru, Jaan, Vicente, Raul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5381785/
https://www.ncbi.nlm.nih.gov/pubmed/28380078
http://dx.doi.org/10.1371/journal.pone.0172395
_version_ 1782519991842635776
author Tampuu, Ardi
Matiisen, Tambet
Kodelja, Dorian
Kuzovkin, Ilya
Korjus, Kristjan
Aru, Juhan
Aru, Jaan
Vicente, Raul
author_facet Tampuu, Ardi
Matiisen, Tambet
Kodelja, Dorian
Kuzovkin, Ilya
Korjus, Kristjan
Aru, Juhan
Aru, Jaan
Vicente, Raul
author_sort Tampuu, Ardi
collection PubMed
description Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as the state representation. In particular, we extend the Deep Q-Learning framework to multiagent environments to investigate the interaction between two learning agents in the well-known video game Pong. By manipulating the classical rewarding scheme of Pong we show how competitive and collaborative behaviors emerge. We also describe the progression from competitive to collaborative behavior when the incentive to cooperate is increased. Finally we show how learning by playing against another adaptive agent, instead of against a hard-wired algorithm, results in more robust strategies. The present work shows that Deep Q-Networks can become a useful tool for studying decentralized learning of multiagent systems coping with high-dimensional environments.
format Online
Article
Text
id pubmed-5381785
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53817852017-04-19 Multiagent cooperation and competition with deep reinforcement learning Tampuu, Ardi Matiisen, Tambet Kodelja, Dorian Kuzovkin, Ilya Korjus, Kristjan Aru, Juhan Aru, Jaan Vicente, Raul PLoS One Research Article Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as the state representation. In particular, we extend the Deep Q-Learning framework to multiagent environments to investigate the interaction between two learning agents in the well-known video game Pong. By manipulating the classical rewarding scheme of Pong we show how competitive and collaborative behaviors emerge. We also describe the progression from competitive to collaborative behavior when the incentive to cooperate is increased. Finally we show how learning by playing against another adaptive agent, instead of against a hard-wired algorithm, results in more robust strategies. The present work shows that Deep Q-Networks can become a useful tool for studying decentralized learning of multiagent systems coping with high-dimensional environments. Public Library of Science 2017-04-05 /pmc/articles/PMC5381785/ /pubmed/28380078 http://dx.doi.org/10.1371/journal.pone.0172395 Text en © 2017 Tampuu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tampuu, Ardi
Matiisen, Tambet
Kodelja, Dorian
Kuzovkin, Ilya
Korjus, Kristjan
Aru, Juhan
Aru, Jaan
Vicente, Raul
Multiagent cooperation and competition with deep reinforcement learning
title Multiagent cooperation and competition with deep reinforcement learning
title_full Multiagent cooperation and competition with deep reinforcement learning
title_fullStr Multiagent cooperation and competition with deep reinforcement learning
title_full_unstemmed Multiagent cooperation and competition with deep reinforcement learning
title_short Multiagent cooperation and competition with deep reinforcement learning
title_sort multiagent cooperation and competition with deep reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5381785/
https://www.ncbi.nlm.nih.gov/pubmed/28380078
http://dx.doi.org/10.1371/journal.pone.0172395
work_keys_str_mv AT tampuuardi multiagentcooperationandcompetitionwithdeepreinforcementlearning
AT matiisentambet multiagentcooperationandcompetitionwithdeepreinforcementlearning
AT kodeljadorian multiagentcooperationandcompetitionwithdeepreinforcementlearning
AT kuzovkinilya multiagentcooperationandcompetitionwithdeepreinforcementlearning
AT korjuskristjan multiagentcooperationandcompetitionwithdeepreinforcementlearning
AT arujuhan multiagentcooperationandcompetitionwithdeepreinforcementlearning
AT arujaan multiagentcooperationandcompetitionwithdeepreinforcementlearning
AT vicenteraul multiagentcooperationandcompetitionwithdeepreinforcementlearning