Cargando…
Spike-based Decision Learning of Nash Equilibria in Two-Player Games
Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknow...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3459907/ https://www.ncbi.nlm.nih.gov/pubmed/23028289 http://dx.doi.org/10.1371/journal.pcbi.1002691 |
_version_ | 1782244880591880192 |
---|---|
author | Friedrich, Johannes Senn, Walter |
author_facet | Friedrich, Johannes Senn, Walter |
author_sort | Friedrich, Johannes |
collection | PubMed |
description | Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknown. We propose a population coding model of spiking neurons with a policy gradient procedure that successfully acquires optimal strategies for classical game-theoretical tasks. The suggested population reinforcement learning reproduces data from human behavioral experiments for the blackjack and the inspector game. It performs optimally according to a pure (deterministic) and mixed (stochastic) Nash equilibrium, respectively. In contrast, temporal-difference(TD)-learning, covariance-learning, and basic reinforcement learning fail to perform optimally for the stochastic strategy. Spike-based population reinforcement learning, shown to follow the stochastic reward gradient, is therefore a viable candidate to explain automated decision learning of a Nash equilibrium in two-player games. |
format | Online Article Text |
id | pubmed-3459907 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-34599072012-10-01 Spike-based Decision Learning of Nash Equilibria in Two-Player Games Friedrich, Johannes Senn, Walter PLoS Comput Biol Research Article Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknown. We propose a population coding model of spiking neurons with a policy gradient procedure that successfully acquires optimal strategies for classical game-theoretical tasks. The suggested population reinforcement learning reproduces data from human behavioral experiments for the blackjack and the inspector game. It performs optimally according to a pure (deterministic) and mixed (stochastic) Nash equilibrium, respectively. In contrast, temporal-difference(TD)-learning, covariance-learning, and basic reinforcement learning fail to perform optimally for the stochastic strategy. Spike-based population reinforcement learning, shown to follow the stochastic reward gradient, is therefore a viable candidate to explain automated decision learning of a Nash equilibrium in two-player games. Public Library of Science 2012-09-27 /pmc/articles/PMC3459907/ /pubmed/23028289 http://dx.doi.org/10.1371/journal.pcbi.1002691 Text en © 2012 Friedrich, Senn http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Friedrich, Johannes Senn, Walter Spike-based Decision Learning of Nash Equilibria in Two-Player Games |
title | Spike-based Decision Learning of Nash Equilibria in Two-Player Games |
title_full | Spike-based Decision Learning of Nash Equilibria in Two-Player Games |
title_fullStr | Spike-based Decision Learning of Nash Equilibria in Two-Player Games |
title_full_unstemmed | Spike-based Decision Learning of Nash Equilibria in Two-Player Games |
title_short | Spike-based Decision Learning of Nash Equilibria in Two-Player Games |
title_sort | spike-based decision learning of nash equilibria in two-player games |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3459907/ https://www.ncbi.nlm.nih.gov/pubmed/23028289 http://dx.doi.org/10.1371/journal.pcbi.1002691 |
work_keys_str_mv | AT friedrichjohannes spikebaseddecisionlearningofnashequilibriaintwoplayergames AT sennwalter spikebaseddecisionlearningofnashequilibriaintwoplayergames |