Cargando…

Spike-based Decision Learning of Nash Equilibria in Two-Player Games

Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknow...

Descripción completa

Detalles Bibliográficos
Autores principales: Friedrich, Johannes, Senn, Walter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3459907/
https://www.ncbi.nlm.nih.gov/pubmed/23028289
http://dx.doi.org/10.1371/journal.pcbi.1002691
_version_ 1782244880591880192
author Friedrich, Johannes
Senn, Walter
author_facet Friedrich, Johannes
Senn, Walter
author_sort Friedrich, Johannes
collection PubMed
description Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknown. We propose a population coding model of spiking neurons with a policy gradient procedure that successfully acquires optimal strategies for classical game-theoretical tasks. The suggested population reinforcement learning reproduces data from human behavioral experiments for the blackjack and the inspector game. It performs optimally according to a pure (deterministic) and mixed (stochastic) Nash equilibrium, respectively. In contrast, temporal-difference(TD)-learning, covariance-learning, and basic reinforcement learning fail to perform optimally for the stochastic strategy. Spike-based population reinforcement learning, shown to follow the stochastic reward gradient, is therefore a viable candidate to explain automated decision learning of a Nash equilibrium in two-player games.
format Online
Article
Text
id pubmed-3459907
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34599072012-10-01 Spike-based Decision Learning of Nash Equilibria in Two-Player Games Friedrich, Johannes Senn, Walter PLoS Comput Biol Research Article Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknown. We propose a population coding model of spiking neurons with a policy gradient procedure that successfully acquires optimal strategies for classical game-theoretical tasks. The suggested population reinforcement learning reproduces data from human behavioral experiments for the blackjack and the inspector game. It performs optimally according to a pure (deterministic) and mixed (stochastic) Nash equilibrium, respectively. In contrast, temporal-difference(TD)-learning, covariance-learning, and basic reinforcement learning fail to perform optimally for the stochastic strategy. Spike-based population reinforcement learning, shown to follow the stochastic reward gradient, is therefore a viable candidate to explain automated decision learning of a Nash equilibrium in two-player games. Public Library of Science 2012-09-27 /pmc/articles/PMC3459907/ /pubmed/23028289 http://dx.doi.org/10.1371/journal.pcbi.1002691 Text en © 2012 Friedrich, Senn http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Friedrich, Johannes
Senn, Walter
Spike-based Decision Learning of Nash Equilibria in Two-Player Games
title Spike-based Decision Learning of Nash Equilibria in Two-Player Games
title_full Spike-based Decision Learning of Nash Equilibria in Two-Player Games
title_fullStr Spike-based Decision Learning of Nash Equilibria in Two-Player Games
title_full_unstemmed Spike-based Decision Learning of Nash Equilibria in Two-Player Games
title_short Spike-based Decision Learning of Nash Equilibria in Two-Player Games
title_sort spike-based decision learning of nash equilibria in two-player games
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3459907/
https://www.ncbi.nlm.nih.gov/pubmed/23028289
http://dx.doi.org/10.1371/journal.pcbi.1002691
work_keys_str_mv AT friedrichjohannes spikebaseddecisionlearningofnashequilibriaintwoplayergames
AT sennwalter spikebaseddecisionlearningofnashequilibriaintwoplayergames