Cargando…

Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations

High performance computing on the Graphics Processing Unit (GPU) is an emerging field driven by the promise of high computational power at a low cost. However, GPU programming is a non-trivial task and moreover architectural limitations raise the question of whether investing effort in this directio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Richmond, Paul, Buesing, Lars, Giugliano, Michele, Vasilaki, Eleni
Formato:	Texto
Lenguaje:	English
Publicado:	Public Library of Science 2011
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3087717/ https://www.ncbi.nlm.nih.gov/pubmed/21572529 http://dx.doi.org/10.1371/journal.pone.0018539

_version_	1782202819708715008
author	Richmond, Paul Buesing, Lars Giugliano, Michele Vasilaki, Eleni
author_facet	Richmond, Paul Buesing, Lars Giugliano, Michele Vasilaki, Eleni
author_sort	Richmond, Paul
collection	PubMed
description	High performance computing on the Graphics Processing Unit (GPU) is an emerging field driven by the promise of high computational power at a low cost. However, GPU programming is a non-trivial task and moreover architectural limitations raise the question of whether investing effort in this direction may be worthwhile. In this work, we use GPU programming to simulate a two-layer network of Integrate-and-Fire neurons with varying degrees of recurrent connectivity and investigate its ability to learn a simplified navigation task using a policy-gradient learning rule stemming from Reinforcement Learning. The purpose of this paper is twofold. First, we want to support the use of GPUs in the field of Computational Neuroscience. Second, using GPU computing power, we investigate the conditions under which the said architecture and learning rule demonstrate best performance. Our work indicates that networks featuring strong Mexican-Hat-shaped recurrent connections in the top layer, where decision making is governed by the formation of a stable activity bump in the neural population (a “non-democratic” mechanism), achieve mediocre learning results at best. In absence of recurrent connections, where all neurons “vote” independently (“democratic”) for a decision via population vector readout, the task is generally learned better and more robustly. Our study would have been extremely difficult on a desktop computer without the use of GPU programming. We present the routines developed for this purpose and show that a speed improvement of 5x up to 42x is provided versus optimised Python code. The higher speed is achieved when we exploit the parallelism of the GPU in the search of learning parameters. This suggests that efficient GPU programming can significantly reduce the time needed for simulating networks of spiking neurons, particularly when multiple parameter configurations are investigated.
format	Text
id	pubmed-3087717
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-30877172011-05-13 Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations Richmond, Paul Buesing, Lars Giugliano, Michele Vasilaki, Eleni PLoS One Research Article High performance computing on the Graphics Processing Unit (GPU) is an emerging field driven by the promise of high computational power at a low cost. However, GPU programming is a non-trivial task and moreover architectural limitations raise the question of whether investing effort in this direction may be worthwhile. In this work, we use GPU programming to simulate a two-layer network of Integrate-and-Fire neurons with varying degrees of recurrent connectivity and investigate its ability to learn a simplified navigation task using a policy-gradient learning rule stemming from Reinforcement Learning. The purpose of this paper is twofold. First, we want to support the use of GPUs in the field of Computational Neuroscience. Second, using GPU computing power, we investigate the conditions under which the said architecture and learning rule demonstrate best performance. Our work indicates that networks featuring strong Mexican-Hat-shaped recurrent connections in the top layer, where decision making is governed by the formation of a stable activity bump in the neural population (a “non-democratic” mechanism), achieve mediocre learning results at best. In absence of recurrent connections, where all neurons “vote” independently (“democratic”) for a decision via population vector readout, the task is generally learned better and more robustly. Our study would have been extremely difficult on a desktop computer without the use of GPU programming. We present the routines developed for this purpose and show that a speed improvement of 5x up to 42x is provided versus optimised Python code. The higher speed is achieved when we exploit the parallelism of the GPU in the search of learning parameters. This suggests that efficient GPU programming can significantly reduce the time needed for simulating networks of spiking neurons, particularly when multiple parameter configurations are investigated. Public Library of Science 2011-05-04 /pmc/articles/PMC3087717/ /pubmed/21572529 http://dx.doi.org/10.1371/journal.pone.0018539 Text en Richmond et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Richmond, Paul Buesing, Lars Giugliano, Michele Vasilaki, Eleni Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations
title	Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations
title_full	Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations
title_fullStr	Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations
title_full_unstemmed	Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations
title_short	Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations
title_sort	democratic population decisions result in robust policy-gradient learning: a parametric study with gpu simulations
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3087717/ https://www.ncbi.nlm.nih.gov/pubmed/21572529 http://dx.doi.org/10.1371/journal.pone.0018539
work_keys_str_mv	AT richmondpaul democraticpopulationdecisionsresultinrobustpolicygradientlearningaparametricstudywithgpusimulations AT buesinglars democraticpopulationdecisionsresultinrobustpolicygradientlearningaparametricstudywithgpusimulations AT giuglianomichele democraticpopulationdecisionsresultinrobustpolicygradientlearningaparametricstudywithgpusimulations AT vasilakieleni democraticpopulationdecisionsresultinrobustpolicygradientlearningaparametricstudywithgpusimulations

Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations

Ejemplares similares