Cargando…

A generative spiking neural-network model of goal-directed behaviour and one-step planning

In mammals, goal-directed and planning processes support flexible behaviour used to face new situations that cannot be tackled through more efficient but rigid habitual behaviours. Within the Bayesian modelling approach of brain and behaviour, models have been proposed to perform planning as probabi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Basanisi, Ruggero, Brovelli, Andrea, Cartoni, Emilio, Baldassarre, Gianluca
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2020
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7748287/ https://www.ncbi.nlm.nih.gov/pubmed/33290414 http://dx.doi.org/10.1371/journal.pcbi.1007579

_version_	1783625083182383104
author	Basanisi, Ruggero Brovelli, Andrea Cartoni, Emilio Baldassarre, Gianluca
author_facet	Basanisi, Ruggero Brovelli, Andrea Cartoni, Emilio Baldassarre, Gianluca
author_sort	Basanisi, Ruggero
collection	PubMed
description	In mammals, goal-directed and planning processes support flexible behaviour used to face new situations that cannot be tackled through more efficient but rigid habitual behaviours. Within the Bayesian modelling approach of brain and behaviour, models have been proposed to perform planning as probabilistic inference but this approach encounters a crucial problem: explaining how such inference might be implemented in brain spiking networks. Recently, the literature has proposed some models that face this problem through recurrent spiking neural networks able to internally simulate state trajectories, the core function at the basis of planning. However, the proposed models have relevant limitations that make them biologically implausible, namely their world model is trained ‘off-line’ before solving the target tasks, and they are trained with supervised learning procedures that are biologically and ecologically not plausible. Here we propose two novel hypotheses on how brain might overcome these problems, and operationalise them in a novel architecture pivoting on a spiking recurrent neural network. The first hypothesis allows the architecture to learn the world model in parallel with its use for planning: to this purpose, a new arbitration mechanism decides when to explore, for learning the world model, or when to exploit it, for planning, based on the entropy of the world model itself. The second hypothesis allows the architecture to use an unsupervised learning process to learn the world model by observing the effects of actions. The architecture is validated by reproducing and accounting for the learning profiles and reaction times of human participants learning to solve a visuomotor learning task that is new for them. Overall, the architecture represents the first instance of a model bridging probabilistic planning and spiking-processes that has a degree of autonomy analogous to the one of real organisms.
format	Online Article Text
id	pubmed-7748287
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-77482872021-01-04 A generative spiking neural-network model of goal-directed behaviour and one-step planning Basanisi, Ruggero Brovelli, Andrea Cartoni, Emilio Baldassarre, Gianluca PLoS Comput Biol Research Article In mammals, goal-directed and planning processes support flexible behaviour used to face new situations that cannot be tackled through more efficient but rigid habitual behaviours. Within the Bayesian modelling approach of brain and behaviour, models have been proposed to perform planning as probabilistic inference but this approach encounters a crucial problem: explaining how such inference might be implemented in brain spiking networks. Recently, the literature has proposed some models that face this problem through recurrent spiking neural networks able to internally simulate state trajectories, the core function at the basis of planning. However, the proposed models have relevant limitations that make them biologically implausible, namely their world model is trained ‘off-line’ before solving the target tasks, and they are trained with supervised learning procedures that are biologically and ecologically not plausible. Here we propose two novel hypotheses on how brain might overcome these problems, and operationalise them in a novel architecture pivoting on a spiking recurrent neural network. The first hypothesis allows the architecture to learn the world model in parallel with its use for planning: to this purpose, a new arbitration mechanism decides when to explore, for learning the world model, or when to exploit it, for planning, based on the entropy of the world model itself. The second hypothesis allows the architecture to use an unsupervised learning process to learn the world model by observing the effects of actions. The architecture is validated by reproducing and accounting for the learning profiles and reaction times of human participants learning to solve a visuomotor learning task that is new for them. Overall, the architecture represents the first instance of a model bridging probabilistic planning and spiking-processes that has a degree of autonomy analogous to the one of real organisms. Public Library of Science 2020-12-08 /pmc/articles/PMC7748287/ /pubmed/33290414 http://dx.doi.org/10.1371/journal.pcbi.1007579 Text en © 2020 Basanisi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Basanisi, Ruggero Brovelli, Andrea Cartoni, Emilio Baldassarre, Gianluca A generative spiking neural-network model of goal-directed behaviour and one-step planning
title	A generative spiking neural-network model of goal-directed behaviour and one-step planning
title_full	A generative spiking neural-network model of goal-directed behaviour and one-step planning
title_fullStr	A generative spiking neural-network model of goal-directed behaviour and one-step planning
title_full_unstemmed	A generative spiking neural-network model of goal-directed behaviour and one-step planning
title_short	A generative spiking neural-network model of goal-directed behaviour and one-step planning
title_sort	generative spiking neural-network model of goal-directed behaviour and one-step planning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7748287/ https://www.ncbi.nlm.nih.gov/pubmed/33290414 http://dx.doi.org/10.1371/journal.pcbi.1007579
work_keys_str_mv	AT basanisiruggero agenerativespikingneuralnetworkmodelofgoaldirectedbehaviourandonestepplanning AT brovelliandrea agenerativespikingneuralnetworkmodelofgoaldirectedbehaviourandonestepplanning AT cartoniemilio agenerativespikingneuralnetworkmodelofgoaldirectedbehaviourandonestepplanning AT baldassarregianluca agenerativespikingneuralnetworkmodelofgoaldirectedbehaviourandonestepplanning AT basanisiruggero generativespikingneuralnetworkmodelofgoaldirectedbehaviourandonestepplanning AT brovelliandrea generativespikingneuralnetworkmodelofgoaldirectedbehaviourandonestepplanning AT cartoniemilio generativespikingneuralnetworkmodelofgoaldirectedbehaviourandonestepplanning AT baldassarregianluca generativespikingneuralnetworkmodelofgoaldirectedbehaviourandonestepplanning

A generative spiking neural-network model of goal-directed behaviour and one-step planning

Ejemplares similares