Cargando…

A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning

Reinforcement learning has been widely used in explaining animal behavior. In reinforcement learning, the agent learns the value of the states in the task, collectively constituting the task state space, and uses the knowledge to choose actions and acquire desired outcomes. It has been proposed that...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zhewei, Cheng, Zhenbo, Lin, Zhongqiao, Nie, Chechang, Yang, Tianming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5771635/
https://www.ncbi.nlm.nih.gov/pubmed/29300746
http://dx.doi.org/10.1371/journal.pcbi.1005925
_version_ 1783293293350617088
author Zhang, Zhewei
Cheng, Zhenbo
Lin, Zhongqiao
Nie, Chechang
Yang, Tianming
author_facet Zhang, Zhewei
Cheng, Zhenbo
Lin, Zhongqiao
Nie, Chechang
Yang, Tianming
author_sort Zhang, Zhewei
collection PubMed
description Reinforcement learning has been widely used in explaining animal behavior. In reinforcement learning, the agent learns the value of the states in the task, collectively constituting the task state space, and uses the knowledge to choose actions and acquire desired outcomes. It has been proposed that the orbitofrontal cortex (OFC) encodes the task state space during reinforcement learning. However, it is not well understood how the OFC acquires and stores task state information. Here, we propose a neural network model based on reservoir computing. Reservoir networks exhibit heterogeneous and dynamic activity patterns that are suitable to encode task states. The information can be extracted by a linear readout trained with reinforcement learning. We demonstrate how the network acquires and stores task structures. The network exhibits reinforcement learning behavior and its aspects resemble experimental findings of the OFC. Our study provides a theoretical explanation of how the OFC may contribute to reinforcement learning and a new approach to understanding the neural mechanism underlying reinforcement learning.
format Online
Article
Text
id pubmed-5771635
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-57716352018-01-26 A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning Zhang, Zhewei Cheng, Zhenbo Lin, Zhongqiao Nie, Chechang Yang, Tianming PLoS Comput Biol Research Article Reinforcement learning has been widely used in explaining animal behavior. In reinforcement learning, the agent learns the value of the states in the task, collectively constituting the task state space, and uses the knowledge to choose actions and acquire desired outcomes. It has been proposed that the orbitofrontal cortex (OFC) encodes the task state space during reinforcement learning. However, it is not well understood how the OFC acquires and stores task state information. Here, we propose a neural network model based on reservoir computing. Reservoir networks exhibit heterogeneous and dynamic activity patterns that are suitable to encode task states. The information can be extracted by a linear readout trained with reinforcement learning. We demonstrate how the network acquires and stores task structures. The network exhibits reinforcement learning behavior and its aspects resemble experimental findings of the OFC. Our study provides a theoretical explanation of how the OFC may contribute to reinforcement learning and a new approach to understanding the neural mechanism underlying reinforcement learning. Public Library of Science 2018-01-04 /pmc/articles/PMC5771635/ /pubmed/29300746 http://dx.doi.org/10.1371/journal.pcbi.1005925 Text en © 2018 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Zhewei
Cheng, Zhenbo
Lin, Zhongqiao
Nie, Chechang
Yang, Tianming
A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning
title A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning
title_full A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning
title_fullStr A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning
title_full_unstemmed A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning
title_short A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning
title_sort neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5771635/
https://www.ncbi.nlm.nih.gov/pubmed/29300746
http://dx.doi.org/10.1371/journal.pcbi.1005925
work_keys_str_mv AT zhangzhewei aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT chengzhenbo aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT linzhongqiao aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT niechechang aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT yangtianming aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT zhangzhewei neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT chengzhenbo neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT linzhongqiao neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT niechechang neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning
AT yangtianming neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning