Cargando…
A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning
Reinforcement learning has been widely used in explaining animal behavior. In reinforcement learning, the agent learns the value of the states in the task, collectively constituting the task state space, and uses the knowledge to choose actions and acquire desired outcomes. It has been proposed that...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5771635/ https://www.ncbi.nlm.nih.gov/pubmed/29300746 http://dx.doi.org/10.1371/journal.pcbi.1005925 |
_version_ | 1783293293350617088 |
---|---|
author | Zhang, Zhewei Cheng, Zhenbo Lin, Zhongqiao Nie, Chechang Yang, Tianming |
author_facet | Zhang, Zhewei Cheng, Zhenbo Lin, Zhongqiao Nie, Chechang Yang, Tianming |
author_sort | Zhang, Zhewei |
collection | PubMed |
description | Reinforcement learning has been widely used in explaining animal behavior. In reinforcement learning, the agent learns the value of the states in the task, collectively constituting the task state space, and uses the knowledge to choose actions and acquire desired outcomes. It has been proposed that the orbitofrontal cortex (OFC) encodes the task state space during reinforcement learning. However, it is not well understood how the OFC acquires and stores task state information. Here, we propose a neural network model based on reservoir computing. Reservoir networks exhibit heterogeneous and dynamic activity patterns that are suitable to encode task states. The information can be extracted by a linear readout trained with reinforcement learning. We demonstrate how the network acquires and stores task structures. The network exhibits reinforcement learning behavior and its aspects resemble experimental findings of the OFC. Our study provides a theoretical explanation of how the OFC may contribute to reinforcement learning and a new approach to understanding the neural mechanism underlying reinforcement learning. |
format | Online Article Text |
id | pubmed-5771635 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-57716352018-01-26 A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning Zhang, Zhewei Cheng, Zhenbo Lin, Zhongqiao Nie, Chechang Yang, Tianming PLoS Comput Biol Research Article Reinforcement learning has been widely used in explaining animal behavior. In reinforcement learning, the agent learns the value of the states in the task, collectively constituting the task state space, and uses the knowledge to choose actions and acquire desired outcomes. It has been proposed that the orbitofrontal cortex (OFC) encodes the task state space during reinforcement learning. However, it is not well understood how the OFC acquires and stores task state information. Here, we propose a neural network model based on reservoir computing. Reservoir networks exhibit heterogeneous and dynamic activity patterns that are suitable to encode task states. The information can be extracted by a linear readout trained with reinforcement learning. We demonstrate how the network acquires and stores task structures. The network exhibits reinforcement learning behavior and its aspects resemble experimental findings of the OFC. Our study provides a theoretical explanation of how the OFC may contribute to reinforcement learning and a new approach to understanding the neural mechanism underlying reinforcement learning. Public Library of Science 2018-01-04 /pmc/articles/PMC5771635/ /pubmed/29300746 http://dx.doi.org/10.1371/journal.pcbi.1005925 Text en © 2018 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zhang, Zhewei Cheng, Zhenbo Lin, Zhongqiao Nie, Chechang Yang, Tianming A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning |
title | A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning |
title_full | A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning |
title_fullStr | A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning |
title_full_unstemmed | A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning |
title_short | A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning |
title_sort | neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5771635/ https://www.ncbi.nlm.nih.gov/pubmed/29300746 http://dx.doi.org/10.1371/journal.pcbi.1005925 |
work_keys_str_mv | AT zhangzhewei aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT chengzhenbo aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT linzhongqiao aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT niechechang aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT yangtianming aneuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT zhangzhewei neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT chengzhenbo neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT linzhongqiao neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT niechechang neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning AT yangtianming neuralnetworkmodelfortheorbitofrontalcortexandtaskspaceacquisitionduringreinforcementlearning |