Cargando…

Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum

Previous theoretical studies of animal and human behavioral learning have focused on the dichotomy of the value-based strategy using action value functions to predict rewards and the model-based strategy using internal models to predict environmental states. However, animals and humans often take si...

Descripción completa

Detalles Bibliográficos
Autores principales: Ito, Makoto, Doya, Kenji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4631489/
https://www.ncbi.nlm.nih.gov/pubmed/26529522
http://dx.doi.org/10.1371/journal.pcbi.1004540
_version_ 1782398873226969088
author Ito, Makoto
Doya, Kenji
author_facet Ito, Makoto
Doya, Kenji
author_sort Ito, Makoto
collection PubMed
description Previous theoretical studies of animal and human behavioral learning have focused on the dichotomy of the value-based strategy using action value functions to predict rewards and the model-based strategy using internal models to predict environmental states. However, animals and humans often take simple procedural behaviors, such as the “win-stay, lose-switch” strategy without explicit prediction of rewards or states. Here we consider another strategy, the finite state-based strategy, in which a subject selects an action depending on its discrete internal state and updates the state depending on the action chosen and the reward outcome. By analyzing choice behavior of rats in a free-choice task, we found that the finite state-based strategy fitted their behavioral choices more accurately than value-based and model-based strategies did. When fitted models were run autonomously with the same task, only the finite state-based strategy could reproduce the key feature of choice sequences. Analyses of neural activity recorded from the dorsolateral striatum (DLS), the dorsomedial striatum (DMS), and the ventral striatum (VS) identified significant fractions of neurons in all three subareas for which activities were correlated with individual states of the finite state-based strategy. The signal of internal states at the time of choice was found in DMS, and for clusters of states was found in VS. In addition, action values and state values of the value-based strategy were encoded in DMS and VS, respectively. These results suggest that both the value-based strategy and the finite state-based strategy are implemented in the striatum.
format Online
Article
Text
id pubmed-4631489
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46314892015-11-13 Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum Ito, Makoto Doya, Kenji PLoS Comput Biol Research Article Previous theoretical studies of animal and human behavioral learning have focused on the dichotomy of the value-based strategy using action value functions to predict rewards and the model-based strategy using internal models to predict environmental states. However, animals and humans often take simple procedural behaviors, such as the “win-stay, lose-switch” strategy without explicit prediction of rewards or states. Here we consider another strategy, the finite state-based strategy, in which a subject selects an action depending on its discrete internal state and updates the state depending on the action chosen and the reward outcome. By analyzing choice behavior of rats in a free-choice task, we found that the finite state-based strategy fitted their behavioral choices more accurately than value-based and model-based strategies did. When fitted models were run autonomously with the same task, only the finite state-based strategy could reproduce the key feature of choice sequences. Analyses of neural activity recorded from the dorsolateral striatum (DLS), the dorsomedial striatum (DMS), and the ventral striatum (VS) identified significant fractions of neurons in all three subareas for which activities were correlated with individual states of the finite state-based strategy. The signal of internal states at the time of choice was found in DMS, and for clusters of states was found in VS. In addition, action values and state values of the value-based strategy were encoded in DMS and VS, respectively. These results suggest that both the value-based strategy and the finite state-based strategy are implemented in the striatum. Public Library of Science 2015-11-03 /pmc/articles/PMC4631489/ /pubmed/26529522 http://dx.doi.org/10.1371/journal.pcbi.1004540 Text en © 2015 Ito, Doya http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ito, Makoto
Doya, Kenji
Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum
title Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum
title_full Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum
title_fullStr Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum
title_full_unstemmed Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum
title_short Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum
title_sort parallel representation of value-based and finite state-based strategies in the ventral and dorsal striatum
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4631489/
https://www.ncbi.nlm.nih.gov/pubmed/26529522
http://dx.doi.org/10.1371/journal.pcbi.1004540
work_keys_str_mv AT itomakoto parallelrepresentationofvaluebasedandfinitestatebasedstrategiesintheventralanddorsalstriatum
AT doyakenji parallelrepresentationofvaluebasedandfinitestatebasedstrategiesintheventralanddorsalstriatum