Cargando…

The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation

Instrumental behavior depends on both goal-directed and habitual mechanisms of choice. Normative views cast these mechanisms in terms of model-free and model-based methods of reinforcement learning, respectively. An influential proposal hypothesizes that model-free and model-based mechanisms coexist...

Descripción completa

Detalles Bibliográficos
Autores principales: Pezzulo, Giovanni, Rigoli, Francesco, Chersi, Fabian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3586710/
https://www.ncbi.nlm.nih.gov/pubmed/23459512
http://dx.doi.org/10.3389/fpsyg.2013.00092
_version_ 1782261344415776768
author Pezzulo, Giovanni
Rigoli, Francesco
Chersi, Fabian
author_facet Pezzulo, Giovanni
Rigoli, Francesco
Chersi, Fabian
author_sort Pezzulo, Giovanni
collection PubMed
description Instrumental behavior depends on both goal-directed and habitual mechanisms of choice. Normative views cast these mechanisms in terms of model-free and model-based methods of reinforcement learning, respectively. An influential proposal hypothesizes that model-free and model-based mechanisms coexist and compete in the brain according to their relative uncertainty. In this paper we propose a novel view in which a single Mixed Instrumental Controller produces both goal-directed and habitual behavior by flexibly balancing and combining model-based and model-free computations. The Mixed Instrumental Controller performs a cost-benefits analysis to decide whether to chose an action immediately based on the available “cached” value of actions (linked to model-free mechanisms) or to improve value estimation by mentally simulating the expected outcome values (linked to model-based mechanisms). Since mental simulation entails cognitive effort and increases the reward delay, it is activated only when the associated “Value of Information” exceeds its costs. The model proposes a method to compute the Value of Information, based on the uncertainty of action values and on the distance of alternative cached action values. Overall, the model by default chooses on the basis of lighter model-free estimates, and integrates them with costly model-based predictions only when useful. Mental simulation uses a sampling method to produce reward expectancies, which are used to update the cached value of one or more actions; in turn, this updated value is used for the choice. The key predictions of the model are tested in different settings of a double T-maze scenario. Results are discussed in relation with neurobiological evidence on the hippocampus – ventral striatum circuit in rodents, which has been linked to goal-directed spatial navigation.
format Online
Article
Text
id pubmed-3586710
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-35867102013-03-04 The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation Pezzulo, Giovanni Rigoli, Francesco Chersi, Fabian Front Psychol Psychology Instrumental behavior depends on both goal-directed and habitual mechanisms of choice. Normative views cast these mechanisms in terms of model-free and model-based methods of reinforcement learning, respectively. An influential proposal hypothesizes that model-free and model-based mechanisms coexist and compete in the brain according to their relative uncertainty. In this paper we propose a novel view in which a single Mixed Instrumental Controller produces both goal-directed and habitual behavior by flexibly balancing and combining model-based and model-free computations. The Mixed Instrumental Controller performs a cost-benefits analysis to decide whether to chose an action immediately based on the available “cached” value of actions (linked to model-free mechanisms) or to improve value estimation by mentally simulating the expected outcome values (linked to model-based mechanisms). Since mental simulation entails cognitive effort and increases the reward delay, it is activated only when the associated “Value of Information” exceeds its costs. The model proposes a method to compute the Value of Information, based on the uncertainty of action values and on the distance of alternative cached action values. Overall, the model by default chooses on the basis of lighter model-free estimates, and integrates them with costly model-based predictions only when useful. Mental simulation uses a sampling method to produce reward expectancies, which are used to update the cached value of one or more actions; in turn, this updated value is used for the choice. The key predictions of the model are tested in different settings of a double T-maze scenario. Results are discussed in relation with neurobiological evidence on the hippocampus – ventral striatum circuit in rodents, which has been linked to goal-directed spatial navigation. Frontiers Media S.A. 2013-03-04 /pmc/articles/PMC3586710/ /pubmed/23459512 http://dx.doi.org/10.3389/fpsyg.2013.00092 Text en Copyright © 2013 Pezzulo, Rigoli and Chersi. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle Psychology
Pezzulo, Giovanni
Rigoli, Francesco
Chersi, Fabian
The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation
title The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation
title_full The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation
title_fullStr The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation
title_full_unstemmed The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation
title_short The Mixed Instrumental Controller: Using Value of Information to Combine Habitual Choice and Mental Simulation
title_sort mixed instrumental controller: using value of information to combine habitual choice and mental simulation
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3586710/
https://www.ncbi.nlm.nih.gov/pubmed/23459512
http://dx.doi.org/10.3389/fpsyg.2013.00092
work_keys_str_mv AT pezzulogiovanni themixedinstrumentalcontrollerusingvalueofinformationtocombinehabitualchoiceandmentalsimulation
AT rigolifrancesco themixedinstrumentalcontrollerusingvalueofinformationtocombinehabitualchoiceandmentalsimulation
AT chersifabian themixedinstrumentalcontrollerusingvalueofinformationtocombinehabitualchoiceandmentalsimulation
AT pezzulogiovanni mixedinstrumentalcontrollerusingvalueofinformationtocombinehabitualchoiceandmentalsimulation
AT rigolifrancesco mixedinstrumentalcontrollerusingvalueofinformationtocombinehabitualchoiceandmentalsimulation
AT chersifabian mixedinstrumentalcontrollerusingvalueofinformationtocombinehabitualchoiceandmentalsimulation