Cargando…

Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning

Organisms appear to learn and make decisions using different strategies known as model-free and model-based learning; the former is mere reinforcement of previously rewarded actions and the latter is a forward-looking strategy that involves evaluation of action-state transition probabilities. Prior...

Descripción completa

Detalles Bibliográficos
Autores principales: Konovalov, Arkady, Krajbich, Ian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987535/
https://www.ncbi.nlm.nih.gov/pubmed/27511383
http://dx.doi.org/10.1038/ncomms12438
_version_ 1782448324920475648
author Konovalov, Arkady
Krajbich, Ian
author_facet Konovalov, Arkady
Krajbich, Ian
author_sort Konovalov, Arkady
collection PubMed
description Organisms appear to learn and make decisions using different strategies known as model-free and model-based learning; the former is mere reinforcement of previously rewarded actions and the latter is a forward-looking strategy that involves evaluation of action-state transition probabilities. Prior work has used neural data to argue that both model-based and model-free learners implement a value comparison process at trial onset, but model-based learners assign more weight to forward-looking computations. Here using eye-tracking, we report evidence for a different interpretation of prior results: model-based subjects make their choices prior to trial onset. In contrast, model-free subjects tend to ignore model-based aspects of the task and instead seem to treat the decision problem as a simple comparison process between two differentially valued items, consistent with previous work on sequential-sampling models of decision making. These findings illustrate a problem with assuming that experimental subjects make their decisions at the same prescribed time.
format Online
Article
Text
id pubmed-4987535
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49875352016-08-30 Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning Konovalov, Arkady Krajbich, Ian Nat Commun Article Organisms appear to learn and make decisions using different strategies known as model-free and model-based learning; the former is mere reinforcement of previously rewarded actions and the latter is a forward-looking strategy that involves evaluation of action-state transition probabilities. Prior work has used neural data to argue that both model-based and model-free learners implement a value comparison process at trial onset, but model-based learners assign more weight to forward-looking computations. Here using eye-tracking, we report evidence for a different interpretation of prior results: model-based subjects make their choices prior to trial onset. In contrast, model-free subjects tend to ignore model-based aspects of the task and instead seem to treat the decision problem as a simple comparison process between two differentially valued items, consistent with previous work on sequential-sampling models of decision making. These findings illustrate a problem with assuming that experimental subjects make their decisions at the same prescribed time. Nature Publishing Group 2016-08-11 /pmc/articles/PMC4987535/ /pubmed/27511383 http://dx.doi.org/10.1038/ncomms12438 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Konovalov, Arkady
Krajbich, Ian
Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
title Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
title_full Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
title_fullStr Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
title_full_unstemmed Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
title_short Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
title_sort gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987535/
https://www.ncbi.nlm.nih.gov/pubmed/27511383
http://dx.doi.org/10.1038/ncomms12438
work_keys_str_mv AT konovalovarkady gazedatarevealdistinctchoiceprocessesunderlyingmodelbasedandmodelfreereinforcementlearning
AT krajbichian gazedatarevealdistinctchoiceprocessesunderlyingmodelbasedandmodelfreereinforcementlearning