Cargando…
Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
Organisms appear to learn and make decisions using different strategies known as model-free and model-based learning; the former is mere reinforcement of previously rewarded actions and the latter is a forward-looking strategy that involves evaluation of action-state transition probabilities. Prior...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987535/ https://www.ncbi.nlm.nih.gov/pubmed/27511383 http://dx.doi.org/10.1038/ncomms12438 |
_version_ | 1782448324920475648 |
---|---|
author | Konovalov, Arkady Krajbich, Ian |
author_facet | Konovalov, Arkady Krajbich, Ian |
author_sort | Konovalov, Arkady |
collection | PubMed |
description | Organisms appear to learn and make decisions using different strategies known as model-free and model-based learning; the former is mere reinforcement of previously rewarded actions and the latter is a forward-looking strategy that involves evaluation of action-state transition probabilities. Prior work has used neural data to argue that both model-based and model-free learners implement a value comparison process at trial onset, but model-based learners assign more weight to forward-looking computations. Here using eye-tracking, we report evidence for a different interpretation of prior results: model-based subjects make their choices prior to trial onset. In contrast, model-free subjects tend to ignore model-based aspects of the task and instead seem to treat the decision problem as a simple comparison process between two differentially valued items, consistent with previous work on sequential-sampling models of decision making. These findings illustrate a problem with assuming that experimental subjects make their decisions at the same prescribed time. |
format | Online Article Text |
id | pubmed-4987535 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-49875352016-08-30 Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning Konovalov, Arkady Krajbich, Ian Nat Commun Article Organisms appear to learn and make decisions using different strategies known as model-free and model-based learning; the former is mere reinforcement of previously rewarded actions and the latter is a forward-looking strategy that involves evaluation of action-state transition probabilities. Prior work has used neural data to argue that both model-based and model-free learners implement a value comparison process at trial onset, but model-based learners assign more weight to forward-looking computations. Here using eye-tracking, we report evidence for a different interpretation of prior results: model-based subjects make their choices prior to trial onset. In contrast, model-free subjects tend to ignore model-based aspects of the task and instead seem to treat the decision problem as a simple comparison process between two differentially valued items, consistent with previous work on sequential-sampling models of decision making. These findings illustrate a problem with assuming that experimental subjects make their decisions at the same prescribed time. Nature Publishing Group 2016-08-11 /pmc/articles/PMC4987535/ /pubmed/27511383 http://dx.doi.org/10.1038/ncomms12438 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Konovalov, Arkady Krajbich, Ian Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning |
title | Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning |
title_full | Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning |
title_fullStr | Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning |
title_full_unstemmed | Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning |
title_short | Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning |
title_sort | gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987535/ https://www.ncbi.nlm.nih.gov/pubmed/27511383 http://dx.doi.org/10.1038/ncomms12438 |
work_keys_str_mv | AT konovalovarkady gazedatarevealdistinctchoiceprocessesunderlyingmodelbasedandmodelfreereinforcementlearning AT krajbichian gazedatarevealdistinctchoiceprocessesunderlyingmodelbasedandmodelfreereinforcementlearning |