Cargando…
Model-based and model-free pain avoidance learning
Background: While there is good evidence that reward learning is underpinned by two distinct decision control systems – a cognitive ‘model-based’ and a habitbased ‘model-free’ system, a comparable distinction for punishment avoidance has been much less clear. Methods: We implemented a pain avoidance...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6187988/ https://www.ncbi.nlm.nih.gov/pubmed/30370339 http://dx.doi.org/10.1177/2398212818772964 |
_version_ | 1783363132521971712 |
---|---|
author | Wang, Oliver Lee, Sang Wan O’Doherty, John Seymour, Ben Yoshida, Wako |
author_facet | Wang, Oliver Lee, Sang Wan O’Doherty, John Seymour, Ben Yoshida, Wako |
author_sort | Wang, Oliver |
collection | PubMed |
description | Background: While there is good evidence that reward learning is underpinned by two distinct decision control systems – a cognitive ‘model-based’ and a habitbased ‘model-free’ system, a comparable distinction for punishment avoidance has been much less clear. Methods: We implemented a pain avoidance task that placed differential emphasis on putative model-based and model-free processing, mirroring a paradigm and modelling approach recently developed for reward-based decision-making. Subjects performed a two-step decision-making task with probabilistic pain outcomes of different quantities. The delivery of outcomes was sometimes contingent on a rule signalled at the beginning of each trial, emulating a form of outcome devaluation. Results: The behavioural data showed that subjects tended to use a mixed strategy – favouring the simpler model-free learning strategy when outcomes did not depend on the rule, and favouring a model-based when they did. Furthermore, the data were well described by a dynamic transition model between the two controllers. When compared with data from a reward-based task (albeit tested in the context of the scanner), we observed that avoidance involved a significantly greater tendency for subjects to switch between model-free and model-based systems in the face of changes in uncertainty. Conclusion: Our study suggests a dual-system model of pain avoidance, similar to but possibly more dynamically flexible than reward-based decision-making. |
format | Online Article Text |
id | pubmed-6187988 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-61879882018-10-24 Model-based and model-free pain avoidance learning Wang, Oliver Lee, Sang Wan O’Doherty, John Seymour, Ben Yoshida, Wako Brain Neurosci Adv Special Collection on Prefrontal Cortex Background: While there is good evidence that reward learning is underpinned by two distinct decision control systems – a cognitive ‘model-based’ and a habitbased ‘model-free’ system, a comparable distinction for punishment avoidance has been much less clear. Methods: We implemented a pain avoidance task that placed differential emphasis on putative model-based and model-free processing, mirroring a paradigm and modelling approach recently developed for reward-based decision-making. Subjects performed a two-step decision-making task with probabilistic pain outcomes of different quantities. The delivery of outcomes was sometimes contingent on a rule signalled at the beginning of each trial, emulating a form of outcome devaluation. Results: The behavioural data showed that subjects tended to use a mixed strategy – favouring the simpler model-free learning strategy when outcomes did not depend on the rule, and favouring a model-based when they did. Furthermore, the data were well described by a dynamic transition model between the two controllers. When compared with data from a reward-based task (albeit tested in the context of the scanner), we observed that avoidance involved a significantly greater tendency for subjects to switch between model-free and model-based systems in the face of changes in uncertainty. Conclusion: Our study suggests a dual-system model of pain avoidance, similar to but possibly more dynamically flexible than reward-based decision-making. SAGE Publications 2018-05-10 /pmc/articles/PMC6187988/ /pubmed/30370339 http://dx.doi.org/10.1177/2398212818772964 Text en © The Author(s) 2018 http://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution 4.0 License (http://www.creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Special Collection on Prefrontal Cortex Wang, Oliver Lee, Sang Wan O’Doherty, John Seymour, Ben Yoshida, Wako Model-based and model-free pain avoidance learning |
title | Model-based and model-free pain avoidance learning |
title_full | Model-based and model-free pain avoidance learning |
title_fullStr | Model-based and model-free pain avoidance learning |
title_full_unstemmed | Model-based and model-free pain avoidance learning |
title_short | Model-based and model-free pain avoidance learning |
title_sort | model-based and model-free pain avoidance learning |
topic | Special Collection on Prefrontal Cortex |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6187988/ https://www.ncbi.nlm.nih.gov/pubmed/30370339 http://dx.doi.org/10.1177/2398212818772964 |
work_keys_str_mv | AT wangoliver modelbasedandmodelfreepainavoidancelearning AT leesangwan modelbasedandmodelfreepainavoidancelearning AT odohertyjohn modelbasedandmodelfreepainavoidancelearning AT seymourben modelbasedandmodelfreepainavoidancelearning AT yoshidawako modelbasedandmodelfreepainavoidancelearning |