Cargando…

Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences

In economics and perceptual decision-making contextual effects are well documented, where decision weights are adjusted as a function of the distribution of stimuli. Yet, in reinforcement learning literature whether and how contextual information pertaining to decision states is integrated in learni...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bavard, Sophie, Lebreton, Maël, Khamassi, Mehdi, Coricelli, Giorgio, Palminteri, Stefano
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6206161/ https://www.ncbi.nlm.nih.gov/pubmed/30374019 http://dx.doi.org/10.1038/s41467-018-06781-2

_version_	1783366315284627456
author	Bavard, Sophie Lebreton, Maël Khamassi, Mehdi Coricelli, Giorgio Palminteri, Stefano
author_facet	Bavard, Sophie Lebreton, Maël Khamassi, Mehdi Coricelli, Giorgio Palminteri, Stefano
author_sort	Bavard, Sophie
collection	PubMed
description	In economics and perceptual decision-making contextual effects are well documented, where decision weights are adjusted as a function of the distribution of stimuli. Yet, in reinforcement learning literature whether and how contextual information pertaining to decision states is integrated in learning algorithms has received comparably little attention. Here, we investigate reinforcement learning behavior and its computational substrates in a task where we orthogonally manipulate outcome valence and magnitude, resulting in systematic variations in state-values. Model comparison indicates that subjects’ behavior is best accounted for by an algorithm which includes both reference point-dependence and range-adaptation—two crucial features of state-dependent valuation. In addition, we find that state-dependent outcome valuation progressively emerges, is favored by increasing outcome information and correlated with explicit understanding of the task structure. Finally, our data clearly show that, while being locally adaptive (for instance in negative valence and small magnitude contexts), state-dependent valuation comes at the cost of seemingly irrational choices, when options are extrapolated out from their original contexts.
format	Online Article Text
id	pubmed-6206161
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-62061612018-10-31 Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences Bavard, Sophie Lebreton, Maël Khamassi, Mehdi Coricelli, Giorgio Palminteri, Stefano Nat Commun Article In economics and perceptual decision-making contextual effects are well documented, where decision weights are adjusted as a function of the distribution of stimuli. Yet, in reinforcement learning literature whether and how contextual information pertaining to decision states is integrated in learning algorithms has received comparably little attention. Here, we investigate reinforcement learning behavior and its computational substrates in a task where we orthogonally manipulate outcome valence and magnitude, resulting in systematic variations in state-values. Model comparison indicates that subjects’ behavior is best accounted for by an algorithm which includes both reference point-dependence and range-adaptation—two crucial features of state-dependent valuation. In addition, we find that state-dependent outcome valuation progressively emerges, is favored by increasing outcome information and correlated with explicit understanding of the task structure. Finally, our data clearly show that, while being locally adaptive (for instance in negative valence and small magnitude contexts), state-dependent valuation comes at the cost of seemingly irrational choices, when options are extrapolated out from their original contexts. Nature Publishing Group UK 2018-10-29 /pmc/articles/PMC6206161/ /pubmed/30374019 http://dx.doi.org/10.1038/s41467-018-06781-2 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Bavard, Sophie Lebreton, Maël Khamassi, Mehdi Coricelli, Giorgio Palminteri, Stefano Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
title	Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
title_full	Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
title_fullStr	Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
title_full_unstemmed	Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
title_short	Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
title_sort	reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6206161/ https://www.ncbi.nlm.nih.gov/pubmed/30374019 http://dx.doi.org/10.1038/s41467-018-06781-2
work_keys_str_mv	AT bavardsophie referencepointcenteringandrangeadaptationenhancehumanreinforcementlearningatthecostofirrationalpreferences AT lebretonmael referencepointcenteringandrangeadaptationenhancehumanreinforcementlearningatthecostofirrationalpreferences AT khamassimehdi referencepointcenteringandrangeadaptationenhancehumanreinforcementlearningatthecostofirrationalpreferences AT coricelligiorgio referencepointcenteringandrangeadaptationenhancehumanreinforcementlearningatthecostofirrationalpreferences AT palminteristefano referencepointcenteringandrangeadaptationenhancehumanreinforcementlearningatthecostofirrationalpreferences

Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences

Ejemplares similares