Cargando…

Choice history effects in mice and humans improve reward harvesting efficiency

Choice history effects describe how future choices depend on the history of past choices. In experimental tasks this is typically framed as a bias because it often diminishes the experienced reward rates. However, in natural habitats, choices made in the past constrain choices that can be made in th...

Descripción completa

Detalles Bibliográficos
Autores principales:	López-Yépez, Junior Samuel, Martin, Juliane, Hulme, Oliver, Kvitsiani, Duda
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8516315/ https://www.ncbi.nlm.nih.gov/pubmed/34606493 http://dx.doi.org/10.1371/journal.pcbi.1009452

_version_	1784583776474497024
author	López-Yépez, Junior Samuel Martin, Juliane Hulme, Oliver Kvitsiani, Duda
author_facet	López-Yépez, Junior Samuel Martin, Juliane Hulme, Oliver Kvitsiani, Duda
author_sort	López-Yépez, Junior Samuel
collection	PubMed
description	Choice history effects describe how future choices depend on the history of past choices. In experimental tasks this is typically framed as a bias because it often diminishes the experienced reward rates. However, in natural habitats, choices made in the past constrain choices that can be made in the future. For foraging animals, the probability of earning a reward in a given patch depends on the degree to which the animals have exploited the patch in the past. One problem with many experimental tasks that show choice history effects is that such tasks artificially decouple choice history from its consequences on reward availability over time. To circumvent this, we use a variable interval (VI) reward schedule that reinstates a more natural contingency between past choices and future reward availability. By examining the behavior of optimal agents in the VI task we discover that choice history effects observed in animals serve to maximize reward harvesting efficiency. We further distil the function of choice history effects by manipulating first- and second-order statistics of the environment. We find that choice history effects primarily reflect the growth rate of the reward probability of the unchosen option, whereas reward history effects primarily reflect environmental volatility. Based on observed choice history effects in animals, we develop a reinforcement learning model that explicitly incorporates choice history over multiple time scales into the decision process, and we assess its predictive adequacy in accounting for the associated behavior. We show that this new variant, known as the double trace model, has a higher performance in predicting choice data, and shows near optimal reward harvesting efficiency in simulated environments. These results suggests that choice history effects may be adaptive for natural contingencies between consumption and reward availability. This concept lends credence to a normative account of choice history effects that extends beyond its description as a bias.
format	Online Article Text
id	pubmed-8516315
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-85163152021-10-15 Choice history effects in mice and humans improve reward harvesting efficiency López-Yépez, Junior Samuel Martin, Juliane Hulme, Oliver Kvitsiani, Duda PLoS Comput Biol Research Article Choice history effects describe how future choices depend on the history of past choices. In experimental tasks this is typically framed as a bias because it often diminishes the experienced reward rates. However, in natural habitats, choices made in the past constrain choices that can be made in the future. For foraging animals, the probability of earning a reward in a given patch depends on the degree to which the animals have exploited the patch in the past. One problem with many experimental tasks that show choice history effects is that such tasks artificially decouple choice history from its consequences on reward availability over time. To circumvent this, we use a variable interval (VI) reward schedule that reinstates a more natural contingency between past choices and future reward availability. By examining the behavior of optimal agents in the VI task we discover that choice history effects observed in animals serve to maximize reward harvesting efficiency. We further distil the function of choice history effects by manipulating first- and second-order statistics of the environment. We find that choice history effects primarily reflect the growth rate of the reward probability of the unchosen option, whereas reward history effects primarily reflect environmental volatility. Based on observed choice history effects in animals, we develop a reinforcement learning model that explicitly incorporates choice history over multiple time scales into the decision process, and we assess its predictive adequacy in accounting for the associated behavior. We show that this new variant, known as the double trace model, has a higher performance in predicting choice data, and shows near optimal reward harvesting efficiency in simulated environments. These results suggests that choice history effects may be adaptive for natural contingencies between consumption and reward availability. This concept lends credence to a normative account of choice history effects that extends beyond its description as a bias. Public Library of Science 2021-10-04 /pmc/articles/PMC8516315/ /pubmed/34606493 http://dx.doi.org/10.1371/journal.pcbi.1009452 Text en © 2021 López-Yépez et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article López-Yépez, Junior Samuel Martin, Juliane Hulme, Oliver Kvitsiani, Duda Choice history effects in mice and humans improve reward harvesting efficiency
title	Choice history effects in mice and humans improve reward harvesting efficiency
title_full	Choice history effects in mice and humans improve reward harvesting efficiency
title_fullStr	Choice history effects in mice and humans improve reward harvesting efficiency
title_full_unstemmed	Choice history effects in mice and humans improve reward harvesting efficiency
title_short	Choice history effects in mice and humans improve reward harvesting efficiency
title_sort	choice history effects in mice and humans improve reward harvesting efficiency
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8516315/ https://www.ncbi.nlm.nih.gov/pubmed/34606493 http://dx.doi.org/10.1371/journal.pcbi.1009452
work_keys_str_mv	AT lopezyepezjuniorsamuel choicehistoryeffectsinmiceandhumansimproverewardharvestingefficiency AT martinjuliane choicehistoryeffectsinmiceandhumansimproverewardharvestingefficiency AT hulmeoliver choicehistoryeffectsinmiceandhumansimproverewardharvestingefficiency AT kvitsianiduda choicehistoryeffectsinmiceandhumansimproverewardharvestingefficiency

Choice history effects in mice and humans improve reward harvesting efficiency

Ejemplares similares