Cargando…

Efficiency and prioritization of inference-based credit assignment

Organisms adapt to their environments by learning to approach states that predict rewards and avoid states associated with punishments. Knowledge about the affective value of states often relies on credit assignment (CA), whereby state values are updated on the basis of reward feedback. Remarkably,...

Descripción completa

Detalles Bibliográficos
Autores principales: Moran, Rani, Dayan, Peter, Dolan, Raymond J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cell Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8279739/
https://www.ncbi.nlm.nih.gov/pubmed/33887181
http://dx.doi.org/10.1016/j.cub.2021.03.091
_version_ 1783722507892686848
author Moran, Rani
Dayan, Peter
Dolan, Raymond J.
author_facet Moran, Rani
Dayan, Peter
Dolan, Raymond J.
author_sort Moran, Rani
collection PubMed
description Organisms adapt to their environments by learning to approach states that predict rewards and avoid states associated with punishments. Knowledge about the affective value of states often relies on credit assignment (CA), whereby state values are updated on the basis of reward feedback. Remarkably, humans assign credit to states that are not observed but are instead inferred based on a cognitive map that represents structural knowledge of an environment. A pertinent example is authors attempting to infer the identity of anonymous reviewers to assign them credit or blame and, on this basis, inform future referee recommendations. Although inference is cognitively costly, it is unknown how it influences CA or how it is apportioned between hidden and observable states (for example, both anonymous and revealed reviewers). We addressed these questions in a task that provided choices between lotteries where each led to a unique pair of occasionally rewarding outcome states. On some trials, both states were observable (rendering inference nugatory), whereas on others, the identity of one of the states was concealed. Importantly, by exploiting knowledge of choice-state associations, subjects could infer the identity of this hidden state. We show that having to perform inference reduces state-value updates. Strikingly, and in violation of normative theories, this reduction in CA was selective for the observed outcome alone. These findings have implications for the operation of putative cognitive maps.
format Online
Article
Text
id pubmed-8279739
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cell Press
record_format MEDLINE/PubMed
spelling pubmed-82797392021-07-20 Efficiency and prioritization of inference-based credit assignment Moran, Rani Dayan, Peter Dolan, Raymond J. Curr Biol Article Organisms adapt to their environments by learning to approach states that predict rewards and avoid states associated with punishments. Knowledge about the affective value of states often relies on credit assignment (CA), whereby state values are updated on the basis of reward feedback. Remarkably, humans assign credit to states that are not observed but are instead inferred based on a cognitive map that represents structural knowledge of an environment. A pertinent example is authors attempting to infer the identity of anonymous reviewers to assign them credit or blame and, on this basis, inform future referee recommendations. Although inference is cognitively costly, it is unknown how it influences CA or how it is apportioned between hidden and observable states (for example, both anonymous and revealed reviewers). We addressed these questions in a task that provided choices between lotteries where each led to a unique pair of occasionally rewarding outcome states. On some trials, both states were observable (rendering inference nugatory), whereas on others, the identity of one of the states was concealed. Importantly, by exploiting knowledge of choice-state associations, subjects could infer the identity of this hidden state. We show that having to perform inference reduces state-value updates. Strikingly, and in violation of normative theories, this reduction in CA was selective for the observed outcome alone. These findings have implications for the operation of putative cognitive maps. Cell Press 2021-07-12 /pmc/articles/PMC8279739/ /pubmed/33887181 http://dx.doi.org/10.1016/j.cub.2021.03.091 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Moran, Rani
Dayan, Peter
Dolan, Raymond J.
Efficiency and prioritization of inference-based credit assignment
title Efficiency and prioritization of inference-based credit assignment
title_full Efficiency and prioritization of inference-based credit assignment
title_fullStr Efficiency and prioritization of inference-based credit assignment
title_full_unstemmed Efficiency and prioritization of inference-based credit assignment
title_short Efficiency and prioritization of inference-based credit assignment
title_sort efficiency and prioritization of inference-based credit assignment
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8279739/
https://www.ncbi.nlm.nih.gov/pubmed/33887181
http://dx.doi.org/10.1016/j.cub.2021.03.091
work_keys_str_mv AT moranrani efficiencyandprioritizationofinferencebasedcreditassignment
AT dayanpeter efficiencyandprioritizationofinferencebasedcreditassignment
AT dolanraymondj efficiencyandprioritizationofinferencebasedcreditassignment