Cargando…

Confidence modulates exploration and exploitation in value-based learning

Uncertainty is ubiquitous in cognitive processing. In this study, we aim to investigate the ability agents possess to track and report the noise inherent in their mental operations, often in the form of confidence judgments. Here, we argue that humans can use uncertainty inherent in their representa...

Descripción completa

Detalles Bibliográficos
Autores principales: Boldt, Annika, Blundell, Charles, De Martino, Benedetto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6505439/
https://www.ncbi.nlm.nih.gov/pubmed/31086679
http://dx.doi.org/10.1093/nc/niz004
_version_ 1783416758081683456
author Boldt, Annika
Blundell, Charles
De Martino, Benedetto
author_facet Boldt, Annika
Blundell, Charles
De Martino, Benedetto
author_sort Boldt, Annika
collection PubMed
description Uncertainty is ubiquitous in cognitive processing. In this study, we aim to investigate the ability agents possess to track and report the noise inherent in their mental operations, often in the form of confidence judgments. Here, we argue that humans can use uncertainty inherent in their representations of value beliefs to arbitrate between exploration and exploitation. Such uncertainty is reflected in explicit confidence judgments. Using a novel variant of a multi-armed bandit paradigm, we studied how beliefs were formed and how uncertainty in the encoding of these value beliefs (belief confidence) evolved over time. We found that people used uncertainty to arbitrate between exploration and exploitation, reflected in a higher tendency toward exploration when their confidence in their value representations was low. We furthermore found that value uncertainty can be linked to frameworks of metacognition in decision making in two ways. First, belief confidence drives decision confidence, i.e. people’s evaluation of their own choices. Second, individuals with higher metacognitive insight into their choices were also better at tracing the uncertainty in their environment. Together, these findings argue that such uncertainty representations play a key role in the context of cognitive control.
format Online
Article
Text
id pubmed-6505439
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65054392019-05-13 Confidence modulates exploration and exploitation in value-based learning Boldt, Annika Blundell, Charles De Martino, Benedetto Neurosci Conscious Research Article Uncertainty is ubiquitous in cognitive processing. In this study, we aim to investigate the ability agents possess to track and report the noise inherent in their mental operations, often in the form of confidence judgments. Here, we argue that humans can use uncertainty inherent in their representations of value beliefs to arbitrate between exploration and exploitation. Such uncertainty is reflected in explicit confidence judgments. Using a novel variant of a multi-armed bandit paradigm, we studied how beliefs were formed and how uncertainty in the encoding of these value beliefs (belief confidence) evolved over time. We found that people used uncertainty to arbitrate between exploration and exploitation, reflected in a higher tendency toward exploration when their confidence in their value representations was low. We furthermore found that value uncertainty can be linked to frameworks of metacognition in decision making in two ways. First, belief confidence drives decision confidence, i.e. people’s evaluation of their own choices. Second, individuals with higher metacognitive insight into their choices were also better at tracing the uncertainty in their environment. Together, these findings argue that such uncertainty representations play a key role in the context of cognitive control. Oxford University Press 2019-05-08 /pmc/articles/PMC6505439/ /pubmed/31086679 http://dx.doi.org/10.1093/nc/niz004 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Boldt, Annika
Blundell, Charles
De Martino, Benedetto
Confidence modulates exploration and exploitation in value-based learning
title Confidence modulates exploration and exploitation in value-based learning
title_full Confidence modulates exploration and exploitation in value-based learning
title_fullStr Confidence modulates exploration and exploitation in value-based learning
title_full_unstemmed Confidence modulates exploration and exploitation in value-based learning
title_short Confidence modulates exploration and exploitation in value-based learning
title_sort confidence modulates exploration and exploitation in value-based learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6505439/
https://www.ncbi.nlm.nih.gov/pubmed/31086679
http://dx.doi.org/10.1093/nc/niz004
work_keys_str_mv AT boldtannika confidencemodulatesexplorationandexploitationinvaluebasedlearning
AT blundellcharles confidencemodulatesexplorationandexploitationinvaluebasedlearning
AT demartinobenedetto confidencemodulatesexplorationandexploitationinvaluebasedlearning