Cargando…

How pupil responses track value-based decision-making during and after reinforcement learning

Cognition can reveal itself in the pupil, as latent cognitive processes map onto specific pupil responses. For instance, the pupil dilates when we make decisions and these pupil size fluctuations reflect decision-making computations during and after a choice. Surprisingly little is known, however, a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Van Slooten, Joanne C., Jahfari, Sara, Knapen, Tomas, Theeuwes, Jan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6291167/ https://www.ncbi.nlm.nih.gov/pubmed/30500813 http://dx.doi.org/10.1371/journal.pcbi.1006632

_version_	1783380218372685824
author	Van Slooten, Joanne C. Jahfari, Sara Knapen, Tomas Theeuwes, Jan
author_facet	Van Slooten, Joanne C. Jahfari, Sara Knapen, Tomas Theeuwes, Jan
author_sort	Van Slooten, Joanne C.
collection	PubMed
description	Cognition can reveal itself in the pupil, as latent cognitive processes map onto specific pupil responses. For instance, the pupil dilates when we make decisions and these pupil size fluctuations reflect decision-making computations during and after a choice. Surprisingly little is known, however, about how pupil responses relate to decisions driven by the learned value of stimuli. This understanding is important, as most real-life decisions are guided by the outcomes of earlier choices. The goal of this study was to investigate which cognitive processes the pupil reflects during value-based decision-making. We used a reinforcement learning task to study pupil responses during value-based decisions and subsequent decision evaluations, employing computational modeling to quantitatively describe the underlying cognitive processes. We found that the pupil closely tracks reinforcement learning processes independently across participants and across trials. Prior to choice, the pupil dilated as a function of trial-by-trial fluctuations in value beliefs about the to-be chosen option and predicted an individual’s tendency to exploit high value options. After feedback a biphasic pupil response was observed, the amplitude of which correlated with participants’ learning rates. Furthermore, across trials, early feedback-related dilation scaled with value uncertainty, whereas later constriction scaled with signed reward prediction errors. These findings show that pupil size fluctuations can provide detailed information about the computations underlying value-based decisions and the subsequent updating of value beliefs. As these processes are affected in a host of psychiatric disorders, our results indicate that pupillometry can be used as an accessible tool to non-invasively study the processes underlying ongoing reinforcement learning in the clinic.
format	Online Article Text
id	pubmed-6291167
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-62911672018-12-28 How pupil responses track value-based decision-making during and after reinforcement learning Van Slooten, Joanne C. Jahfari, Sara Knapen, Tomas Theeuwes, Jan PLoS Comput Biol Research Article Cognition can reveal itself in the pupil, as latent cognitive processes map onto specific pupil responses. For instance, the pupil dilates when we make decisions and these pupil size fluctuations reflect decision-making computations during and after a choice. Surprisingly little is known, however, about how pupil responses relate to decisions driven by the learned value of stimuli. This understanding is important, as most real-life decisions are guided by the outcomes of earlier choices. The goal of this study was to investigate which cognitive processes the pupil reflects during value-based decision-making. We used a reinforcement learning task to study pupil responses during value-based decisions and subsequent decision evaluations, employing computational modeling to quantitatively describe the underlying cognitive processes. We found that the pupil closely tracks reinforcement learning processes independently across participants and across trials. Prior to choice, the pupil dilated as a function of trial-by-trial fluctuations in value beliefs about the to-be chosen option and predicted an individual’s tendency to exploit high value options. After feedback a biphasic pupil response was observed, the amplitude of which correlated with participants’ learning rates. Furthermore, across trials, early feedback-related dilation scaled with value uncertainty, whereas later constriction scaled with signed reward prediction errors. These findings show that pupil size fluctuations can provide detailed information about the computations underlying value-based decisions and the subsequent updating of value beliefs. As these processes are affected in a host of psychiatric disorders, our results indicate that pupillometry can be used as an accessible tool to non-invasively study the processes underlying ongoing reinforcement learning in the clinic. Public Library of Science 2018-11-30 /pmc/articles/PMC6291167/ /pubmed/30500813 http://dx.doi.org/10.1371/journal.pcbi.1006632 Text en © 2018 Van Slooten et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Van Slooten, Joanne C. Jahfari, Sara Knapen, Tomas Theeuwes, Jan How pupil responses track value-based decision-making during and after reinforcement learning
title	How pupil responses track value-based decision-making during and after reinforcement learning
title_full	How pupil responses track value-based decision-making during and after reinforcement learning
title_fullStr	How pupil responses track value-based decision-making during and after reinforcement learning
title_full_unstemmed	How pupil responses track value-based decision-making during and after reinforcement learning
title_short	How pupil responses track value-based decision-making during and after reinforcement learning
title_sort	how pupil responses track value-based decision-making during and after reinforcement learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6291167/ https://www.ncbi.nlm.nih.gov/pubmed/30500813 http://dx.doi.org/10.1371/journal.pcbi.1006632
work_keys_str_mv	AT vanslootenjoannec howpupilresponsestrackvaluebaseddecisionmakingduringandafterreinforcementlearning AT jahfarisara howpupilresponsestrackvaluebaseddecisionmakingduringandafterreinforcementlearning AT knapentomas howpupilresponsestrackvaluebaseddecisionmakingduringandafterreinforcementlearning AT theeuwesjan howpupilresponsestrackvaluebaseddecisionmakingduringandafterreinforcementlearning

How pupil responses track value-based decision-making during and after reinforcement learning

Ejemplares similares