Cargando…

Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon

Learning from successes and failures often improves the quality of subsequent decisions. Past outcomes, however, should not influence purely perceptual decisions after task acquisition is complete since these are designed so that only sensory evidence determines the correct choice. Yet, numerous stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Lak, Armin, Hueske, Emily, Hirokawa, Junya, Masset, Paul, Ott, Torben, Urai, Anne E, Donner, Tobias H, Carandini, Matteo, Tonegawa, Susumu, Uchida, Naoshige, Kepecs, Adam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: eLife Sciences Publications, Ltd 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7213979/
https://www.ncbi.nlm.nih.gov/pubmed/32286227
http://dx.doi.org/10.7554/eLife.49834
_version_ 1783531893038252032
author Lak, Armin
Hueske, Emily
Hirokawa, Junya
Masset, Paul
Ott, Torben
Urai, Anne E
Donner, Tobias H
Carandini, Matteo
Tonegawa, Susumu
Uchida, Naoshige
Kepecs, Adam
author_facet Lak, Armin
Hueske, Emily
Hirokawa, Junya
Masset, Paul
Ott, Torben
Urai, Anne E
Donner, Tobias H
Carandini, Matteo
Tonegawa, Susumu
Uchida, Naoshige
Kepecs, Adam
author_sort Lak, Armin
collection PubMed
description Learning from successes and failures often improves the quality of subsequent decisions. Past outcomes, however, should not influence purely perceptual decisions after task acquisition is complete since these are designed so that only sensory evidence determines the correct choice. Yet, numerous studies report that outcomes can bias perceptual decisions, causing spurious changes in choice behavior without improving accuracy. Here we show that the effects of reward on perceptual decisions are principled: past rewards bias future choices specifically when previous choice was difficult and hence decision confidence was low. We identified this phenomenon in six datasets from four laboratories, across mice, rats, and humans, and sensory modalities from olfaction and audition to vision. We show that this choice-updating strategy can be explained by reinforcement learning models incorporating statistical decision confidence into their teaching signals. Thus, reinforcement learning mechanisms are continually engaged to produce systematic adjustments of choices even in well-learned perceptual decisions in order to optimize behavior in an uncertain world.
format Online
Article
Text
id pubmed-7213979
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher eLife Sciences Publications, Ltd
record_format MEDLINE/PubMed
spelling pubmed-72139792020-05-13 Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon Lak, Armin Hueske, Emily Hirokawa, Junya Masset, Paul Ott, Torben Urai, Anne E Donner, Tobias H Carandini, Matteo Tonegawa, Susumu Uchida, Naoshige Kepecs, Adam eLife Neuroscience Learning from successes and failures often improves the quality of subsequent decisions. Past outcomes, however, should not influence purely perceptual decisions after task acquisition is complete since these are designed so that only sensory evidence determines the correct choice. Yet, numerous studies report that outcomes can bias perceptual decisions, causing spurious changes in choice behavior without improving accuracy. Here we show that the effects of reward on perceptual decisions are principled: past rewards bias future choices specifically when previous choice was difficult and hence decision confidence was low. We identified this phenomenon in six datasets from four laboratories, across mice, rats, and humans, and sensory modalities from olfaction and audition to vision. We show that this choice-updating strategy can be explained by reinforcement learning models incorporating statistical decision confidence into their teaching signals. Thus, reinforcement learning mechanisms are continually engaged to produce systematic adjustments of choices even in well-learned perceptual decisions in order to optimize behavior in an uncertain world. eLife Sciences Publications, Ltd 2020-04-15 /pmc/articles/PMC7213979/ /pubmed/32286227 http://dx.doi.org/10.7554/eLife.49834 Text en © 2020, Lak et al http://creativecommons.org/licenses/by/4.0/ http://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Neuroscience
Lak, Armin
Hueske, Emily
Hirokawa, Junya
Masset, Paul
Ott, Torben
Urai, Anne E
Donner, Tobias H
Carandini, Matteo
Tonegawa, Susumu
Uchida, Naoshige
Kepecs, Adam
Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon
title Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon
title_full Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon
title_fullStr Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon
title_full_unstemmed Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon
title_short Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon
title_sort reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7213979/
https://www.ncbi.nlm.nih.gov/pubmed/32286227
http://dx.doi.org/10.7554/eLife.49834
work_keys_str_mv AT lakarmin reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT hueskeemily reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT hirokawajunya reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT massetpaul reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT otttorben reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT uraiannee reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT donnertobiash reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT carandinimatteo reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT tonegawasusumu reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT uchidanaoshige reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon
AT kepecsadam reinforcementbiasessubsequentperceptualdecisionswhenconfidenceislowawidespreadbehavioralphenomenon