Cargando…

Scaling prediction errors to reward variability benefits error-driven learning in humans

Effective error-driven learning requires individuals to adapt learning to environmental reward variability. The adaptive mechanism may involve decays in learning rate across subsequent trials, as shown previously, and rescaling of reward prediction errors. The present study investigated the influenc...

Descripción completa

Detalles Bibliográficos
Autores principales: Diederen, Kelly M. J., Schultz, Wolfram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Physiological Society 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4563025/
https://www.ncbi.nlm.nih.gov/pubmed/26180123
http://dx.doi.org/10.1152/jn.00483.2015
_version_ 1782389228979617792
author Diederen, Kelly M. J.
Schultz, Wolfram
author_facet Diederen, Kelly M. J.
Schultz, Wolfram
author_sort Diederen, Kelly M. J.
collection PubMed
description Effective error-driven learning requires individuals to adapt learning to environmental reward variability. The adaptive mechanism may involve decays in learning rate across subsequent trials, as shown previously, and rescaling of reward prediction errors. The present study investigated the influence of prediction error scaling and, in particular, the consequences for learning performance. Participants explicitly predicted reward magnitudes that were drawn from different probability distributions with specific standard deviations. By fitting the data with reinforcement learning models, we found scaling of prediction errors, in addition to the learning rate decay shown previously. Importantly, the prediction error scaling was closely related to learning performance, defined as accuracy in predicting the mean of reward distributions, across individual participants. In addition, participants who scaled prediction errors relative to standard deviation also presented with more similar performance for different standard deviations, indicating that increases in standard deviation did not substantially decrease “adapters'” accuracy in predicting the means of reward distributions. However, exaggerated scaling beyond the standard deviation resulted in impaired performance. Thus efficient adaptation makes learning more robust to changing variability.
format Online
Article
Text
id pubmed-4563025
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher American Physiological Society
record_format MEDLINE/PubMed
spelling pubmed-45630252015-09-11 Scaling prediction errors to reward variability benefits error-driven learning in humans Diederen, Kelly M. J. Schultz, Wolfram J Neurophysiol Higher Neural Functions and Behavior Effective error-driven learning requires individuals to adapt learning to environmental reward variability. The adaptive mechanism may involve decays in learning rate across subsequent trials, as shown previously, and rescaling of reward prediction errors. The present study investigated the influence of prediction error scaling and, in particular, the consequences for learning performance. Participants explicitly predicted reward magnitudes that were drawn from different probability distributions with specific standard deviations. By fitting the data with reinforcement learning models, we found scaling of prediction errors, in addition to the learning rate decay shown previously. Importantly, the prediction error scaling was closely related to learning performance, defined as accuracy in predicting the mean of reward distributions, across individual participants. In addition, participants who scaled prediction errors relative to standard deviation also presented with more similar performance for different standard deviations, indicating that increases in standard deviation did not substantially decrease “adapters'” accuracy in predicting the means of reward distributions. However, exaggerated scaling beyond the standard deviation resulted in impaired performance. Thus efficient adaptation makes learning more robust to changing variability. American Physiological Society 2015-07-15 2015-09 /pmc/articles/PMC4563025/ /pubmed/26180123 http://dx.doi.org/10.1152/jn.00483.2015 Text en Copyright © 2015 the American Physiological Society Licensed under Creative Commons Attribution CC-BY 3.0 (http://creativecommons.org/licenses/by/3.0/deed.en_US) : © the American Physiological Society.
spellingShingle Higher Neural Functions and Behavior
Diederen, Kelly M. J.
Schultz, Wolfram
Scaling prediction errors to reward variability benefits error-driven learning in humans
title Scaling prediction errors to reward variability benefits error-driven learning in humans
title_full Scaling prediction errors to reward variability benefits error-driven learning in humans
title_fullStr Scaling prediction errors to reward variability benefits error-driven learning in humans
title_full_unstemmed Scaling prediction errors to reward variability benefits error-driven learning in humans
title_short Scaling prediction errors to reward variability benefits error-driven learning in humans
title_sort scaling prediction errors to reward variability benefits error-driven learning in humans
topic Higher Neural Functions and Behavior
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4563025/
https://www.ncbi.nlm.nih.gov/pubmed/26180123
http://dx.doi.org/10.1152/jn.00483.2015
work_keys_str_mv AT diederenkellymj scalingpredictionerrorstorewardvariabilitybenefitserrordrivenlearninginhumans
AT schultzwolfram scalingpredictionerrorstorewardvariabilitybenefitserrordrivenlearninginhumans