Cargando…

Asymmetric and adaptive reward coding via normalized reinforcement learning

Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational...

Descripción completa

Detalles Bibliográficos
Autor principal: Louie, Kenway
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9345478/
https://www.ncbi.nlm.nih.gov/pubmed/35862443
http://dx.doi.org/10.1371/journal.pcbi.1010350
_version_ 1784761442624339968
author Louie, Kenway
author_facet Louie, Kenway
author_sort Louie, Kenway
collection PubMed
description Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making.
format Online
Article
Text
id pubmed-9345478
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-93454782022-08-03 Asymmetric and adaptive reward coding via normalized reinforcement learning Louie, Kenway PLoS Comput Biol Research Article Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making. Public Library of Science 2022-07-21 /pmc/articles/PMC9345478/ /pubmed/35862443 http://dx.doi.org/10.1371/journal.pcbi.1010350 Text en © 2022 Kenway Louie https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Louie, Kenway
Asymmetric and adaptive reward coding via normalized reinforcement learning
title Asymmetric and adaptive reward coding via normalized reinforcement learning
title_full Asymmetric and adaptive reward coding via normalized reinforcement learning
title_fullStr Asymmetric and adaptive reward coding via normalized reinforcement learning
title_full_unstemmed Asymmetric and adaptive reward coding via normalized reinforcement learning
title_short Asymmetric and adaptive reward coding via normalized reinforcement learning
title_sort asymmetric and adaptive reward coding via normalized reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9345478/
https://www.ncbi.nlm.nih.gov/pubmed/35862443
http://dx.doi.org/10.1371/journal.pcbi.1010350
work_keys_str_mv AT louiekenway asymmetricandadaptiverewardcodingvianormalizedreinforcementlearning