Cargando…
Asymmetric and adaptive reward coding via normalized reinforcement learning
Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9345478/ https://www.ncbi.nlm.nih.gov/pubmed/35862443 http://dx.doi.org/10.1371/journal.pcbi.1010350 |
_version_ | 1784761442624339968 |
---|---|
author | Louie, Kenway |
author_facet | Louie, Kenway |
author_sort | Louie, Kenway |
collection | PubMed |
description | Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making. |
format | Online Article Text |
id | pubmed-9345478 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-93454782022-08-03 Asymmetric and adaptive reward coding via normalized reinforcement learning Louie, Kenway PLoS Comput Biol Research Article Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making. Public Library of Science 2022-07-21 /pmc/articles/PMC9345478/ /pubmed/35862443 http://dx.doi.org/10.1371/journal.pcbi.1010350 Text en © 2022 Kenway Louie https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Louie, Kenway Asymmetric and adaptive reward coding via normalized reinforcement learning |
title | Asymmetric and adaptive reward coding via normalized reinforcement learning |
title_full | Asymmetric and adaptive reward coding via normalized reinforcement learning |
title_fullStr | Asymmetric and adaptive reward coding via normalized reinforcement learning |
title_full_unstemmed | Asymmetric and adaptive reward coding via normalized reinforcement learning |
title_short | Asymmetric and adaptive reward coding via normalized reinforcement learning |
title_sort | asymmetric and adaptive reward coding via normalized reinforcement learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9345478/ https://www.ncbi.nlm.nih.gov/pubmed/35862443 http://dx.doi.org/10.1371/journal.pcbi.1010350 |
work_keys_str_mv | AT louiekenway asymmetricandadaptiverewardcodingvianormalizedreinforcementlearning |