Cargando…

Asymmetric and adaptive reward coding via normalized reinforcement learning

Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational...

Descripción completa

Detalles Bibliográficos
Autor principal:	Louie, Kenway
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9345478/ https://www.ncbi.nlm.nih.gov/pubmed/35862443 http://dx.doi.org/10.1371/journal.pcbi.1010350

_version_	1784761442624339968
author	Louie, Kenway
author_facet	Louie, Kenway
author_sort	Louie, Kenway
collection	PubMed
description	Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making.
format	Online Article Text
id	pubmed-9345478
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-93454782022-08-03 Asymmetric and adaptive reward coding via normalized reinforcement learning Louie, Kenway PLoS Comput Biol Research Article Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making. Public Library of Science 2022-07-21 /pmc/articles/PMC9345478/ /pubmed/35862443 http://dx.doi.org/10.1371/journal.pcbi.1010350 Text en © 2022 Kenway Louie https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Louie, Kenway Asymmetric and adaptive reward coding via normalized reinforcement learning
title	Asymmetric and adaptive reward coding via normalized reinforcement learning
title_full	Asymmetric and adaptive reward coding via normalized reinforcement learning
title_fullStr	Asymmetric and adaptive reward coding via normalized reinforcement learning
title_full_unstemmed	Asymmetric and adaptive reward coding via normalized reinforcement learning
title_short	Asymmetric and adaptive reward coding via normalized reinforcement learning
title_sort	asymmetric and adaptive reward coding via normalized reinforcement learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9345478/ https://www.ncbi.nlm.nih.gov/pubmed/35862443 http://dx.doi.org/10.1371/journal.pcbi.1010350
work_keys_str_mv	AT louiekenway asymmetricandadaptiverewardcodingvianormalizedreinforcementlearning

Asymmetric and adaptive reward coding via normalized reinforcement learning

Ejemplares similares