Dopamine Reward Prediction Error Responses Reflect Marginal Utility

BACKGROUND: Optimal choices require an accurate neuronal representation of economic value. In economics, utility functions are mathematical representations of subjective value that can be constructed from choices under risk. Utility usually exhibits a nonlinear relationship to physical reward value...

Bibliographic Details
Main Authors: Stauffer, William R., Lak, Armin, Schultz, Wolfram
Format: Online Article Text
Language: English
Published: Cell Press 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4228052/
https://www.ncbi.nlm.nih.gov/pubmed/25283778
http://dx.doi.org/10.1016/j.cub.2014.08.064
author Stauffer, William R.
Lak, Armin
Schultz, Wolfram
collection PubMed
description BACKGROUND: Optimal choices require an accurate neuronal representation of economic value. In economics, utility functions are mathematical representations of subjective value that can be constructed from choices under risk. Utility usually exhibits a nonlinear relationship to physical reward value that corresponds to risk attitudes and reflects the increasing or decreasing marginal utility obtained with each additional unit of reward. Accordingly, neuronal reward responses coding utility should robustly reflect this nonlinearity. RESULTS: In two monkeys, we measured utility as a function of physical reward value from meaningful choices under risk (that adhered to first- and second-order stochastic dominance). The resulting nonlinear utility functions predicted the certainty equivalents for new gambles, indicating that the functions’ shapes were meaningful. The monkeys were risk seeking (convex utility function) for low reward and risk avoiding (concave utility function) with higher amounts. Critically, the dopamine prediction error responses at the time of reward itself reflected the nonlinear utility functions measured at the time of choices. In particular, the reward response magnitude depended on the first derivative of the utility function and thus reflected the marginal utility. Furthermore, dopamine responses recorded outside of the task reflected the marginal utility of unpredicted reward. Accordingly, these responses were sufficient to train reinforcement learning models to predict the behaviorally defined expected utility of gambles. CONCLUSIONS: These data suggest a neuronal manifestation of marginal utility in dopamine neurons and indicate a common neuronal basis for fundamental explanatory constructs in animal learning theory (prediction error) and economic decision theory (marginal utility).
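The relationships named in the abstract (utility, marginal utility as its first derivative, certainty equivalents, and prediction-error learning of expected utility) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the article: the logistic form of `utility`, its `midpoint` and `steepness` parameters, the arbitrary reward units, the example gambles, and the simple Rescorla-Wagner style learner. It is not the authors' measured utility function or their reinforcement learning model.

```python
import math
import random


def utility(x, midpoint=0.6, steepness=8.0):
    """Hypothetical S-shaped utility: convex below `midpoint`, concave above it."""
    return 1.0 / (1.0 + math.exp(-steepness * (x - midpoint)))


def marginal_utility(x, dx=1e-5):
    """First derivative of the utility function, estimated by a central difference."""
    return (utility(x + dx) - utility(x - dx)) / (2.0 * dx)


def expected_utility(outcomes, probs):
    """Probability-weighted mean of the utilities of a gamble's outcomes."""
    return sum(p * utility(x) for x, p in zip(outcomes, probs))


def certainty_equivalent(outcomes, probs, lo=0.0, hi=1.2, tol=1e-9):
    """Safe amount whose utility equals the gamble's expected utility.

    Found by bisection, which works because `utility` is strictly increasing.
    Assumes all outcomes lie within [lo, hi].
    """
    eu = expected_utility(outcomes, probs)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if utility(mid) < eu:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def learn_expected_utility(outcomes, probs, alpha=0.05, trials=5000, seed=0):
    """Learn a gamble's value from prediction errors expressed in utility units."""
    rng = random.Random(seed)
    value = 0.0
    for _ in range(trials):
        reward = rng.choices(outcomes, weights=probs)[0]
        prediction_error = utility(reward) - value  # received minus predicted
        value += alpha * prediction_error
    return value


if __name__ == "__main__":
    low_gamble = ([0.1, 0.4], [0.5, 0.5])   # both outcomes in the convex range
    high_gamble = ([0.8, 1.1], [0.5, 0.5])  # both outcomes in the concave range
    for name, (outcomes, probs) in (("low", low_gamble), ("high", high_gamble)):
        ev = sum(p * x for x, p in zip(outcomes, probs))
        ce = certainty_equivalent(outcomes, probs)
        learned = learn_expected_utility(outcomes, probs)
        print(name, "EV:", ev, "CE:", ce, "learned value:", learned)
```

Under these assumptions, the certainty equivalent exceeds the expected value for the low gamble (risk seeking) and falls below it for the high gamble (risk avoiding), and the learned value settles near the gamble's expected utility, with small fluctuations set by the learning rate `alpha`.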
format Online
Article
Text
id pubmed-4228052
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Cell Press
record_format MEDLINE/PubMed
spelling pubmed-4228052 2014-11-13 Dopamine Reward Prediction Error Responses Reflect Marginal Utility Stauffer, William R. Lak, Armin Schultz, Wolfram Curr Biol Article Cell Press 2014-11-03 /pmc/articles/PMC4228052/ /pubmed/25283778 http://dx.doi.org/10.1016/j.cub.2014.08.064 Text en © 2014 The Authors. This work is licensed under a Creative Commons Attribution 3.0 Unported License (https://creativecommons.org/licenses/by/3.0/).
title Dopamine Reward Prediction Error Responses Reflect Marginal Utility
topic Article