Cargando…
A Reinforcement Learning Model of Precommitment in Decision Making
Addiction and many other disorders are linked to impulsivity, where a suboptimal choice is preferred when it is immediately available. One solution to impulsivity is precommitment: constraining one's future to avoid being offered a suboptimal choice. A form of impulsivity can be measured experi...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Frontiers Research Foundation
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3004435/ https://www.ncbi.nlm.nih.gov/pubmed/21179584 http://dx.doi.org/10.3389/fnbeh.2010.00184 |
_version_ | 1782193980475179008 |
---|---|
author | Kurth-Nelson, Zeb Redish, A. David |
author_facet | Kurth-Nelson, Zeb Redish, A. David |
author_sort | Kurth-Nelson, Zeb |
collection | PubMed |
description | Addiction and many other disorders are linked to impulsivity, where a suboptimal choice is preferred when it is immediately available. One solution to impulsivity is precommitment: constraining one's future to avoid being offered a suboptimal choice. A form of impulsivity can be measured experimentally by offering a choice between a smaller reward delivered sooner and a larger reward delivered later. Impulsive subjects are more likely to select the smaller-sooner choice; however, when offered an option to precommit, even impulsive subjects can precommit to the larger-later choice. To precommit or not is a decision between two conditions: (A) the original choice (smaller-sooner vs. larger-later), and (B) a new condition with only larger-later available. It has been observed that precommitment appears as a consequence of the preference reversal inherent in non-exponential delay-discounting. Here we show that most models of hyperbolic discounting cannot precommit, but a distributed model of hyperbolic discounting does precommit. Using this model, we find (1) faster discounters may be more or less likely than slow discounters to precommit, depending on the precommitment delay, (2) for a constant smaller-sooner vs. larger-later preference, a higher ratio of larger reward to smaller reward increases the probability of precommitment, and (3) precommitment is highly sensitive to the shape of the discount curve. These predictions imply that manipulations that alter the discount curve, such as diet or context, may qualitatively affect precommitment. |
format | Text |
id | pubmed-3004435 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Frontiers Research Foundation |
record_format | MEDLINE/PubMed |
spelling | pubmed-30044352010-12-21 A Reinforcement Learning Model of Precommitment in Decision Making Kurth-Nelson, Zeb Redish, A. David Front Behav Neurosci Neuroscience Addiction and many other disorders are linked to impulsivity, where a suboptimal choice is preferred when it is immediately available. One solution to impulsivity is precommitment: constraining one's future to avoid being offered a suboptimal choice. A form of impulsivity can be measured experimentally by offering a choice between a smaller reward delivered sooner and a larger reward delivered later. Impulsive subjects are more likely to select the smaller-sooner choice; however, when offered an option to precommit, even impulsive subjects can precommit to the larger-later choice. To precommit or not is a decision between two conditions: (A) the original choice (smaller-sooner vs. larger-later), and (B) a new condition with only larger-later available. It has been observed that precommitment appears as a consequence of the preference reversal inherent in non-exponential delay-discounting. Here we show that most models of hyperbolic discounting cannot precommit, but a distributed model of hyperbolic discounting does precommit. Using this model, we find (1) faster discounters may be more or less likely than slow discounters to precommit, depending on the precommitment delay, (2) for a constant smaller-sooner vs. larger-later preference, a higher ratio of larger reward to smaller reward increases the probability of precommitment, and (3) precommitment is highly sensitive to the shape of the discount curve. These predictions imply that manipulations that alter the discount curve, such as diet or context, may qualitatively affect precommitment. Frontiers Research Foundation 2010-12-14 /pmc/articles/PMC3004435/ /pubmed/21179584 http://dx.doi.org/10.3389/fnbeh.2010.00184 Text en Copyright © 2010 Kurth-Nelson and Redish. http://www.frontiersin.org/licenseagreement This is an open-access article subject to an exclusive license agreement between the authors and the Frontiers Research Foundation, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are credited. |
spellingShingle | Neuroscience Kurth-Nelson, Zeb Redish, A. David A Reinforcement Learning Model of Precommitment in Decision Making |
title | A Reinforcement Learning Model of Precommitment in Decision Making |
title_full | A Reinforcement Learning Model of Precommitment in Decision Making |
title_fullStr | A Reinforcement Learning Model of Precommitment in Decision Making |
title_full_unstemmed | A Reinforcement Learning Model of Precommitment in Decision Making |
title_short | A Reinforcement Learning Model of Precommitment in Decision Making |
title_sort | reinforcement learning model of precommitment in decision making |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3004435/ https://www.ncbi.nlm.nih.gov/pubmed/21179584 http://dx.doi.org/10.3389/fnbeh.2010.00184 |
work_keys_str_mv | AT kurthnelsonzeb areinforcementlearningmodelofprecommitmentindecisionmaking AT redishadavid areinforcementlearningmodelofprecommitmentindecisionmaking AT kurthnelsonzeb reinforcementlearningmodelofprecommitmentindecisionmaking AT redishadavid reinforcementlearningmodelofprecommitmentindecisionmaking |