Cargando…
Time elapsed between choices in a probabilistic task correlates with repeating the same decision
Reinforcement learning causes an action that yields a positive outcome more likely to be taken in the future. Here, we investigate how the time elapsed from an action affects subsequent decisions. Groups of C57BL6/J mice were housed in IntelliCages with access to water and chow ad libitum; they also...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8248175/ https://www.ncbi.nlm.nih.gov/pubmed/33559232 http://dx.doi.org/10.1111/ejn.15144 |
_version_ | 1783716672672104448 |
---|---|
author | Jabłońska, Judyta Szumiec, Łukasz Zieliński, Piotr Rodriguez Parkitna, Jan |
author_facet | Jabłońska, Judyta Szumiec, Łukasz Zieliński, Piotr Rodriguez Parkitna, Jan |
author_sort | Jabłońska, Judyta |
collection | PubMed |
description | Reinforcement learning causes an action that yields a positive outcome more likely to be taken in the future. Here, we investigate how the time elapsed from an action affects subsequent decisions. Groups of C57BL6/J mice were housed in IntelliCages with access to water and chow ad libitum; they also had access to bottles with a reward: saccharin solution, alcohol, or a mixture of the two. The probability of receiving a reward in two of the cage corners changed between 0.9 and 0.3 every 48 hr over a period of ~33 days. As expected, in most animals, the odds of repeating a corner choice were increased if that choice was previously rewarded. Interestingly, the time elapsed from the previous choice also influenced the probability of repeating the choice, and this effect was independent of previous outcome. Behavioral data were fitted to a series of reinforcement learning models. Best fits were achieved when the reward prediction update was coupled with separate learning rates from positive and negative outcomes and additionally a “fictitious” update of the expected value of the nonselected choice. Additional inclusion of a time‐dependent decay of the expected values improved the fit marginally in some cases. |
format | Online Article Text |
id | pubmed-8248175 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82481752021-07-06 Time elapsed between choices in a probabilistic task correlates with repeating the same decision Jabłońska, Judyta Szumiec, Łukasz Zieliński, Piotr Rodriguez Parkitna, Jan Eur J Neurosci Cognitive Neuroscience Reinforcement learning causes an action that yields a positive outcome more likely to be taken in the future. Here, we investigate how the time elapsed from an action affects subsequent decisions. Groups of C57BL6/J mice were housed in IntelliCages with access to water and chow ad libitum; they also had access to bottles with a reward: saccharin solution, alcohol, or a mixture of the two. The probability of receiving a reward in two of the cage corners changed between 0.9 and 0.3 every 48 hr over a period of ~33 days. As expected, in most animals, the odds of repeating a corner choice were increased if that choice was previously rewarded. Interestingly, the time elapsed from the previous choice also influenced the probability of repeating the choice, and this effect was independent of previous outcome. Behavioral data were fitted to a series of reinforcement learning models. Best fits were achieved when the reward prediction update was coupled with separate learning rates from positive and negative outcomes and additionally a “fictitious” update of the expected value of the nonselected choice. Additional inclusion of a time‐dependent decay of the expected values improved the fit marginally in some cases. John Wiley and Sons Inc. 2021-03-02 2021-04 /pmc/articles/PMC8248175/ /pubmed/33559232 http://dx.doi.org/10.1111/ejn.15144 Text en © 2021 The Authors. European Journal of Neuroscience published by Federation of European Neuroscience Societies and John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Cognitive Neuroscience Jabłońska, Judyta Szumiec, Łukasz Zieliński, Piotr Rodriguez Parkitna, Jan Time elapsed between choices in a probabilistic task correlates with repeating the same decision |
title | Time elapsed between choices in a probabilistic task correlates with repeating the same decision |
title_full | Time elapsed between choices in a probabilistic task correlates with repeating the same decision |
title_fullStr | Time elapsed between choices in a probabilistic task correlates with repeating the same decision |
title_full_unstemmed | Time elapsed between choices in a probabilistic task correlates with repeating the same decision |
title_short | Time elapsed between choices in a probabilistic task correlates with repeating the same decision |
title_sort | time elapsed between choices in a probabilistic task correlates with repeating the same decision |
topic | Cognitive Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8248175/ https://www.ncbi.nlm.nih.gov/pubmed/33559232 http://dx.doi.org/10.1111/ejn.15144 |
work_keys_str_mv | AT jabłonskajudyta timeelapsedbetweenchoicesinaprobabilistictaskcorrelateswithrepeatingthesamedecision AT szumiecłukasz timeelapsedbetweenchoicesinaprobabilistictaskcorrelateswithrepeatingthesamedecision AT zielinskipiotr timeelapsedbetweenchoicesinaprobabilistictaskcorrelateswithrepeatingthesamedecision AT rodriguezparkitnajan timeelapsedbetweenchoicesinaprobabilistictaskcorrelateswithrepeatingthesamedecision |