Cargando…

Entropy-based metrics for predicting choice behavior based on local response to reward

For decades, behavioral scientists have used the matching law to quantify how animals distribute their choices between multiple options in response to reinforcement they receive. More recently, many reinforcement learning (RL) models have been developed to explain choice by integrating reward feedba...

Descripción completa

Detalles Bibliográficos
Autores principales: Trepka, Ethan, Spitmaan, Mehran, Bari, Bilal A., Costa, Vincent D., Cohen, Jeremiah Y., Soltani, Alireza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8590026/
https://www.ncbi.nlm.nih.gov/pubmed/34772943
http://dx.doi.org/10.1038/s41467-021-26784-w
_version_ 1784598863112306688
author Trepka, Ethan
Spitmaan, Mehran
Bari, Bilal A.
Costa, Vincent D.
Cohen, Jeremiah Y.
Soltani, Alireza
author_facet Trepka, Ethan
Spitmaan, Mehran
Bari, Bilal A.
Costa, Vincent D.
Cohen, Jeremiah Y.
Soltani, Alireza
author_sort Trepka, Ethan
collection PubMed
description For decades, behavioral scientists have used the matching law to quantify how animals distribute their choices between multiple options in response to reinforcement they receive. More recently, many reinforcement learning (RL) models have been developed to explain choice by integrating reward feedback over time. Despite reasonable success of RL models in capturing choice on a trial-by-trial basis, these models cannot capture variability in matching behavior. To address this, we developed metrics based on information theory and applied them to choice data from dynamic learning tasks in mice and monkeys. We found that a single entropy-based metric can explain 50% and 41% of variance in matching in mice and monkeys, respectively. We then used limitations of existing RL models in capturing entropy-based metrics to construct more accurate models of choice. Together, our entropy-based metrics provide a model-free tool to predict adaptive choice behavior and reveal underlying neural mechanisms.
format Online
Article
Text
id pubmed-8590026
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-85900262021-11-15 Entropy-based metrics for predicting choice behavior based on local response to reward Trepka, Ethan Spitmaan, Mehran Bari, Bilal A. Costa, Vincent D. Cohen, Jeremiah Y. Soltani, Alireza Nat Commun Article For decades, behavioral scientists have used the matching law to quantify how animals distribute their choices between multiple options in response to reinforcement they receive. More recently, many reinforcement learning (RL) models have been developed to explain choice by integrating reward feedback over time. Despite reasonable success of RL models in capturing choice on a trial-by-trial basis, these models cannot capture variability in matching behavior. To address this, we developed metrics based on information theory and applied them to choice data from dynamic learning tasks in mice and monkeys. We found that a single entropy-based metric can explain 50% and 41% of variance in matching in mice and monkeys, respectively. We then used limitations of existing RL models in capturing entropy-based metrics to construct more accurate models of choice. Together, our entropy-based metrics provide a model-free tool to predict adaptive choice behavior and reveal underlying neural mechanisms. Nature Publishing Group UK 2021-11-12 /pmc/articles/PMC8590026/ /pubmed/34772943 http://dx.doi.org/10.1038/s41467-021-26784-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Trepka, Ethan
Spitmaan, Mehran
Bari, Bilal A.
Costa, Vincent D.
Cohen, Jeremiah Y.
Soltani, Alireza
Entropy-based metrics for predicting choice behavior based on local response to reward
title Entropy-based metrics for predicting choice behavior based on local response to reward
title_full Entropy-based metrics for predicting choice behavior based on local response to reward
title_fullStr Entropy-based metrics for predicting choice behavior based on local response to reward
title_full_unstemmed Entropy-based metrics for predicting choice behavior based on local response to reward
title_short Entropy-based metrics for predicting choice behavior based on local response to reward
title_sort entropy-based metrics for predicting choice behavior based on local response to reward
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8590026/
https://www.ncbi.nlm.nih.gov/pubmed/34772943
http://dx.doi.org/10.1038/s41467-021-26784-w
work_keys_str_mv AT trepkaethan entropybasedmetricsforpredictingchoicebehaviorbasedonlocalresponsetoreward
AT spitmaanmehran entropybasedmetricsforpredictingchoicebehaviorbasedonlocalresponsetoreward
AT baribilala entropybasedmetricsforpredictingchoicebehaviorbasedonlocalresponsetoreward
AT costavincentd entropybasedmetricsforpredictingchoicebehaviorbasedonlocalresponsetoreward
AT cohenjeremiahy entropybasedmetricsforpredictingchoicebehaviorbasedonlocalresponsetoreward
AT soltanialireza entropybasedmetricsforpredictingchoicebehaviorbasedonlocalresponsetoreward