Cargando…
Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma
When making decisions, should one exploit known good options or explore potentially better alternatives? Exploration of spatially unstructured options depends on the neocortex, striatum, and amygdala. In natural environments, however, better options often cluster together, forming structured value d...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7589536/ https://www.ncbi.nlm.nih.gov/pubmed/33106508 http://dx.doi.org/10.1038/s41467-020-18864-0 |
_version_ | 1783600600933466112 |
---|---|
author | Dombrovski, Alexandre Y. Luna, Beatriz Hallquist, Michael N. |
author_facet | Dombrovski, Alexandre Y. Luna, Beatriz Hallquist, Michael N. |
author_sort | Dombrovski, Alexandre Y. |
collection | PubMed |
description | When making decisions, should one exploit known good options or explore potentially better alternatives? Exploration of spatially unstructured options depends on the neocortex, striatum, and amygdala. In natural environments, however, better options often cluster together, forming structured value distributions. The hippocampus binds reward information into allocentric cognitive maps to support navigation and foraging in such spaces. Here we report that human posterior hippocampus (PH) invigorates exploration while anterior hippocampus (AH) supports the transition to exploitation on a reinforcement learning task with a spatially structured reward function. These dynamics depend on differential reinforcement representations in the PH and AH. Whereas local reward prediction error signals are early and phasic in the PH tail, global value maximum signals are delayed and sustained in the AH body. AH compresses reinforcement information across episodes, updating the location and prominence of the value maximum and displaying goal cell-like ramping activity when navigating toward it. |
format | Online Article Text |
id | pubmed-7589536 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-75895362020-11-10 Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma Dombrovski, Alexandre Y. Luna, Beatriz Hallquist, Michael N. Nat Commun Article When making decisions, should one exploit known good options or explore potentially better alternatives? Exploration of spatially unstructured options depends on the neocortex, striatum, and amygdala. In natural environments, however, better options often cluster together, forming structured value distributions. The hippocampus binds reward information into allocentric cognitive maps to support navigation and foraging in such spaces. Here we report that human posterior hippocampus (PH) invigorates exploration while anterior hippocampus (AH) supports the transition to exploitation on a reinforcement learning task with a spatially structured reward function. These dynamics depend on differential reinforcement representations in the PH and AH. Whereas local reward prediction error signals are early and phasic in the PH tail, global value maximum signals are delayed and sustained in the AH body. AH compresses reinforcement information across episodes, updating the location and prominence of the value maximum and displaying goal cell-like ramping activity when navigating toward it. Nature Publishing Group UK 2020-10-26 /pmc/articles/PMC7589536/ /pubmed/33106508 http://dx.doi.org/10.1038/s41467-020-18864-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Dombrovski, Alexandre Y. Luna, Beatriz Hallquist, Michael N. Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma |
title | Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma |
title_full | Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma |
title_fullStr | Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma |
title_full_unstemmed | Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma |
title_short | Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma |
title_sort | differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7589536/ https://www.ncbi.nlm.nih.gov/pubmed/33106508 http://dx.doi.org/10.1038/s41467-020-18864-0 |
work_keys_str_mv | AT dombrovskialexandrey differentialreinforcementencodingalongthehippocampallongaxishelpsresolvetheexploreexploitdilemma AT lunabeatriz differentialreinforcementencodingalongthehippocampallongaxishelpsresolvetheexploreexploitdilemma AT hallquistmichaeln differentialreinforcementencodingalongthehippocampallongaxishelpsresolvetheexploreexploitdilemma |