Cargando…

Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning

Humans routinely face novel environments in which they have to generalize in order to act adaptively. However, doing so involves the non-trivial challenge of deciding which aspects of a task domain to generalize. While it is sometimes appropriate to simply re-use a learned behavior, often adaptive g...

Descripción completa

Detalles Bibliográficos
Autores principales: Franklin, Nicholas T., Frank, Michael J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7179934/
https://www.ncbi.nlm.nih.gov/pubmed/32282795
http://dx.doi.org/10.1371/journal.pcbi.1007720
_version_ 1783525731217702912
author Franklin, Nicholas T.
Frank, Michael J.
author_facet Franklin, Nicholas T.
Frank, Michael J.
author_sort Franklin, Nicholas T.
collection PubMed
description Humans routinely face novel environments in which they have to generalize in order to act adaptively. However, doing so involves the non-trivial challenge of deciding which aspects of a task domain to generalize. While it is sometimes appropriate to simply re-use a learned behavior, often adaptive generalization entails recombining distinct components of knowledge acquired across multiple contexts. Theoretical work has suggested a computational trade-off in which it can be more or less useful to learn and generalize aspects of task structure jointly or compositionally, depending on previous task statistics, but it is unknown whether humans modulate their generalization strategy accordingly. Here we develop a series of navigation tasks that separately manipulate the statistics of goal values (“what to do”) and state transitions (“how to do it”) across contexts and assess whether human subjects generalize these task components separately or conjunctively. We find that human generalization is sensitive to the statistics of the previously experienced task domain, favoring compositional or conjunctive generalization when the task statistics are indicative of such structures, and a mixture of the two when they are more ambiguous. These results support a normative “meta-generalization” account and suggests that people not only generalize previous task components but also generalize the statistical structure most likely to support generalization.
format Online
Article
Text
id pubmed-7179934
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71799342020-05-05 Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning Franklin, Nicholas T. Frank, Michael J. PLoS Comput Biol Research Article Humans routinely face novel environments in which they have to generalize in order to act adaptively. However, doing so involves the non-trivial challenge of deciding which aspects of a task domain to generalize. While it is sometimes appropriate to simply re-use a learned behavior, often adaptive generalization entails recombining distinct components of knowledge acquired across multiple contexts. Theoretical work has suggested a computational trade-off in which it can be more or less useful to learn and generalize aspects of task structure jointly or compositionally, depending on previous task statistics, but it is unknown whether humans modulate their generalization strategy accordingly. Here we develop a series of navigation tasks that separately manipulate the statistics of goal values (“what to do”) and state transitions (“how to do it”) across contexts and assess whether human subjects generalize these task components separately or conjunctively. We find that human generalization is sensitive to the statistics of the previously experienced task domain, favoring compositional or conjunctive generalization when the task statistics are indicative of such structures, and a mixture of the two when they are more ambiguous. These results support a normative “meta-generalization” account and suggests that people not only generalize previous task components but also generalize the statistical structure most likely to support generalization. Public Library of Science 2020-04-13 /pmc/articles/PMC7179934/ /pubmed/32282795 http://dx.doi.org/10.1371/journal.pcbi.1007720 Text en © 2020 Franklin, Frank http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Franklin, Nicholas T.
Frank, Michael J.
Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning
title Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning
title_full Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning
title_fullStr Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning
title_full_unstemmed Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning
title_short Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning
title_sort generalizing to generalize: humans flexibly switch between compositional and conjunctive structures during reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7179934/
https://www.ncbi.nlm.nih.gov/pubmed/32282795
http://dx.doi.org/10.1371/journal.pcbi.1007720
work_keys_str_mv AT franklinnicholast generalizingtogeneralizehumansflexiblyswitchbetweencompositionalandconjunctivestructuresduringreinforcementlearning
AT frankmichaelj generalizingtogeneralizehumansflexiblyswitchbetweencompositionalandconjunctivestructuresduringreinforcementlearning