Cargando…
The Computational Development of Reinforcement Learning during Adolescence
Adolescence is a period of life characterised by changes in learning and decision-making. Learning and decision-making do not rely on a unitary system, but instead require the coordination of different cognitive processes that can be mathematically formalised as dissociable computational modules. He...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4920542/ https://www.ncbi.nlm.nih.gov/pubmed/27322574 http://dx.doi.org/10.1371/journal.pcbi.1004953 |
_version_ | 1782439412706050048 |
---|---|
author | Palminteri, Stefano Kilford, Emma J. Coricelli, Giorgio Blakemore, Sarah-Jayne |
author_facet | Palminteri, Stefano Kilford, Emma J. Coricelli, Giorgio Blakemore, Sarah-Jayne |
author_sort | Palminteri, Stefano |
collection | PubMed |
description | Adolescence is a period of life characterised by changes in learning and decision-making. Learning and decision-making do not rely on a unitary system, but instead require the coordination of different cognitive processes that can be mathematically formalised as dissociable computational modules. Here, we aimed to trace the developmental time-course of the computational modules responsible for learning from reward or punishment, and learning from counterfactual feedback. Adolescents and adults carried out a novel reinforcement learning paradigm in which participants learned the association between cues and probabilistic outcomes, where the outcomes differed in valence (reward versus punishment) and feedback was either partial or complete (either the outcome of the chosen option only, or the outcomes of both the chosen and unchosen option, were displayed). Computational strategies changed during development: whereas adolescents’ behaviour was better explained by a basic reinforcement learning algorithm, adults’ behaviour integrated increasingly complex computational features, namely a counterfactual learning module (enabling enhanced performance in the presence of complete feedback) and a value contextualisation module (enabling symmetrical reward and punishment learning). Unlike adults, adolescent performance did not benefit from counterfactual (complete) feedback. In addition, while adults learned symmetrically from both reward and punishment, adolescents learned from reward but were less likely to learn from punishment. This tendency to rely on rewards and not to consider alternative consequences of actions might contribute to our understanding of decision-making in adolescence. |
format | Online Article Text |
id | pubmed-4920542 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-49205422016-07-18 The Computational Development of Reinforcement Learning during Adolescence Palminteri, Stefano Kilford, Emma J. Coricelli, Giorgio Blakemore, Sarah-Jayne PLoS Comput Biol Research Article Adolescence is a period of life characterised by changes in learning and decision-making. Learning and decision-making do not rely on a unitary system, but instead require the coordination of different cognitive processes that can be mathematically formalised as dissociable computational modules. Here, we aimed to trace the developmental time-course of the computational modules responsible for learning from reward or punishment, and learning from counterfactual feedback. Adolescents and adults carried out a novel reinforcement learning paradigm in which participants learned the association between cues and probabilistic outcomes, where the outcomes differed in valence (reward versus punishment) and feedback was either partial or complete (either the outcome of the chosen option only, or the outcomes of both the chosen and unchosen option, were displayed). Computational strategies changed during development: whereas adolescents’ behaviour was better explained by a basic reinforcement learning algorithm, adults’ behaviour integrated increasingly complex computational features, namely a counterfactual learning module (enabling enhanced performance in the presence of complete feedback) and a value contextualisation module (enabling symmetrical reward and punishment learning). Unlike adults, adolescent performance did not benefit from counterfactual (complete) feedback. In addition, while adults learned symmetrically from both reward and punishment, adolescents learned from reward but were less likely to learn from punishment. This tendency to rely on rewards and not to consider alternative consequences of actions might contribute to our understanding of decision-making in adolescence. Public Library of Science 2016-06-20 /pmc/articles/PMC4920542/ /pubmed/27322574 http://dx.doi.org/10.1371/journal.pcbi.1004953 Text en © 2016 Palminteri et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Palminteri, Stefano Kilford, Emma J. Coricelli, Giorgio Blakemore, Sarah-Jayne The Computational Development of Reinforcement Learning during Adolescence |
title | The Computational Development of Reinforcement Learning during Adolescence |
title_full | The Computational Development of Reinforcement Learning during Adolescence |
title_fullStr | The Computational Development of Reinforcement Learning during Adolescence |
title_full_unstemmed | The Computational Development of Reinforcement Learning during Adolescence |
title_short | The Computational Development of Reinforcement Learning during Adolescence |
title_sort | computational development of reinforcement learning during adolescence |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4920542/ https://www.ncbi.nlm.nih.gov/pubmed/27322574 http://dx.doi.org/10.1371/journal.pcbi.1004953 |
work_keys_str_mv | AT palminteristefano thecomputationaldevelopmentofreinforcementlearningduringadolescence AT kilfordemmaj thecomputationaldevelopmentofreinforcementlearningduringadolescence AT coricelligiorgio thecomputationaldevelopmentofreinforcementlearningduringadolescence AT blakemoresarahjayne thecomputationaldevelopmentofreinforcementlearningduringadolescence AT palminteristefano computationaldevelopmentofreinforcementlearningduringadolescence AT kilfordemmaj computationaldevelopmentofreinforcementlearningduringadolescence AT coricelligiorgio computationaldevelopmentofreinforcementlearningduringadolescence AT blakemoresarahjayne computationaldevelopmentofreinforcementlearningduringadolescence |