Cargando…
Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task
When making a choice with limited information, we explore new features through trial-and-error to learn how they are related. However, few studies have investigated exploratory behaviour when information is limited. In this study, we address, at both the behavioural and neural level, how, when, and...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5732284/ https://www.ncbi.nlm.nih.gov/pubmed/29247192 http://dx.doi.org/10.1038/s41598-017-17687-2 |
_version_ | 1783286660409065472 |
---|---|
author | Choung, Oh-hyeon Lee, Sang Wan Jeong, Yong |
author_facet | Choung, Oh-hyeon Lee, Sang Wan Jeong, Yong |
author_sort | Choung, Oh-hyeon |
collection | PubMed |
description | When making a choice with limited information, we explore new features through trial-and-error to learn how they are related. However, few studies have investigated exploratory behaviour when information is limited. In this study, we address, at both the behavioural and neural level, how, when, and why humans explore new feature dimensions to learn a new policy for choosing a state-space. We designed a novel multi-dimensional reinforcement learning task to encourage participants to explore and learn new features, then used a reinforcement learning algorithm to model policy exploration and learning behaviour. Our results provide the first evidence that, when humans explore new feature dimensions, their values are transferred from the previous policy to the new online (active) policy, as opposed to being learned from scratch. We further demonstrated that exploration may be regulated by the level of cognitive ambiguity, and that this process might be controlled by the frontopolar cortex. This opens up new possibilities of further understanding how humans explore new features in an open-space with limited information. |
format | Online Article Text |
id | pubmed-5732284 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-57322842017-12-21 Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task Choung, Oh-hyeon Lee, Sang Wan Jeong, Yong Sci Rep Article When making a choice with limited information, we explore new features through trial-and-error to learn how they are related. However, few studies have investigated exploratory behaviour when information is limited. In this study, we address, at both the behavioural and neural level, how, when, and why humans explore new feature dimensions to learn a new policy for choosing a state-space. We designed a novel multi-dimensional reinforcement learning task to encourage participants to explore and learn new features, then used a reinforcement learning algorithm to model policy exploration and learning behaviour. Our results provide the first evidence that, when humans explore new feature dimensions, their values are transferred from the previous policy to the new online (active) policy, as opposed to being learned from scratch. We further demonstrated that exploration may be regulated by the level of cognitive ambiguity, and that this process might be controlled by the frontopolar cortex. This opens up new possibilities of further understanding how humans explore new features in an open-space with limited information. Nature Publishing Group UK 2017-12-15 /pmc/articles/PMC5732284/ /pubmed/29247192 http://dx.doi.org/10.1038/s41598-017-17687-2 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Choung, Oh-hyeon Lee, Sang Wan Jeong, Yong Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task |
title | Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task |
title_full | Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task |
title_fullStr | Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task |
title_full_unstemmed | Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task |
title_short | Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task |
title_sort | exploring feature dimensions to learn a new policy in an uninformed reinforcement learning task |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5732284/ https://www.ncbi.nlm.nih.gov/pubmed/29247192 http://dx.doi.org/10.1038/s41598-017-17687-2 |
work_keys_str_mv | AT choungohhyeon exploringfeaturedimensionstolearnanewpolicyinanuninformedreinforcementlearningtask AT leesangwan exploringfeaturedimensionstolearnanewpolicyinanuninformedreinforcementlearningtask AT jeongyong exploringfeaturedimensionstolearnanewpolicyinanuninformedreinforcementlearningtask |