Cargando…

Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making

Classic reinforcement learning (RL) theories cannot explain human behavior in the absence of external reward or when the environment changes. Here, we employ a deep sequential decision-making paradigm with sparse reward and abrupt environmental changes. To explain the behavior of human participants...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xu, He A., Modirshanechi, Alireza, Lehmann, Marco P., Gerstner, Wulfram, Herzog, Michael H.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8205159/ https://www.ncbi.nlm.nih.gov/pubmed/34081705 http://dx.doi.org/10.1371/journal.pcbi.1009070

_version_	1783708454385352704
author	Xu, He A. Modirshanechi, Alireza Lehmann, Marco P. Gerstner, Wulfram Herzog, Michael H.
author_facet	Xu, He A. Modirshanechi, Alireza Lehmann, Marco P. Gerstner, Wulfram Herzog, Michael H.
author_sort	Xu, He A.
collection	PubMed
description	Classic reinforcement learning (RL) theories cannot explain human behavior in the absence of external reward or when the environment changes. Here, we employ a deep sequential decision-making paradigm with sparse reward and abrupt environmental changes. To explain the behavior of human participants in these environments, we show that RL theories need to include surprise and novelty, each with a distinct role. While novelty drives exploration before the first encounter of a reward, surprise increases the rate of learning of a world-model as well as of model-free action-values. Even though the world-model is available for model-based RL, we find that human decisions are dominated by model-free action choices. The world-model is only marginally used for planning, but it is important to detect surprising events. Our theory predicts human action choices with high probability and allows us to dissociate surprise, novelty, and reward in EEG signals.
format	Online Article Text
id	pubmed-8205159
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-82051592021-06-29 Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making Xu, He A. Modirshanechi, Alireza Lehmann, Marco P. Gerstner, Wulfram Herzog, Michael H. PLoS Comput Biol Research Article Classic reinforcement learning (RL) theories cannot explain human behavior in the absence of external reward or when the environment changes. Here, we employ a deep sequential decision-making paradigm with sparse reward and abrupt environmental changes. To explain the behavior of human participants in these environments, we show that RL theories need to include surprise and novelty, each with a distinct role. While novelty drives exploration before the first encounter of a reward, surprise increases the rate of learning of a world-model as well as of model-free action-values. Even though the world-model is available for model-based RL, we find that human decisions are dominated by model-free action choices. The world-model is only marginally used for planning, but it is important to detect surprising events. Our theory predicts human action choices with high probability and allows us to dissociate surprise, novelty, and reward in EEG signals. Public Library of Science 2021-06-03 /pmc/articles/PMC8205159/ /pubmed/34081705 http://dx.doi.org/10.1371/journal.pcbi.1009070 Text en © 2021 Xu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Xu, He A. Modirshanechi, Alireza Lehmann, Marco P. Gerstner, Wulfram Herzog, Michael H. Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making
title	Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making
title_full	Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making
title_fullStr	Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making
title_full_unstemmed	Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making
title_short	Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making
title_sort	novelty is not surprise: human exploratory and adaptive behavior in sequential decision-making
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8205159/ https://www.ncbi.nlm.nih.gov/pubmed/34081705 http://dx.doi.org/10.1371/journal.pcbi.1009070
work_keys_str_mv	AT xuhea noveltyisnotsurprisehumanexploratoryandadaptivebehaviorinsequentialdecisionmaking AT modirshanechialireza noveltyisnotsurprisehumanexploratoryandadaptivebehaviorinsequentialdecisionmaking AT lehmannmarcop noveltyisnotsurprisehumanexploratoryandadaptivebehaviorinsequentialdecisionmaking AT gerstnerwulfram noveltyisnotsurprisehumanexploratoryandadaptivebehaviorinsequentialdecisionmaking AT herzogmichaelh noveltyisnotsurprisehumanexploratoryandadaptivebehaviorinsequentialdecisionmaking

Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making

Ejemplares similares