Cargando…

The Missing Link Between Memory and Reinforcement Learning

Reinforcement learning systems usually assume that a value function is defined over all states (or state-action pairs) that can immediately give the value of a particular state or action. These values are used by a selection mechanism to decide which action to take. In contrast, when humans and anim...

Descripción completa

Detalles Bibliográficos
Autores principales:	Balkenius, Christian, Tjøstheim, Trond A., Johansson, Birger, Wallin, Annika, Gärdenfors, Peter
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2020
Materias:	Psychology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7758424/ https://www.ncbi.nlm.nih.gov/pubmed/33362625 http://dx.doi.org/10.3389/fpsyg.2020.560080

_version_	1783626936421974016
author	Balkenius, Christian Tjøstheim, Trond A. Johansson, Birger Wallin, Annika Gärdenfors, Peter
author_facet	Balkenius, Christian Tjøstheim, Trond A. Johansson, Birger Wallin, Annika Gärdenfors, Peter
author_sort	Balkenius, Christian
collection	PubMed
description	Reinforcement learning systems usually assume that a value function is defined over all states (or state-action pairs) that can immediately give the value of a particular state or action. These values are used by a selection mechanism to decide which action to take. In contrast, when humans and animals make decisions, they collect evidence for different alternatives over time and take action only when sufficient evidence has been accumulated. We have previously developed a model of memory processing that includes semantic, episodic and working memory in a comprehensive architecture. Here, we describe how this memory mechanism can support decision making when the alternatives cannot be evaluated based on immediate sensory information alone. Instead we first imagine, and then evaluate a possible future that will result from choosing one of the alternatives. Here we present an extended model that can be used as a model for decision making that depends on accumulating evidence over time, whether that information comes from the sequential attention to different sensory properties or from internal simulation of the consequences of making a particular choice. We show how the new model explains both simple immediate choices, choices that depend on multiple sensory factors and complicated selections between alternatives that require forward looking simulations based on episodic and semantic memory structures. In this framework, vicarious trial and error is explained as an internal simulation that accumulates evidence for a particular choice. We argue that a system like this forms the “missing link” between more traditional ideas of semantic and episodic memory, and the associative nature of reinforcement learning.
format	Online Article Text
id	pubmed-7758424
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-77584242020-12-25 The Missing Link Between Memory and Reinforcement Learning Balkenius, Christian Tjøstheim, Trond A. Johansson, Birger Wallin, Annika Gärdenfors, Peter Front Psychol Psychology Reinforcement learning systems usually assume that a value function is defined over all states (or state-action pairs) that can immediately give the value of a particular state or action. These values are used by a selection mechanism to decide which action to take. In contrast, when humans and animals make decisions, they collect evidence for different alternatives over time and take action only when sufficient evidence has been accumulated. We have previously developed a model of memory processing that includes semantic, episodic and working memory in a comprehensive architecture. Here, we describe how this memory mechanism can support decision making when the alternatives cannot be evaluated based on immediate sensory information alone. Instead we first imagine, and then evaluate a possible future that will result from choosing one of the alternatives. Here we present an extended model that can be used as a model for decision making that depends on accumulating evidence over time, whether that information comes from the sequential attention to different sensory properties or from internal simulation of the consequences of making a particular choice. We show how the new model explains both simple immediate choices, choices that depend on multiple sensory factors and complicated selections between alternatives that require forward looking simulations based on episodic and semantic memory structures. In this framework, vicarious trial and error is explained as an internal simulation that accumulates evidence for a particular choice. We argue that a system like this forms the “missing link” between more traditional ideas of semantic and episodic memory, and the associative nature of reinforcement learning. Frontiers Media S.A. 2020-12-10 /pmc/articles/PMC7758424/ /pubmed/33362625 http://dx.doi.org/10.3389/fpsyg.2020.560080 Text en Copyright © 2020 Balkenius, Tjøstheim, Johansson, Wallin and Gärdenfors. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Psychology Balkenius, Christian Tjøstheim, Trond A. Johansson, Birger Wallin, Annika Gärdenfors, Peter The Missing Link Between Memory and Reinforcement Learning
title	The Missing Link Between Memory and Reinforcement Learning
title_full	The Missing Link Between Memory and Reinforcement Learning
title_fullStr	The Missing Link Between Memory and Reinforcement Learning
title_full_unstemmed	The Missing Link Between Memory and Reinforcement Learning
title_short	The Missing Link Between Memory and Reinforcement Learning
title_sort	missing link between memory and reinforcement learning
topic	Psychology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7758424/ https://www.ncbi.nlm.nih.gov/pubmed/33362625 http://dx.doi.org/10.3389/fpsyg.2020.560080
work_keys_str_mv	AT balkeniuschristian themissinglinkbetweenmemoryandreinforcementlearning AT tjøstheimtronda themissinglinkbetweenmemoryandreinforcementlearning AT johanssonbirger themissinglinkbetweenmemoryandreinforcementlearning AT wallinannika themissinglinkbetweenmemoryandreinforcementlearning AT gardenforspeter themissinglinkbetweenmemoryandreinforcementlearning AT balkeniuschristian missinglinkbetweenmemoryandreinforcementlearning AT tjøstheimtronda missinglinkbetweenmemoryandreinforcementlearning AT johanssonbirger missinglinkbetweenmemoryandreinforcementlearning AT wallinannika missinglinkbetweenmemoryandreinforcementlearning AT gardenforspeter missinglinkbetweenmemoryandreinforcementlearning

The Missing Link Between Memory and Reinforcement Learning

Ejemplares similares