Cargando…

Generating Adaptive Behaviour within a Memory-Prediction Framework

The Memory-Prediction Framework (MPF) and its Hierarchical-Temporal Memory implementation (HTM) have been widely applied to unsupervised learning problems, for both classification and prediction. To date, there has been no attempt to incorporate MPF/HTM in reinforcement learning or other adaptive sy...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rawlinson, David, Kowadlo, Gideon
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2012
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3260147/ https://www.ncbi.nlm.nih.gov/pubmed/22272231 http://dx.doi.org/10.1371/journal.pone.0029264

_version_	1782221444856414208
author	Rawlinson, David Kowadlo, Gideon
author_facet	Rawlinson, David Kowadlo, Gideon
author_sort	Rawlinson, David
collection	PubMed
description	The Memory-Prediction Framework (MPF) and its Hierarchical-Temporal Memory implementation (HTM) have been widely applied to unsupervised learning problems, for both classification and prediction. To date, there has been no attempt to incorporate MPF/HTM in reinforcement learning or other adaptive systems; that is, to use knowledge embodied within the hierarchy to control a system, or to generate behaviour for an agent. This problem is interesting because the human neocortex is believed to play a vital role in the generation of behaviour, and the MPF is a model of the human neocortex. We propose some simple and biologically-plausible enhancements to the Memory-Prediction Framework. These cause it to explore and interact with an external world, while trying to maximize a continuous, time-varying reward function. All behaviour is generated and controlled within the MPF hierarchy. The hierarchy develops from a random initial configuration by interaction with the world and reinforcement learning only. Among other demonstrations, we show that a 2-node hierarchy can learn to successfully play “rocks, paper, scissors” against a predictable opponent.
format	Online Article Text
id	pubmed-3260147
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-32601472012-01-23 Generating Adaptive Behaviour within a Memory-Prediction Framework Rawlinson, David Kowadlo, Gideon PLoS One Research Article The Memory-Prediction Framework (MPF) and its Hierarchical-Temporal Memory implementation (HTM) have been widely applied to unsupervised learning problems, for both classification and prediction. To date, there has been no attempt to incorporate MPF/HTM in reinforcement learning or other adaptive systems; that is, to use knowledge embodied within the hierarchy to control a system, or to generate behaviour for an agent. This problem is interesting because the human neocortex is believed to play a vital role in the generation of behaviour, and the MPF is a model of the human neocortex. We propose some simple and biologically-plausible enhancements to the Memory-Prediction Framework. These cause it to explore and interact with an external world, while trying to maximize a continuous, time-varying reward function. All behaviour is generated and controlled within the MPF hierarchy. The hierarchy develops from a random initial configuration by interaction with the world and reinforcement learning only. Among other demonstrations, we show that a 2-node hierarchy can learn to successfully play “rocks, paper, scissors” against a predictable opponent. Public Library of Science 2012-01-17 /pmc/articles/PMC3260147/ /pubmed/22272231 http://dx.doi.org/10.1371/journal.pone.0029264 Text en Rawlinson, Kowadlo. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Rawlinson, David Kowadlo, Gideon Generating Adaptive Behaviour within a Memory-Prediction Framework
title	Generating Adaptive Behaviour within a Memory-Prediction Framework
title_full	Generating Adaptive Behaviour within a Memory-Prediction Framework
title_fullStr	Generating Adaptive Behaviour within a Memory-Prediction Framework
title_full_unstemmed	Generating Adaptive Behaviour within a Memory-Prediction Framework
title_short	Generating Adaptive Behaviour within a Memory-Prediction Framework
title_sort	generating adaptive behaviour within a memory-prediction framework
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3260147/ https://www.ncbi.nlm.nih.gov/pubmed/22272231 http://dx.doi.org/10.1371/journal.pone.0029264
work_keys_str_mv	AT rawlinsondavid generatingadaptivebehaviourwithinamemorypredictionframework AT kowadlogideon generatingadaptivebehaviourwithinamemorypredictionframework

Generating Adaptive Behaviour within a Memory-Prediction Framework

Ejemplares similares