Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network
Field | Value
---|---
Main authors: | Matsumoto, Takazumi; Tani, Jun
Format: | Online Article Text
Language: | English
Published: | MDPI, 2020
Subjects: | Article
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517093/ https://www.ncbi.nlm.nih.gov/pubmed/33286336 http://dx.doi.org/10.3390/e22050564
Field | Value
---|---
_version_ | 1783587150754742272
author | Matsumoto, Takazumi; Tani, Jun
author_facet | Matsumoto, Takazumi; Tani, Jun
author_sort | Matsumoto, Takazumi |
collection | PubMed |
description | It is crucial to ask how agents can achieve goals by generating action plans using only partial models of the world acquired through habituated sensory-motor experiences. Although many existing robotics studies use a forward model framework, there are generalization issues with high degrees of freedom. The current study shows that the predictive coding (PC) and active inference (AIF) frameworks, which employ a generative model, can develop better generalization by learning a prior distribution in a low dimensional latent state space representing probabilistic structures extracted from well habituated sensory-motor trajectories. In our proposed model, learning is carried out by inferring optimal latent variables as well as synaptic weights for maximizing the evidence lower bound, while goal-directed planning is accomplished by inferring latent variables for maximizing the estimated lower bound. Our proposed model was evaluated with both simple and complex robotic tasks in simulation, which demonstrated sufficient generalization in learning with limited training data by setting an intermediate value for a regularization coefficient. Furthermore, comparative simulation results show that the proposed model outperforms a conventional forward model in goal-directed planning, due to the learned prior confining the search of motor plans within the range of habituated trajectories. |
format | Online Article Text |
id | pubmed-7517093 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75170932020-11-09 Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network Matsumoto, Takazumi Tani, Jun Entropy (Basel) Article It is crucial to ask how agents can achieve goals by generating action plans using only partial models of the world acquired through habituated sensory-motor experiences. Although many existing robotics studies use a forward model framework, there are generalization issues with high degrees of freedom. The current study shows that the predictive coding (PC) and active inference (AIF) frameworks, which employ a generative model, can develop better generalization by learning a prior distribution in a low dimensional latent state space representing probabilistic structures extracted from well habituated sensory-motor trajectories. In our proposed model, learning is carried out by inferring optimal latent variables as well as synaptic weights for maximizing the evidence lower bound, while goal-directed planning is accomplished by inferring latent variables for maximizing the estimated lower bound. Our proposed model was evaluated with both simple and complex robotic tasks in simulation, which demonstrated sufficient generalization in learning with limited training data by setting an intermediate value for a regularization coefficient. Furthermore, comparative simulation results show that the proposed model outperforms a conventional forward model in goal-directed planning, due to the learned prior confining the search of motor plans within the range of habituated trajectories. MDPI 2020-05-18 /pmc/articles/PMC7517093/ /pubmed/33286336 http://dx.doi.org/10.3390/e22050564 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Matsumoto, Takazumi Tani, Jun Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network |
title | Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network |
title_full | Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network |
title_fullStr | Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network |
title_full_unstemmed | Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network |
title_short | Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network |
title_sort | goal-directed planning for habituated agents by active inference using a variational recurrent neural network |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517093/ https://www.ncbi.nlm.nih.gov/pubmed/33286336 http://dx.doi.org/10.3390/e22050564 |
work_keys_str_mv | AT matsumototakazumi goaldirectedplanningforhabituatedagentsbyactiveinferenceusingavariationalrecurrentneuralnetwork AT tanijun goaldirectedplanningforhabituatedagentsbyactiveinferenceusingavariationalrecurrentneuralnetwork |
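The description field above states that learning maximizes the evidence lower bound while a regularization coefficient controls generalization, and that goal-directed planning infers latent variables that maximize the estimated lower bound. The record itself contains no equations; the following is a minimal sketch of a weighted evidence lower bound of that general form. The symbol w for the regularization coefficient and the per-timestep factorization over a trajectory x with latent states z are illustrative assumptions, not taken from the article.

```latex
% Sketch of a weighted evidence lower bound (ELBO) for a variational RNN.
% w stands for the regularization coefficient mentioned in the description;
% the per-timestep factorization is an illustrative assumption.
\mathcal{L}(\phi, \theta) \;=\; \sum_{t=1}^{T}
  \Big(
    \mathbb{E}_{q_\phi(z_t \mid x_{1:T})}\!\big[\log p_\theta(x_t \mid z_t)\big]
    \;-\; w \, D_{\mathrm{KL}}\!\big(q_\phi(z_t \mid x_{1:T}) \,\|\, p_\theta(z_t \mid z_{t-1})\big)
  \Big)
```

Under this reading, learning adjusts both the network weights (theta) and the approximate posterior parameters (phi) to maximize the bound, whereas goal-directed planning keeps the weights fixed, clamps the final observation to the goal, and searches only over the latent variables, so that candidate motor plans remain within the prior learned from habituated trajectories.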