Cargando…

Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots

A reinforcement learning agent that autonomously explores its environment can utilize a curiosity drive to enable continual learning of skills, in the absence of any external rewards. We formulate curiosity-driven exploration, and eventual skill acquisition, as a selective sampling problem. Each env...

Descripción completa

Detalles Bibliográficos
Autores principales: Ngo, Hung, Luciw, Matthew, Förster, Alexander, Schmidhuber, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3840616/
https://www.ncbi.nlm.nih.gov/pubmed/24324448
http://dx.doi.org/10.3389/fpsyg.2013.00833
_version_ 1782478535626063872
author Ngo, Hung
Luciw, Matthew
Förster, Alexander
Schmidhuber, Jürgen
author_facet Ngo, Hung
Luciw, Matthew
Förster, Alexander
Schmidhuber, Jürgen
author_sort Ngo, Hung
collection PubMed
description A reinforcement learning agent that autonomously explores its environment can utilize a curiosity drive to enable continual learning of skills, in the absence of any external rewards. We formulate curiosity-driven exploration, and eventual skill acquisition, as a selective sampling problem. Each environment setting provides the agent with a stream of instances. An instance is a sensory observation that, when queried, causes an outcome that the agent is trying to predict. After an instance is observed, a query condition, derived herein, tells whether its outcome is statistically known or unknown to the agent, based on the confidence interval of an online linear classifier. Upon encountering the first unknown instance, the agent “queries” the environment to observe the outcome, which is expected to improve its confidence in the corresponding predictor. If the environment is in a setting where all instances are known, the agent generates a plan of actions to reach a new setting, where an unknown instance is likely to be encountered. The desired setting is a self-generated goal, and the plan of action, essentially a program to solve a problem, is a skill. The success of the plan depends on the quality of the agent's predictors, which are improved as mentioned above. For validation, this method is applied to both a simulated and real Katana robot arm in its “blocks-world” environment. Results show that the proposed method generates sample-efficient curious exploration behavior, which exhibits developmental stages, continual learning, and skill acquisition, in an intrinsically-motivated playful agent.
format Online
Article
Text
id pubmed-3840616
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-38406162013-12-09 Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots Ngo, Hung Luciw, Matthew Förster, Alexander Schmidhuber, Jürgen Front Psychol Psychology A reinforcement learning agent that autonomously explores its environment can utilize a curiosity drive to enable continual learning of skills, in the absence of any external rewards. We formulate curiosity-driven exploration, and eventual skill acquisition, as a selective sampling problem. Each environment setting provides the agent with a stream of instances. An instance is a sensory observation that, when queried, causes an outcome that the agent is trying to predict. After an instance is observed, a query condition, derived herein, tells whether its outcome is statistically known or unknown to the agent, based on the confidence interval of an online linear classifier. Upon encountering the first unknown instance, the agent “queries” the environment to observe the outcome, which is expected to improve its confidence in the corresponding predictor. If the environment is in a setting where all instances are known, the agent generates a plan of actions to reach a new setting, where an unknown instance is likely to be encountered. The desired setting is a self-generated goal, and the plan of action, essentially a program to solve a problem, is a skill. The success of the plan depends on the quality of the agent's predictors, which are improved as mentioned above. For validation, this method is applied to both a simulated and real Katana robot arm in its “blocks-world” environment. Results show that the proposed method generates sample-efficient curious exploration behavior, which exhibits developmental stages, continual learning, and skill acquisition, in an intrinsically-motivated playful agent. Frontiers Media S.A. 2013-11-26 /pmc/articles/PMC3840616/ /pubmed/24324448 http://dx.doi.org/10.3389/fpsyg.2013.00833 Text en Copyright © 2013 Ngo, Luciw, Förster and Schmidhuber. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Ngo, Hung
Luciw, Matthew
Förster, Alexander
Schmidhuber, Jürgen
Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
title Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
title_full Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
title_fullStr Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
title_full_unstemmed Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
title_short Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
title_sort confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3840616/
https://www.ncbi.nlm.nih.gov/pubmed/24324448
http://dx.doi.org/10.3389/fpsyg.2013.00833
work_keys_str_mv AT ngohung confidencebasedprogressdrivenselfgeneratedgoalsforskillacquisitionindevelopmentalrobots
AT luciwmatthew confidencebasedprogressdrivenselfgeneratedgoalsforskillacquisitionindevelopmentalrobots
AT forsteralexander confidencebasedprogressdrivenselfgeneratedgoalsforskillacquisitionindevelopmentalrobots
AT schmidhuberjurgen confidencebasedprogressdrivenselfgeneratedgoalsforskillacquisitionindevelopmentalrobots