Cargando…

Interactively learning behavior trees from imperfect human demonstrations

Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most exis...

Descripción completa

Detalles Bibliográficos
Autores principales:	Scherf, Lisa, Schmidt, Aljoscha, Pal, Suman, Koert, Dorothea
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Robotics and AI
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368948/ https://www.ncbi.nlm.nih.gov/pubmed/37501742 http://dx.doi.org/10.3389/frobt.2023.1152595

_version_	1785077616281124864
author	Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea
author_facet	Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea
author_sort	Scherf, Lisa
collection	PubMed
description	Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most existing approaches that learn a BT from human demonstrations require the user to specify each action step-by-step or do not allow for adapting a learned BT without the need to repeat the entire teaching process from scratch. Method: We propose a new framework to directly learn a BT from only a few human task demonstrations recorded as RGB-D video streams. We automatically extract continuous pre- and post-conditions for BT action nodes from visual features and use a Backchaining approach to build a reactive BT. In a user study on how non-experts provide and vary demonstrations, we identify three common failure cases of an BT learned from potentially imperfect initial human demonstrations. We offer a way to interactively resolve these failure cases by refining the existing BT through interaction with a user over a web-interface. Specifically, failure cases or unknown states are detected automatically during the execution of a learned BT and the initial BT is adjusted or extended according to the provided user input. Evaluation and results: We evaluate our approach on a robotic trash disposal task with 20 human participants and demonstrate that our method is capable of learning reactive BTs from only a few human demonstrations and interactively resolving possible failure cases at runtime.
format	Online Article Text
id	pubmed-10368948
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-103689482023-07-27 Interactively learning behavior trees from imperfect human demonstrations Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea Front Robot AI Robotics and AI Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most existing approaches that learn a BT from human demonstrations require the user to specify each action step-by-step or do not allow for adapting a learned BT without the need to repeat the entire teaching process from scratch. Method: We propose a new framework to directly learn a BT from only a few human task demonstrations recorded as RGB-D video streams. We automatically extract continuous pre- and post-conditions for BT action nodes from visual features and use a Backchaining approach to build a reactive BT. In a user study on how non-experts provide and vary demonstrations, we identify three common failure cases of an BT learned from potentially imperfect initial human demonstrations. We offer a way to interactively resolve these failure cases by refining the existing BT through interaction with a user over a web-interface. Specifically, failure cases or unknown states are detected automatically during the execution of a learned BT and the initial BT is adjusted or extended according to the provided user input. Evaluation and results: We evaluate our approach on a robotic trash disposal task with 20 human participants and demonstrate that our method is capable of learning reactive BTs from only a few human demonstrations and interactively resolving possible failure cases at runtime. Frontiers Media S.A. 2023-07-12 /pmc/articles/PMC10368948/ /pubmed/37501742 http://dx.doi.org/10.3389/frobt.2023.1152595 Text en Copyright © 2023 Scherf, Schmidt, Pal and Koert. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Robotics and AI Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea Interactively learning behavior trees from imperfect human demonstrations
title	Interactively learning behavior trees from imperfect human demonstrations
title_full	Interactively learning behavior trees from imperfect human demonstrations
title_fullStr	Interactively learning behavior trees from imperfect human demonstrations
title_full_unstemmed	Interactively learning behavior trees from imperfect human demonstrations
title_short	Interactively learning behavior trees from imperfect human demonstrations
title_sort	interactively learning behavior trees from imperfect human demonstrations
topic	Robotics and AI
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368948/ https://www.ncbi.nlm.nih.gov/pubmed/37501742 http://dx.doi.org/10.3389/frobt.2023.1152595
work_keys_str_mv	AT scherflisa interactivelylearningbehaviortreesfromimperfecthumandemonstrations AT schmidtaljoscha interactivelylearningbehaviortreesfromimperfecthumandemonstrations AT palsuman interactivelylearningbehaviortreesfromimperfecthumandemonstrations AT koertdorothea interactivelylearningbehaviortreesfromimperfecthumandemonstrations

Interactively learning behavior trees from imperfect human demonstrations

Ejemplares similares