Cargando…

Interactively learning behavior trees from imperfect human demonstrations

Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most exis...

Descripción completa

Detalles Bibliográficos
Autores principales: Scherf, Lisa, Schmidt, Aljoscha, Pal, Suman, Koert, Dorothea
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368948/
https://www.ncbi.nlm.nih.gov/pubmed/37501742
http://dx.doi.org/10.3389/frobt.2023.1152595
_version_ 1785077616281124864
author Scherf, Lisa
Schmidt, Aljoscha
Pal, Suman
Koert, Dorothea
author_facet Scherf, Lisa
Schmidt, Aljoscha
Pal, Suman
Koert, Dorothea
author_sort Scherf, Lisa
collection PubMed
description Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most existing approaches that learn a BT from human demonstrations require the user to specify each action step-by-step or do not allow for adapting a learned BT without the need to repeat the entire teaching process from scratch. Method: We propose a new framework to directly learn a BT from only a few human task demonstrations recorded as RGB-D video streams. We automatically extract continuous pre- and post-conditions for BT action nodes from visual features and use a Backchaining approach to build a reactive BT. In a user study on how non-experts provide and vary demonstrations, we identify three common failure cases of an BT learned from potentially imperfect initial human demonstrations. We offer a way to interactively resolve these failure cases by refining the existing BT through interaction with a user over a web-interface. Specifically, failure cases or unknown states are detected automatically during the execution of a learned BT and the initial BT is adjusted or extended according to the provided user input. Evaluation and results: We evaluate our approach on a robotic trash disposal task with 20 human participants and demonstrate that our method is capable of learning reactive BTs from only a few human demonstrations and interactively resolving possible failure cases at runtime.
format Online
Article
Text
id pubmed-10368948
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103689482023-07-27 Interactively learning behavior trees from imperfect human demonstrations Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea Front Robot AI Robotics and AI Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most existing approaches that learn a BT from human demonstrations require the user to specify each action step-by-step or do not allow for adapting a learned BT without the need to repeat the entire teaching process from scratch. Method: We propose a new framework to directly learn a BT from only a few human task demonstrations recorded as RGB-D video streams. We automatically extract continuous pre- and post-conditions for BT action nodes from visual features and use a Backchaining approach to build a reactive BT. In a user study on how non-experts provide and vary demonstrations, we identify three common failure cases of an BT learned from potentially imperfect initial human demonstrations. We offer a way to interactively resolve these failure cases by refining the existing BT through interaction with a user over a web-interface. Specifically, failure cases or unknown states are detected automatically during the execution of a learned BT and the initial BT is adjusted or extended according to the provided user input. Evaluation and results: We evaluate our approach on a robotic trash disposal task with 20 human participants and demonstrate that our method is capable of learning reactive BTs from only a few human demonstrations and interactively resolving possible failure cases at runtime. Frontiers Media S.A. 2023-07-12 /pmc/articles/PMC10368948/ /pubmed/37501742 http://dx.doi.org/10.3389/frobt.2023.1152595 Text en Copyright © 2023 Scherf, Schmidt, Pal and Koert. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Scherf, Lisa
Schmidt, Aljoscha
Pal, Suman
Koert, Dorothea
Interactively learning behavior trees from imperfect human demonstrations
title Interactively learning behavior trees from imperfect human demonstrations
title_full Interactively learning behavior trees from imperfect human demonstrations
title_fullStr Interactively learning behavior trees from imperfect human demonstrations
title_full_unstemmed Interactively learning behavior trees from imperfect human demonstrations
title_short Interactively learning behavior trees from imperfect human demonstrations
title_sort interactively learning behavior trees from imperfect human demonstrations
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368948/
https://www.ncbi.nlm.nih.gov/pubmed/37501742
http://dx.doi.org/10.3389/frobt.2023.1152595
work_keys_str_mv AT scherflisa interactivelylearningbehaviortreesfromimperfecthumandemonstrations
AT schmidtaljoscha interactivelylearningbehaviortreesfromimperfecthumandemonstrations
AT palsuman interactivelylearningbehaviortreesfromimperfecthumandemonstrations
AT koertdorothea interactivelylearningbehaviortreesfromimperfecthumandemonstrations