Cargando…
Interactively learning behavior trees from imperfect human demonstrations
Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most exis...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368948/ https://www.ncbi.nlm.nih.gov/pubmed/37501742 http://dx.doi.org/10.3389/frobt.2023.1152595 |
_version_ | 1785077616281124864 |
---|---|
author | Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea |
author_facet | Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea |
author_sort | Scherf, Lisa |
collection | PubMed |
description | Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most existing approaches that learn a BT from human demonstrations require the user to specify each action step-by-step or do not allow for adapting a learned BT without the need to repeat the entire teaching process from scratch. Method: We propose a new framework to directly learn a BT from only a few human task demonstrations recorded as RGB-D video streams. We automatically extract continuous pre- and post-conditions for BT action nodes from visual features and use a Backchaining approach to build a reactive BT. In a user study on how non-experts provide and vary demonstrations, we identify three common failure cases of an BT learned from potentially imperfect initial human demonstrations. We offer a way to interactively resolve these failure cases by refining the existing BT through interaction with a user over a web-interface. Specifically, failure cases or unknown states are detected automatically during the execution of a learned BT and the initial BT is adjusted or extended according to the provided user input. Evaluation and results: We evaluate our approach on a robotic trash disposal task with 20 human participants and demonstrate that our method is capable of learning reactive BTs from only a few human demonstrations and interactively resolving possible failure cases at runtime. |
format | Online Article Text |
id | pubmed-10368948 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-103689482023-07-27 Interactively learning behavior trees from imperfect human demonstrations Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea Front Robot AI Robotics and AI Introduction: In Interactive Task Learning (ITL), an agent learns a new task through natural interaction with a human instructor. Behavior Trees (BTs) offer a reactive, modular, and interpretable way of encoding task descriptions but have not yet been applied a lot in robotic ITL settings. Most existing approaches that learn a BT from human demonstrations require the user to specify each action step-by-step or do not allow for adapting a learned BT without the need to repeat the entire teaching process from scratch. Method: We propose a new framework to directly learn a BT from only a few human task demonstrations recorded as RGB-D video streams. We automatically extract continuous pre- and post-conditions for BT action nodes from visual features and use a Backchaining approach to build a reactive BT. In a user study on how non-experts provide and vary demonstrations, we identify three common failure cases of an BT learned from potentially imperfect initial human demonstrations. We offer a way to interactively resolve these failure cases by refining the existing BT through interaction with a user over a web-interface. Specifically, failure cases or unknown states are detected automatically during the execution of a learned BT and the initial BT is adjusted or extended according to the provided user input. Evaluation and results: We evaluate our approach on a robotic trash disposal task with 20 human participants and demonstrate that our method is capable of learning reactive BTs from only a few human demonstrations and interactively resolving possible failure cases at runtime. Frontiers Media S.A. 2023-07-12 /pmc/articles/PMC10368948/ /pubmed/37501742 http://dx.doi.org/10.3389/frobt.2023.1152595 Text en Copyright © 2023 Scherf, Schmidt, Pal and Koert. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Scherf, Lisa Schmidt, Aljoscha Pal, Suman Koert, Dorothea Interactively learning behavior trees from imperfect human demonstrations |
title | Interactively learning behavior trees from imperfect human demonstrations |
title_full | Interactively learning behavior trees from imperfect human demonstrations |
title_fullStr | Interactively learning behavior trees from imperfect human demonstrations |
title_full_unstemmed | Interactively learning behavior trees from imperfect human demonstrations |
title_short | Interactively learning behavior trees from imperfect human demonstrations |
title_sort | interactively learning behavior trees from imperfect human demonstrations |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368948/ https://www.ncbi.nlm.nih.gov/pubmed/37501742 http://dx.doi.org/10.3389/frobt.2023.1152595 |
work_keys_str_mv | AT scherflisa interactivelylearningbehaviortreesfromimperfecthumandemonstrations AT schmidtaljoscha interactivelylearningbehaviortreesfromimperfecthumandemonstrations AT palsuman interactivelylearningbehaviortreesfromimperfecthumandemonstrations AT koertdorothea interactivelylearningbehaviortreesfromimperfecthumandemonstrations |