Cargando…

Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning

Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifi...

Descripción completa

Detalles Bibliográficos
Autores principales: Chalvatzaki, Georgia, Younes, Ali, Nandha, Daljeet, Le, An Thai, Ribeiro, Leonardo F. R., Gurevych, Iryna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464606/
https://www.ncbi.nlm.nih.gov/pubmed/37649810
http://dx.doi.org/10.3389/frobt.2023.1221739
_version_ 1785098505741664256
author Chalvatzaki, Georgia
Younes, Ali
Nandha, Daljeet
Le, An Thai
Ribeiro, Leonardo F. R.
Gurevych, Iryna
author_facet Chalvatzaki, Georgia
Younes, Ali
Nandha, Daljeet
Le, An Thai
Ribeiro, Leonardo F. R.
Gurevych, Iryna
author_sort Chalvatzaki, Georgia
collection PubMed
description Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain that is represented as a scene graph, enabling it to translate human requests into executable robot plans, thereby learning to reason over long-horizon tasks, as encountered in the ALFRED benchmark. We compare our approach with classical planning and baseline methods to examine the applicability and generalizability of LLM-based planners. Our findings suggest that the knowledge stored in an LLM can be effectively grounded to perform long-horizon task planning, demonstrating the promising potential for the future application of neuro-symbolic planning methods in robotics.
format Online
Article
Text
id pubmed-10464606
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104646062023-08-30 Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna Front Robot AI Robotics and AI Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain that is represented as a scene graph, enabling it to translate human requests into executable robot plans, thereby learning to reason over long-horizon tasks, as encountered in the ALFRED benchmark. We compare our approach with classical planning and baseline methods to examine the applicability and generalizability of LLM-based planners. Our findings suggest that the knowledge stored in an LLM can be effectively grounded to perform long-horizon task planning, demonstrating the promising potential for the future application of neuro-symbolic planning methods in robotics. Frontiers Media S.A. 2023-08-15 /pmc/articles/PMC10464606/ /pubmed/37649810 http://dx.doi.org/10.3389/frobt.2023.1221739 Text en Copyright © 2023 Chalvatzaki, Younes, Nandha, Le, Ribeiro and Gurevych. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Chalvatzaki, Georgia
Younes, Ali
Nandha, Daljeet
Le, An Thai
Ribeiro, Leonardo F. R.
Gurevych, Iryna
Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_full Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_fullStr Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_full_unstemmed Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_short Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_sort learning to reason over scene graphs: a case study of finetuning gpt-2 into a robot language model for grounded task planning
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464606/
https://www.ncbi.nlm.nih.gov/pubmed/37649810
http://dx.doi.org/10.3389/frobt.2023.1221739
work_keys_str_mv AT chalvatzakigeorgia learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning
AT younesali learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning
AT nandhadaljeet learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning
AT leanthai learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning
AT ribeiroleonardofr learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning
AT gurevychiryna learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning