Cargando…

Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning

Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chalvatzaki, Georgia, Younes, Ali, Nandha, Daljeet, Le, An Thai, Ribeiro, Leonardo F. R., Gurevych, Iryna
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Robotics and AI
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464606/ https://www.ncbi.nlm.nih.gov/pubmed/37649810 http://dx.doi.org/10.3389/frobt.2023.1221739

_version_	1785098505741664256
author	Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna
author_facet	Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna
author_sort	Chalvatzaki, Georgia
collection	PubMed
description	Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain that is represented as a scene graph, enabling it to translate human requests into executable robot plans, thereby learning to reason over long-horizon tasks, as encountered in the ALFRED benchmark. We compare our approach with classical planning and baseline methods to examine the applicability and generalizability of LLM-based planners. Our findings suggest that the knowledge stored in an LLM can be effectively grounded to perform long-horizon task planning, demonstrating the promising potential for the future application of neuro-symbolic planning methods in robotics.
format	Online Article Text
id	pubmed-10464606
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-104646062023-08-30 Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna Front Robot AI Robotics and AI Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain that is represented as a scene graph, enabling it to translate human requests into executable robot plans, thereby learning to reason over long-horizon tasks, as encountered in the ALFRED benchmark. We compare our approach with classical planning and baseline methods to examine the applicability and generalizability of LLM-based planners. Our findings suggest that the knowledge stored in an LLM can be effectively grounded to perform long-horizon task planning, demonstrating the promising potential for the future application of neuro-symbolic planning methods in robotics. Frontiers Media S.A. 2023-08-15 /pmc/articles/PMC10464606/ /pubmed/37649810 http://dx.doi.org/10.3389/frobt.2023.1221739 Text en Copyright © 2023 Chalvatzaki, Younes, Nandha, Le, Ribeiro and Gurevych. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Robotics and AI Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title	Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_full	Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_fullStr	Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_full_unstemmed	Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_short	Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
title_sort	learning to reason over scene graphs: a case study of finetuning gpt-2 into a robot language model for grounded task planning
topic	Robotics and AI
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464606/ https://www.ncbi.nlm.nih.gov/pubmed/37649810 http://dx.doi.org/10.3389/frobt.2023.1221739
work_keys_str_mv	AT chalvatzakigeorgia learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT younesali learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT nandhadaljeet learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT leanthai learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT ribeiroleonardofr learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT gurevychiryna learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning

Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning

Ejemplares similares