Cargando…
Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning
Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464606/ https://www.ncbi.nlm.nih.gov/pubmed/37649810 http://dx.doi.org/10.3389/frobt.2023.1221739 |
_version_ | 1785098505741664256 |
---|---|
author | Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna |
author_facet | Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna |
author_sort | Chalvatzaki, Georgia |
collection | PubMed |
description | Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain that is represented as a scene graph, enabling it to translate human requests into executable robot plans, thereby learning to reason over long-horizon tasks, as encountered in the ALFRED benchmark. We compare our approach with classical planning and baseline methods to examine the applicability and generalizability of LLM-based planners. Our findings suggest that the knowledge stored in an LLM can be effectively grounded to perform long-horizon task planning, demonstrating the promising potential for the future application of neuro-symbolic planning methods in robotics. |
format | Online Article Text |
id | pubmed-10464606 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-104646062023-08-30 Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna Front Robot AI Robotics and AI Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain that is represented as a scene graph, enabling it to translate human requests into executable robot plans, thereby learning to reason over long-horizon tasks, as encountered in the ALFRED benchmark. We compare our approach with classical planning and baseline methods to examine the applicability and generalizability of LLM-based planners. Our findings suggest that the knowledge stored in an LLM can be effectively grounded to perform long-horizon task planning, demonstrating the promising potential for the future application of neuro-symbolic planning methods in robotics. Frontiers Media S.A. 2023-08-15 /pmc/articles/PMC10464606/ /pubmed/37649810 http://dx.doi.org/10.3389/frobt.2023.1221739 Text en Copyright © 2023 Chalvatzaki, Younes, Nandha, Le, Ribeiro and Gurevych. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Chalvatzaki, Georgia Younes, Ali Nandha, Daljeet Le, An Thai Ribeiro, Leonardo F. R. Gurevych, Iryna Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning |
title | Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning |
title_full | Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning |
title_fullStr | Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning |
title_full_unstemmed | Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning |
title_short | Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning |
title_sort | learning to reason over scene graphs: a case study of finetuning gpt-2 into a robot language model for grounded task planning |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464606/ https://www.ncbi.nlm.nih.gov/pubmed/37649810 http://dx.doi.org/10.3389/frobt.2023.1221739 |
work_keys_str_mv | AT chalvatzakigeorgia learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT younesali learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT nandhadaljeet learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT leanthai learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT ribeiroleonardofr learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning AT gurevychiryna learningtoreasonoverscenegraphsacasestudyoffinetuninggpt2intoarobotlanguagemodelforgroundedtaskplanning |