Cargando…

SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks

An embodied, autonomous agent able to set its own goals has to possess geometrical reasoning abilities for judging whether its goals have been achieved, namely it should be able to identify and discriminate classes of configurations of objects, irrespective of its point of view on the scene. However...

Descripción completa

Detalles Bibliográficos
Autores principales: Teodorescu, Laetitia, Hofmann, Katja, Oudeyer, Pierre-Yves
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8826049/
https://www.ncbi.nlm.nih.gov/pubmed/35156011
http://dx.doi.org/10.3389/frai.2021.782081
_version_ 1784647350481846272
author Teodorescu, Laetitia
Hofmann, Katja
Oudeyer, Pierre-Yves
author_facet Teodorescu, Laetitia
Hofmann, Katja
Oudeyer, Pierre-Yves
author_sort Teodorescu, Laetitia
collection PubMed
description An embodied, autonomous agent able to set its own goals has to possess geometrical reasoning abilities for judging whether its goals have been achieved, namely it should be able to identify and discriminate classes of configurations of objects, irrespective of its point of view on the scene. However, this problem has received little attention so far in the deep learning literature. In this paper we make two key contributions. First, we propose SpatialSim (Spatial Similarity), a novel geometrical reasoning diagnostic dataset, and argue that progress on this benchmark would allow for diagnosing more principled approaches to this problem. This benchmark is composed of two tasks: “Identification” and “Discrimination,” each one instantiated in increasing levels of difficulty. Secondly, we validate that relational inductive biases—exhibited by fully-connected message-passing Graph Neural Networks (MPGNNs)—are instrumental to solve those tasks, and show their advantages over less relational baselines such as Deep Sets and unstructured models such as Multi-Layer Perceptrons. We additionally showcase the failure of high-capacity CNNs on the hard Discrimination task. Finally, we highlight the current limits of GNNs in both tasks.
format Online
Article
Text
id pubmed-8826049
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88260492022-02-10 SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks Teodorescu, Laetitia Hofmann, Katja Oudeyer, Pierre-Yves Front Artif Intell Artificial Intelligence An embodied, autonomous agent able to set its own goals has to possess geometrical reasoning abilities for judging whether its goals have been achieved, namely it should be able to identify and discriminate classes of configurations of objects, irrespective of its point of view on the scene. However, this problem has received little attention so far in the deep learning literature. In this paper we make two key contributions. First, we propose SpatialSim (Spatial Similarity), a novel geometrical reasoning diagnostic dataset, and argue that progress on this benchmark would allow for diagnosing more principled approaches to this problem. This benchmark is composed of two tasks: “Identification” and “Discrimination,” each one instantiated in increasing levels of difficulty. Secondly, we validate that relational inductive biases—exhibited by fully-connected message-passing Graph Neural Networks (MPGNNs)—are instrumental to solve those tasks, and show their advantages over less relational baselines such as Deep Sets and unstructured models such as Multi-Layer Perceptrons. We additionally showcase the failure of high-capacity CNNs on the hard Discrimination task. Finally, we highlight the current limits of GNNs in both tasks. Frontiers Media S.A. 2022-01-26 /pmc/articles/PMC8826049/ /pubmed/35156011 http://dx.doi.org/10.3389/frai.2021.782081 Text en Copyright © 2022 Teodorescu, Hofmann and Oudeyer. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Teodorescu, Laetitia
Hofmann, Katja
Oudeyer, Pierre-Yves
SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks
title SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks
title_full SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks
title_fullStr SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks
title_full_unstemmed SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks
title_short SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks
title_sort spatialsim: recognizing spatial configurations of objects with graph neural networks
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8826049/
https://www.ncbi.nlm.nih.gov/pubmed/35156011
http://dx.doi.org/10.3389/frai.2021.782081
work_keys_str_mv AT teodoresculaetitia spatialsimrecognizingspatialconfigurationsofobjectswithgraphneuralnetworks
AT hofmannkatja spatialsimrecognizingspatialconfigurationsofobjectswithgraphneuralnetworks
AT oudeyerpierreyves spatialsimrecognizingspatialconfigurationsofobjectswithgraphneuralnetworks