Cargando…

Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud

Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision,...

Descripción completa

Detalles Bibliográficos
Autores principales: Jung, Tae-Won, Jeong, Chi-Seo, Kim, In-Seon, Yu, Min-Su, Kwon, Soon-Chul, Jung, Kye-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9656959/
https://www.ncbi.nlm.nih.gov/pubmed/36365864
http://dx.doi.org/10.3390/s22218166
_version_ 1784829569940848640
author Jung, Tae-Won
Jeong, Chi-Seo
Kim, In-Seon
Yu, Min-Su
Kwon, Soon-Chul
Jung, Kye-Dong
author_facet Jung, Tae-Won
Jeong, Chi-Seo
Kim, In-Seon
Yu, Min-Su
Kwon, Soon-Chul
Jung, Kye-Dong
author_sort Jung, Tae-Won
collection PubMed
description Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision, utilize conventional Convolutional Neural Networks (CNNs) to process data supported by graphs. This paper proposes a one-stage GCN approach for 3D object detection and poses estimation by structuring non-linearly distributed points of a graph. Our network provides the required details to analyze, generate and estimate bounding boxes by spatially structuring the input data into graphs. Our method proposes a keypoint attention mechanism that aggregates the relative features between each point to estimate the category and pose of the object to which the vertices of the graph belong, and also designs nine degrees of freedom of multi-object pose estimation. In addition, to avoid gimbal lock in 3D space, we use quaternion rotation, instead of Euler angle. Experimental results showed that memory usage and efficiency could be improved by aggregating point features from the point cloud and their neighbors in a graph structure. Overall, the system achieved comparable performance against state-of-the-art systems.
format Online
Article
Text
id pubmed-9656959
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96569592022-11-15 Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong Sensors (Basel) Article Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision, utilize conventional Convolutional Neural Networks (CNNs) to process data supported by graphs. This paper proposes a one-stage GCN approach for 3D object detection and poses estimation by structuring non-linearly distributed points of a graph. Our network provides the required details to analyze, generate and estimate bounding boxes by spatially structuring the input data into graphs. Our method proposes a keypoint attention mechanism that aggregates the relative features between each point to estimate the category and pose of the object to which the vertices of the graph belong, and also designs nine degrees of freedom of multi-object pose estimation. In addition, to avoid gimbal lock in 3D space, we use quaternion rotation, instead of Euler angle. Experimental results showed that memory usage and efficiency could be improved by aggregating point features from the point cloud and their neighbors in a graph structure. Overall, the system achieved comparable performance against state-of-the-art systems. MDPI 2022-10-25 /pmc/articles/PMC9656959/ /pubmed/36365864 http://dx.doi.org/10.3390/s22218166 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jung, Tae-Won
Jeong, Chi-Seo
Kim, In-Seon
Yu, Min-Su
Kwon, Soon-Chul
Jung, Kye-Dong
Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_full Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_fullStr Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_full_unstemmed Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_short Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_sort graph convolutional network for 3d object pose estimation in a point cloud
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9656959/
https://www.ncbi.nlm.nih.gov/pubmed/36365864
http://dx.doi.org/10.3390/s22218166
work_keys_str_mv AT jungtaewon graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud
AT jeongchiseo graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud
AT kiminseon graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud
AT yuminsu graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud
AT kwonsoonchul graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud
AT jungkyedong graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud