Cargando…

Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud

Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jung, Tae-Won, Jeong, Chi-Seo, Kim, In-Seon, Yu, Min-Su, Kwon, Soon-Chul, Jung, Kye-Dong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9656959/ https://www.ncbi.nlm.nih.gov/pubmed/36365864 http://dx.doi.org/10.3390/s22218166

_version_	1784829569940848640
author	Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong
author_facet	Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong
author_sort	Jung, Tae-Won
collection	PubMed
description	Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision, utilize conventional Convolutional Neural Networks (CNNs) to process data supported by graphs. This paper proposes a one-stage GCN approach for 3D object detection and poses estimation by structuring non-linearly distributed points of a graph. Our network provides the required details to analyze, generate and estimate bounding boxes by spatially structuring the input data into graphs. Our method proposes a keypoint attention mechanism that aggregates the relative features between each point to estimate the category and pose of the object to which the vertices of the graph belong, and also designs nine degrees of freedom of multi-object pose estimation. In addition, to avoid gimbal lock in 3D space, we use quaternion rotation, instead of Euler angle. Experimental results showed that memory usage and efficiency could be improved by aggregating point features from the point cloud and their neighbors in a graph structure. Overall, the system achieved comparable performance against state-of-the-art systems.
format	Online Article Text
id	pubmed-9656959
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-96569592022-11-15 Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong Sensors (Basel) Article Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision, utilize conventional Convolutional Neural Networks (CNNs) to process data supported by graphs. This paper proposes a one-stage GCN approach for 3D object detection and poses estimation by structuring non-linearly distributed points of a graph. Our network provides the required details to analyze, generate and estimate bounding boxes by spatially structuring the input data into graphs. Our method proposes a keypoint attention mechanism that aggregates the relative features between each point to estimate the category and pose of the object to which the vertices of the graph belong, and also designs nine degrees of freedom of multi-object pose estimation. In addition, to avoid gimbal lock in 3D space, we use quaternion rotation, instead of Euler angle. Experimental results showed that memory usage and efficiency could be improved by aggregating point features from the point cloud and their neighbors in a graph structure. Overall, the system achieved comparable performance against state-of-the-art systems. MDPI 2022-10-25 /pmc/articles/PMC9656959/ /pubmed/36365864 http://dx.doi.org/10.3390/s22218166 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title	Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_full	Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_fullStr	Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_full_unstemmed	Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_short	Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
title_sort	graph convolutional network for 3d object pose estimation in a point cloud
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9656959/ https://www.ncbi.nlm.nih.gov/pubmed/36365864 http://dx.doi.org/10.3390/s22218166
work_keys_str_mv	AT jungtaewon graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT jeongchiseo graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT kiminseon graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT yuminsu graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT kwonsoonchul graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT jungkyedong graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud

Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud

Ejemplares similares