Cargando…
Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision,...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9656959/ https://www.ncbi.nlm.nih.gov/pubmed/36365864 http://dx.doi.org/10.3390/s22218166 |
_version_ | 1784829569940848640 |
---|---|
author | Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong |
author_facet | Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong |
author_sort | Jung, Tae-Won |
collection | PubMed |
description | Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision, utilize conventional Convolutional Neural Networks (CNNs) to process data supported by graphs. This paper proposes a one-stage GCN approach for 3D object detection and poses estimation by structuring non-linearly distributed points of a graph. Our network provides the required details to analyze, generate and estimate bounding boxes by spatially structuring the input data into graphs. Our method proposes a keypoint attention mechanism that aggregates the relative features between each point to estimate the category and pose of the object to which the vertices of the graph belong, and also designs nine degrees of freedom of multi-object pose estimation. In addition, to avoid gimbal lock in 3D space, we use quaternion rotation, instead of Euler angle. Experimental results showed that memory usage and efficiency could be improved by aggregating point features from the point cloud and their neighbors in a graph structure. Overall, the system achieved comparable performance against state-of-the-art systems. |
format | Online Article Text |
id | pubmed-9656959 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96569592022-11-15 Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong Sensors (Basel) Article Graph Neural Networks (GNNs) are neural networks that learn the representation of nodes and associated edges that connect it to every other node while maintaining graph representation. Graph Convolutional Neural Networks (GCNs), as a representative method in GNNs, in the context of computer vision, utilize conventional Convolutional Neural Networks (CNNs) to process data supported by graphs. This paper proposes a one-stage GCN approach for 3D object detection and poses estimation by structuring non-linearly distributed points of a graph. Our network provides the required details to analyze, generate and estimate bounding boxes by spatially structuring the input data into graphs. Our method proposes a keypoint attention mechanism that aggregates the relative features between each point to estimate the category and pose of the object to which the vertices of the graph belong, and also designs nine degrees of freedom of multi-object pose estimation. In addition, to avoid gimbal lock in 3D space, we use quaternion rotation, instead of Euler angle. Experimental results showed that memory usage and efficiency could be improved by aggregating point features from the point cloud and their neighbors in a graph structure. Overall, the system achieved comparable performance against state-of-the-art systems. MDPI 2022-10-25 /pmc/articles/PMC9656959/ /pubmed/36365864 http://dx.doi.org/10.3390/s22218166 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jung, Tae-Won Jeong, Chi-Seo Kim, In-Seon Yu, Min-Su Kwon, Soon-Chul Jung, Kye-Dong Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud |
title | Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud |
title_full | Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud |
title_fullStr | Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud |
title_full_unstemmed | Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud |
title_short | Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud |
title_sort | graph convolutional network for 3d object pose estimation in a point cloud |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9656959/ https://www.ncbi.nlm.nih.gov/pubmed/36365864 http://dx.doi.org/10.3390/s22218166 |
work_keys_str_mv | AT jungtaewon graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT jeongchiseo graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT kiminseon graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT yuminsu graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT kwonsoonchul graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud AT jungkyedong graphconvolutionalnetworkfor3dobjectposeestimationinapointcloud |