Progressive Deep Learning Framework for Recognizing 3D Orientations and Object Class Based on Point Cloud Representation †
Deep learning approaches to estimating full 3D orientations of objects, in addition to object classes, are limited in their accuracies, due to the difficulty in learning the continuous nature of three-axis orientation variations by regression or classification with sufficient generalization. This pa...
Main authors: | Lee, Sukhan; Yang, Yongjun |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8468838/ https://www.ncbi.nlm.nih.gov/pubmed/34577315 http://dx.doi.org/10.3390/s21186108 |
_version_ | 1784573774775975936 |
---|---|
author | Lee, Sukhan Yang, Yongjun |
author_facet | Lee, Sukhan Yang, Yongjun |
author_sort | Lee, Sukhan |
collection | PubMed |
description | Deep learning approaches to estimating full 3D orientations of objects, in addition to object classes, are limited in their accuracies, due to the difficulty in learning the continuous nature of three-axis orientation variations by regression or classification with sufficient generalization. This paper presents a novel progressive deep learning framework, herein referred to as 3D POCO Net, that offers high accuracy in estimating orientations about three rotational axes yet with efficiency in network complexity. The proposed 3D POCO Net is configured, using four PointNet-based networks for independently representing the object class and three individual axes of rotations. The four independent networks are linked by in-between association subnetworks that are trained to progressively map the global features learned by individual networks one after another for fine-tuning the independent networks. In 3D POCO Net, high accuracy is achieved by combining a high precision classification based on a large number of orientation classes with a regression based on a weighted sum of classification outputs, while high efficiency is maintained by a progressive framework by which a large number of orientation classes are grouped into independent networks linked by association subnetworks. We implemented 3D POCO Net for full three-axis orientation variations and trained it with about 146 million orientation variations augmented from the ModelNet10 dataset. The testing results show that we can achieve an orientation regression error of about 2.5° with about 90% accuracy in object classification for general three-axis orientation estimation and object classification. Furthermore, we demonstrate that a pre-trained 3D POCO Net can serve as an orientation representation platform based on which orientations as well as object classes of partial point clouds from occluded objects are learned in the form of transfer learning. |
format | Online Article Text |
id | pubmed-8468838 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84688382021-09-27 Progressive Deep Learning Framework for Recognizing 3D Orientations and Object Class Based on Point Cloud Representation † Lee, Sukhan Yang, Yongjun Sensors (Basel) Article Deep learning approaches to estimating full 3D orientations of objects, in addition to object classes, are limited in their accuracies, due to the difficulty in learning the continuous nature of three-axis orientation variations by regression or classification with sufficient generalization. This paper presents a novel progressive deep learning framework, herein referred to as 3D POCO Net, that offers high accuracy in estimating orientations about three rotational axes yet with efficiency in network complexity. The proposed 3D POCO Net is configured, using four PointNet-based networks for independently representing the object class and three individual axes of rotations. The four independent networks are linked by in-between association subnetworks that are trained to progressively map the global features learned by individual networks one after another for fine-tuning the independent networks. In 3D POCO Net, high accuracy is achieved by combining a high precision classification based on a large number of orientation classes with a regression based on a weighted sum of classification outputs, while high efficiency is maintained by a progressive framework by which a large number of orientation classes are grouped into independent networks linked by association subnetworks. We implemented 3D POCO Net for full three-axis orientation variations and trained it with about 146 million orientation variations augmented from the ModelNet10 dataset. The testing results show that we can achieve an orientation regression error of about 2.5° with about 90% accuracy in object classification for general three-axis orientation estimation and object classification. Furthermore, we demonstrate that a pre-trained 3D POCO Net can serve as an orientation representation platform based on which orientations as well as object classes of partial point clouds from occluded objects are learned in the form of transfer learning. MDPI 2021-09-12 /pmc/articles/PMC8468838/ /pubmed/34577315 http://dx.doi.org/10.3390/s21186108 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lee, Sukhan Yang, Yongjun Progressive Deep Learning Framework for Recognizing 3D Orientations and Object Class Based on Point Cloud Representation † |
title | Progressive Deep Learning Framework for Recognizing 3D Orientations and Object Class Based on Point Cloud Representation † |
title_full | Progressive Deep Learning Framework for Recognizing 3D Orientations and Object Class Based on Point Cloud Representation † |
title_fullStr | Progressive Deep Learning Framework for Recognizing 3D Orientations and Object Class Based on Point Cloud Representation † |
title_full_unstemmed | Progressive Deep Learning Framework for Recognizing 3D Orientations and Object Class Based on Point Cloud Representation † |
title_short | Progressive Deep Learning Framework for Recognizing 3D Orientations and Object Class Based on Point Cloud Representation † |
title_sort | progressive deep learning framework for recognizing 3d orientations and object class based on point cloud representation † |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8468838/ https://www.ncbi.nlm.nih.gov/pubmed/34577315 http://dx.doi.org/10.3390/s21186108 |
work_keys_str_mv | AT leesukhan progressivedeeplearningframeworkforrecognizing3dorientationsandobjectclassbasedonpointcloudrepresentation AT yangyongjun progressivedeeplearningframeworkforrecognizing3dorientationsandobjectclassbasedonpointcloudrepresentation |
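
Illustrative note: the description above mentions "a regression based on a weighted sum of classification outputs". The sketch below shows one plausible reading of that idea for a single rotation axis, and is not the authors' implementation. The function names, the bin layout, and the use of a softmax are assumptions made for illustration; the sketch also ignores angle wraparound at 0°/360°, which a real system would need to handle.

```python
import math


def softmax(logits):
    """Convert raw class scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def angle_from_class_scores(logits, bin_centers_deg):
    """Estimate a continuous angle as the probability-weighted sum of bin centers.

    `logits` are hypothetical per-orientation-class scores for one axis;
    `bin_centers_deg` are the assumed angles (in degrees) each class stands for.
    """
    if len(logits) != len(bin_centers_deg):
        raise ValueError("one score is required per orientation class")
    probs = softmax(logits)
    return sum(p * c for p, c in zip(probs, bin_centers_deg))


# Two neighboring orientation classes with equal scores:
# the estimate falls midway between their bin centers.
estimate = angle_from_class_scores([0.0, 0.0], [30.0, 50.0])
```

The point of the weighted sum is that the estimate can land between discrete orientation classes, so the resolution of the output is not limited to the bin width of the classifier.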