Cargando…

SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training

Estimation and tracking of 6DoF poses of objects in images is a challenging problem of great importance for robotic interaction and augmented reality. Recent approaches applying deep neural networks for pose estimation have shown encouraging results. However, most of them rely on training with real...

Descripción completa

Detalles Bibliográficos
Autores principales:	Su, Yongzhi, Rambach, Jason, Pagani, Alain, Stricker, Didier
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7796199/ https://www.ncbi.nlm.nih.gov/pubmed/33466293 http://dx.doi.org/10.3390/s21010300

_version_	1783634625783922688
author	Su, Yongzhi Rambach, Jason Pagani, Alain Stricker, Didier
author_facet	Su, Yongzhi Rambach, Jason Pagani, Alain Stricker, Didier
author_sort	Su, Yongzhi
collection	PubMed
description	Estimation and tracking of 6DoF poses of objects in images is a challenging problem of great importance for robotic interaction and augmented reality. Recent approaches applying deep neural networks for pose estimation have shown encouraging results. However, most of them rely on training with real images of objects with severe limitations concerning ground truth pose acquisition, full coverage of possible poses, and training dataset scaling and generalization capability. This paper presents a novel approach using a Convolutional Neural Network (CNN) trained exclusively on single-channel Synthetic images of objects to regress 6DoF object Poses directly (SynPo-Net). The proposed SynPo-Net is a network architecture specifically designed for pose regression and a proposed domain adaptation scheme transforming real and synthetic images into an intermediate domain that is better fit for establishing correspondences. The extensive evaluation shows that our approach significantly outperforms the state-of-the-art using synthetic training in terms of both accuracy and speed. Our system can be used to estimate the 6DoF pose from a single frame, or be integrated into a tracking system to provide the initial pose.
format	Online Article Text
id	pubmed-7796199
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-77961992021-01-10 SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training Su, Yongzhi Rambach, Jason Pagani, Alain Stricker, Didier Sensors (Basel) Article Estimation and tracking of 6DoF poses of objects in images is a challenging problem of great importance for robotic interaction and augmented reality. Recent approaches applying deep neural networks for pose estimation have shown encouraging results. However, most of them rely on training with real images of objects with severe limitations concerning ground truth pose acquisition, full coverage of possible poses, and training dataset scaling and generalization capability. This paper presents a novel approach using a Convolutional Neural Network (CNN) trained exclusively on single-channel Synthetic images of objects to regress 6DoF object Poses directly (SynPo-Net). The proposed SynPo-Net is a network architecture specifically designed for pose regression and a proposed domain adaptation scheme transforming real and synthetic images into an intermediate domain that is better fit for establishing correspondences. The extensive evaluation shows that our approach significantly outperforms the state-of-the-art using synthetic training in terms of both accuracy and speed. Our system can be used to estimate the 6DoF pose from a single frame, or be integrated into a tracking system to provide the initial pose. MDPI 2021-01-05 /pmc/articles/PMC7796199/ /pubmed/33466293 http://dx.doi.org/10.3390/s21010300 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Su, Yongzhi Rambach, Jason Pagani, Alain Stricker, Didier SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training
title	SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training
title_full	SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training
title_fullStr	SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training
title_full_unstemmed	SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training
title_short	SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training
title_sort	synpo-net—accurate and fast cnn-based 6dof object pose estimation using synthetic training
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7796199/ https://www.ncbi.nlm.nih.gov/pubmed/33466293 http://dx.doi.org/10.3390/s21010300
work_keys_str_mv	AT suyongzhi synponetaccurateandfastcnnbased6dofobjectposeestimationusingsynthetictraining AT rambachjason synponetaccurateandfastcnnbased6dofobjectposeestimationusingsynthetictraining AT paganialain synponetaccurateandfastcnnbased6dofobjectposeestimationusingsynthetictraining AT strickerdidier synponetaccurateandfastcnnbased6dofobjectposeestimationusingsynthetictraining

SynPo-Net—Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training

Ejemplares similares