Cargando…

Estimation of 6D Object Pose Using a 2D Bounding Box

This paper provides an efficient way of addressing the problem of detecting or estimating the 6-Dimensional (6D) pose of objects from an RGB image. A quaternion is used to define an object′s three-dimensional pose, but the pose represented by q and the pose represented by -q are equivalent, and the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hong, Yong, Liu, Jin, Jahangir, Zahid, He, Sheng, Zhang, Qing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8122747/ https://www.ncbi.nlm.nih.gov/pubmed/33922124 http://dx.doi.org/10.3390/s21092939

_version_	1783692703181045760
author	Hong, Yong Liu, Jin Jahangir, Zahid He, Sheng Zhang, Qing
author_facet	Hong, Yong Liu, Jin Jahangir, Zahid He, Sheng Zhang, Qing
author_sort	Hong, Yong
collection	PubMed
description	This paper provides an efficient way of addressing the problem of detecting or estimating the 6-Dimensional (6D) pose of objects from an RGB image. A quaternion is used to define an object′s three-dimensional pose, but the pose represented by q and the pose represented by -q are equivalent, and the L2 loss between them is very large. Therefore, we define a new quaternion pose loss function to solve this problem. Based on this, we designed a new convolutional neural network named Q-Net to estimate an object’s pose. Considering that the quaternion′s output is a unit vector, a normalization layer is added in Q-Net to hold the output of pose on a four-dimensional unit sphere. We propose a new algorithm, called the Bounding Box Equation, to obtain 3D translation quickly and effectively from 2D bounding boxes. The algorithm uses an entirely new way of assessing the 3D rotation (R) and 3D translation rotation (t) in only one RGB image. This method can upgrade any traditional 2D-box prediction algorithm to a 3D prediction model. We evaluated our model using the LineMod dataset, and experiments have shown that our methodology is more acceptable and efficient in terms of L2 loss and computational time.
format	Online Article Text
id	pubmed-8122747
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-81227472021-05-16 Estimation of 6D Object Pose Using a 2D Bounding Box Hong, Yong Liu, Jin Jahangir, Zahid He, Sheng Zhang, Qing Sensors (Basel) Article This paper provides an efficient way of addressing the problem of detecting or estimating the 6-Dimensional (6D) pose of objects from an RGB image. A quaternion is used to define an object′s three-dimensional pose, but the pose represented by q and the pose represented by -q are equivalent, and the L2 loss between them is very large. Therefore, we define a new quaternion pose loss function to solve this problem. Based on this, we designed a new convolutional neural network named Q-Net to estimate an object’s pose. Considering that the quaternion′s output is a unit vector, a normalization layer is added in Q-Net to hold the output of pose on a four-dimensional unit sphere. We propose a new algorithm, called the Bounding Box Equation, to obtain 3D translation quickly and effectively from 2D bounding boxes. The algorithm uses an entirely new way of assessing the 3D rotation (R) and 3D translation rotation (t) in only one RGB image. This method can upgrade any traditional 2D-box prediction algorithm to a 3D prediction model. We evaluated our model using the LineMod dataset, and experiments have shown that our methodology is more acceptable and efficient in terms of L2 loss and computational time. MDPI 2021-04-22 /pmc/articles/PMC8122747/ /pubmed/33922124 http://dx.doi.org/10.3390/s21092939 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Hong, Yong Liu, Jin Jahangir, Zahid He, Sheng Zhang, Qing Estimation of 6D Object Pose Using a 2D Bounding Box
title	Estimation of 6D Object Pose Using a 2D Bounding Box
title_full	Estimation of 6D Object Pose Using a 2D Bounding Box
title_fullStr	Estimation of 6D Object Pose Using a 2D Bounding Box
title_full_unstemmed	Estimation of 6D Object Pose Using a 2D Bounding Box
title_short	Estimation of 6D Object Pose Using a 2D Bounding Box
title_sort	estimation of 6d object pose using a 2d bounding box
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8122747/ https://www.ncbi.nlm.nih.gov/pubmed/33922124 http://dx.doi.org/10.3390/s21092939
work_keys_str_mv	AT hongyong estimationof6dobjectposeusinga2dboundingbox AT liujin estimationof6dobjectposeusinga2dboundingbox AT jahangirzahid estimationof6dobjectposeusinga2dboundingbox AT hesheng estimationof6dobjectposeusinga2dboundingbox AT zhangqing estimationof6dobjectposeusinga2dboundingbox

Estimation of 6D Object Pose Using a 2D Bounding Box

Ejemplares similares