Cargando…

A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene

Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single ph...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tian, Hongzhi, Wang, Jirong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422454/ https://www.ncbi.nlm.nih.gov/pubmed/37571707 http://dx.doi.org/10.3390/s23156924

_version_	1785089214236327936
author	Tian, Hongzhi Wang, Jirong
author_facet	Tian, Hongzhi Wang, Jirong
author_sort	Tian, Hongzhi
collection	PubMed
description	Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single photo of the same category. By employing a pre-trained You Only Look Once (YOLO) v4 weight for object detection and 2D model generation in the photo, the method converts the segmented 2D photo-model into 3D flat photo-models assuming different sizes and poses. Through perspective projection and model matching, the method finds the best match between the model and the actual object in the captured stereo images. The matching fitness function is optimized using a genetic algorithm (GA). Unlike data-driven approaches, this approach does not require multiple photos or pre-training time for single object pose recognition, making it more versatile. Indoor experiments demonstrate the effectiveness of the variable photo-model method in estimating the pose and size of the target objects within the same class. The findings of this study have practical implications for object detection prior to robotic grasping, particularly due to its ease of application and the limited data required.
format	Online Article Text
id	pubmed-10422454
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-104224542023-08-13 A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene Tian, Hongzhi Wang, Jirong Sensors (Basel) Article Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single photo of the same category. By employing a pre-trained You Only Look Once (YOLO) v4 weight for object detection and 2D model generation in the photo, the method converts the segmented 2D photo-model into 3D flat photo-models assuming different sizes and poses. Through perspective projection and model matching, the method finds the best match between the model and the actual object in the captured stereo images. The matching fitness function is optimized using a genetic algorithm (GA). Unlike data-driven approaches, this approach does not require multiple photos or pre-training time for single object pose recognition, making it more versatile. Indoor experiments demonstrate the effectiveness of the variable photo-model method in estimating the pose and size of the target objects within the same class. The findings of this study have practical implications for object detection prior to robotic grasping, particularly due to its ease of application and the limited data required. MDPI 2023-08-03 /pmc/articles/PMC10422454/ /pubmed/37571707 http://dx.doi.org/10.3390/s23156924 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Tian, Hongzhi Wang, Jirong A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title	A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_full	A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_fullStr	A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_full_unstemmed	A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_short	A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_sort	variable photo-model method for object pose and size estimation with stereo vision in a complex home scene
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422454/ https://www.ncbi.nlm.nih.gov/pubmed/37571707 http://dx.doi.org/10.3390/s23156924
work_keys_str_mv	AT tianhongzhi avariablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene AT wangjirong avariablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene AT tianhongzhi variablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene AT wangjirong variablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene

A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene

Ejemplares similares