Cargando…

A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene

Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single ph...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Hongzhi, Wang, Jirong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422454/
https://www.ncbi.nlm.nih.gov/pubmed/37571707
http://dx.doi.org/10.3390/s23156924
_version_ 1785089214236327936
author Tian, Hongzhi
Wang, Jirong
author_facet Tian, Hongzhi
Wang, Jirong
author_sort Tian, Hongzhi
collection PubMed
description Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single photo of the same category. By employing a pre-trained You Only Look Once (YOLO) v4 weight for object detection and 2D model generation in the photo, the method converts the segmented 2D photo-model into 3D flat photo-models assuming different sizes and poses. Through perspective projection and model matching, the method finds the best match between the model and the actual object in the captured stereo images. The matching fitness function is optimized using a genetic algorithm (GA). Unlike data-driven approaches, this approach does not require multiple photos or pre-training time for single object pose recognition, making it more versatile. Indoor experiments demonstrate the effectiveness of the variable photo-model method in estimating the pose and size of the target objects within the same class. The findings of this study have practical implications for object detection prior to robotic grasping, particularly due to its ease of application and the limited data required.
format Online
Article
Text
id pubmed-10422454
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-104224542023-08-13 A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene Tian, Hongzhi Wang, Jirong Sensors (Basel) Article Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single photo of the same category. By employing a pre-trained You Only Look Once (YOLO) v4 weight for object detection and 2D model generation in the photo, the method converts the segmented 2D photo-model into 3D flat photo-models assuming different sizes and poses. Through perspective projection and model matching, the method finds the best match between the model and the actual object in the captured stereo images. The matching fitness function is optimized using a genetic algorithm (GA). Unlike data-driven approaches, this approach does not require multiple photos or pre-training time for single object pose recognition, making it more versatile. Indoor experiments demonstrate the effectiveness of the variable photo-model method in estimating the pose and size of the target objects within the same class. The findings of this study have practical implications for object detection prior to robotic grasping, particularly due to its ease of application and the limited data required. MDPI 2023-08-03 /pmc/articles/PMC10422454/ /pubmed/37571707 http://dx.doi.org/10.3390/s23156924 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tian, Hongzhi
Wang, Jirong
A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_full A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_fullStr A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_full_unstemmed A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_short A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
title_sort variable photo-model method for object pose and size estimation with stereo vision in a complex home scene
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422454/
https://www.ncbi.nlm.nih.gov/pubmed/37571707
http://dx.doi.org/10.3390/s23156924
work_keys_str_mv AT tianhongzhi avariablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene
AT wangjirong avariablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene
AT tianhongzhi variablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene
AT wangjirong variablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene