Cargando…
A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene
Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single ph...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422454/ https://www.ncbi.nlm.nih.gov/pubmed/37571707 http://dx.doi.org/10.3390/s23156924 |
_version_ | 1785089214236327936 |
---|---|
author | Tian, Hongzhi Wang, Jirong |
author_facet | Tian, Hongzhi Wang, Jirong |
author_sort | Tian, Hongzhi |
collection | PubMed |
description | Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single photo of the same category. By employing a pre-trained You Only Look Once (YOLO) v4 weight for object detection and 2D model generation in the photo, the method converts the segmented 2D photo-model into 3D flat photo-models assuming different sizes and poses. Through perspective projection and model matching, the method finds the best match between the model and the actual object in the captured stereo images. The matching fitness function is optimized using a genetic algorithm (GA). Unlike data-driven approaches, this approach does not require multiple photos or pre-training time for single object pose recognition, making it more versatile. Indoor experiments demonstrate the effectiveness of the variable photo-model method in estimating the pose and size of the target objects within the same class. The findings of this study have practical implications for object detection prior to robotic grasping, particularly due to its ease of application and the limited data required. |
format | Online Article Text |
id | pubmed-10422454 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-104224542023-08-13 A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene Tian, Hongzhi Wang, Jirong Sensors (Basel) Article Model-based stereo vision methods can estimate the 6D poses of rigid objects. They can help robots to achieve a target grip in complex home environments. This study presents a novel approach, called the variable photo-model method, to estimate the pose and size of an unknown object using a single photo of the same category. By employing a pre-trained You Only Look Once (YOLO) v4 weight for object detection and 2D model generation in the photo, the method converts the segmented 2D photo-model into 3D flat photo-models assuming different sizes and poses. Through perspective projection and model matching, the method finds the best match between the model and the actual object in the captured stereo images. The matching fitness function is optimized using a genetic algorithm (GA). Unlike data-driven approaches, this approach does not require multiple photos or pre-training time for single object pose recognition, making it more versatile. Indoor experiments demonstrate the effectiveness of the variable photo-model method in estimating the pose and size of the target objects within the same class. The findings of this study have practical implications for object detection prior to robotic grasping, particularly due to its ease of application and the limited data required. MDPI 2023-08-03 /pmc/articles/PMC10422454/ /pubmed/37571707 http://dx.doi.org/10.3390/s23156924 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tian, Hongzhi Wang, Jirong A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene |
title | A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene |
title_full | A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene |
title_fullStr | A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene |
title_full_unstemmed | A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene |
title_short | A Variable Photo-Model Method for Object Pose and Size Estimation with Stereo Vision in a Complex Home Scene |
title_sort | variable photo-model method for object pose and size estimation with stereo vision in a complex home scene |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422454/ https://www.ncbi.nlm.nih.gov/pubmed/37571707 http://dx.doi.org/10.3390/s23156924 |
work_keys_str_mv | AT tianhongzhi avariablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene AT wangjirong avariablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene AT tianhongzhi variablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene AT wangjirong variablephotomodelmethodforobjectposeandsizeestimationwithstereovisioninacomplexhomescene |