Cargando…

An application of stereo matching algorithm based on transfer learning on robots in multiple scenes

Robot vision technology based on binocular vision holds tremendous potential for development in various fields, including 3D scene reconstruction, target detection, and autonomous driving. However, current binocular vision methods used in robotics engineering have limitations such as high costs, com...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bi, Yuanwei, Li, Chuanbiao, Tong, Xiangrong, Wang, Guohui, Sun, Haiwei
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10404586/ https://www.ncbi.nlm.nih.gov/pubmed/37544958 http://dx.doi.org/10.1038/s41598-023-39964-z

_version_	1785085332093403136
author	Bi, Yuanwei Li, Chuanbiao Tong, Xiangrong Wang, Guohui Sun, Haiwei
author_facet	Bi, Yuanwei Li, Chuanbiao Tong, Xiangrong Wang, Guohui Sun, Haiwei
author_sort	Bi, Yuanwei
collection	PubMed
description	Robot vision technology based on binocular vision holds tremendous potential for development in various fields, including 3D scene reconstruction, target detection, and autonomous driving. However, current binocular vision methods used in robotics engineering have limitations such as high costs, complex algorithms, and low reliability of the generated disparity map in different scenes. To overcome these challenges, a cross-domain stereo matching algorithm for binocular vision based on transfer learning was proposed in this paper, named Cross-Domain Adaptation and Transfer Learning Network (Ct-Net), which has shown valuable results in multiple robot scenes. First, this paper introduces a General Feature Extractor to extract rich general feature information for domain adaptive stereo matching tasks. Then, a feature adapter is used to adapt the general features to the stereo matching network. Furthermore, a Domain Adaptive Cost Optimization Module is designed to optimize the matching cost. A disparity score prediction module was also embedded to adaptively adjust the search range of disparity and optimize the cost distribution. The overall framework was trained using a phased strategy, and ablation experiments were conducted to verify the effectiveness of the training strategy. Compared with the prototype PSMNet, on KITTI 2015 benchmark, the 3PE-fg of Ct-Net in all regions and non-occluded regions decreased by 19.3 and 21.1% respectively, meanwhile, on the Middlebury dataset, the proposed algorithm improves the sample error rate at least 28.4%, which is the Staircase sample. The quantitative and qualitative results obtained from Middlebury, Apollo, and other datasets demonstrate that Ct-Net significantly improves the cross-domain performance of stereo matching. Stereo matching experiments in real-world scenes have shown that it can effectively address visual tasks in multiple scenes.
format	Online Article Text
id	pubmed-10404586
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-104045862023-08-08 An application of stereo matching algorithm based on transfer learning on robots in multiple scenes Bi, Yuanwei Li, Chuanbiao Tong, Xiangrong Wang, Guohui Sun, Haiwei Sci Rep Article Robot vision technology based on binocular vision holds tremendous potential for development in various fields, including 3D scene reconstruction, target detection, and autonomous driving. However, current binocular vision methods used in robotics engineering have limitations such as high costs, complex algorithms, and low reliability of the generated disparity map in different scenes. To overcome these challenges, a cross-domain stereo matching algorithm for binocular vision based on transfer learning was proposed in this paper, named Cross-Domain Adaptation and Transfer Learning Network (Ct-Net), which has shown valuable results in multiple robot scenes. First, this paper introduces a General Feature Extractor to extract rich general feature information for domain adaptive stereo matching tasks. Then, a feature adapter is used to adapt the general features to the stereo matching network. Furthermore, a Domain Adaptive Cost Optimization Module is designed to optimize the matching cost. A disparity score prediction module was also embedded to adaptively adjust the search range of disparity and optimize the cost distribution. The overall framework was trained using a phased strategy, and ablation experiments were conducted to verify the effectiveness of the training strategy. Compared with the prototype PSMNet, on KITTI 2015 benchmark, the 3PE-fg of Ct-Net in all regions and non-occluded regions decreased by 19.3 and 21.1% respectively, meanwhile, on the Middlebury dataset, the proposed algorithm improves the sample error rate at least 28.4%, which is the Staircase sample. The quantitative and qualitative results obtained from Middlebury, Apollo, and other datasets demonstrate that Ct-Net significantly improves the cross-domain performance of stereo matching. Stereo matching experiments in real-world scenes have shown that it can effectively address visual tasks in multiple scenes. Nature Publishing Group UK 2023-08-06 /pmc/articles/PMC10404586/ /pubmed/37544958 http://dx.doi.org/10.1038/s41598-023-39964-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Bi, Yuanwei Li, Chuanbiao Tong, Xiangrong Wang, Guohui Sun, Haiwei An application of stereo matching algorithm based on transfer learning on robots in multiple scenes
title	An application of stereo matching algorithm based on transfer learning on robots in multiple scenes
title_full	An application of stereo matching algorithm based on transfer learning on robots in multiple scenes
title_fullStr	An application of stereo matching algorithm based on transfer learning on robots in multiple scenes
title_full_unstemmed	An application of stereo matching algorithm based on transfer learning on robots in multiple scenes
title_short	An application of stereo matching algorithm based on transfer learning on robots in multiple scenes
title_sort	application of stereo matching algorithm based on transfer learning on robots in multiple scenes
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10404586/ https://www.ncbi.nlm.nih.gov/pubmed/37544958 http://dx.doi.org/10.1038/s41598-023-39964-z
work_keys_str_mv	AT biyuanwei anapplicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT lichuanbiao anapplicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT tongxiangrong anapplicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT wangguohui anapplicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT sunhaiwei anapplicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT biyuanwei applicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT lichuanbiao applicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT tongxiangrong applicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT wangguohui applicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes AT sunhaiwei applicationofstereomatchingalgorithmbasedontransferlearningonrobotsinmultiplescenes

An application of stereo matching algorithm based on transfer learning on robots in multiple scenes

Ejemplares similares