Cargando…

A stereo matching algorithm based on the improved PSMNet

Deep learning based on a convolutional neural network (CNN) has been successfully applied to stereo matching. Compared with the traditional method, the speed and accuracy of this method have been greatly improved. However, the existing stereo matching framework based on a CNN often encounters two pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Zedong, Gu, Jinan, Li, Jing, Yu, Xuefei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8376026/
https://www.ncbi.nlm.nih.gov/pubmed/34411098
http://dx.doi.org/10.1371/journal.pone.0251657
_version_ 1783740423140802560
author Huang, Zedong
Gu, Jinan
Li, Jing
Yu, Xuefei
author_facet Huang, Zedong
Gu, Jinan
Li, Jing
Yu, Xuefei
author_sort Huang, Zedong
collection PubMed
description Deep learning based on a convolutional neural network (CNN) has been successfully applied to stereo matching. Compared with the traditional method, the speed and accuracy of this method have been greatly improved. However, the existing stereo matching framework based on a CNN often encounters two problems. First, the existing stereo matching network has many parameters, which leads to the matching running time being too long. Second, the disparity estimation is inadequate in some regions where reflections, repeated textures, and fine structures may lead to ill-posed problems. Through the lightweight improvement of the PSMNet (Pyramid Stereo Matching Network) model, the common matching effect of ill-conditioned areas such as repeated texture areas and weak texture areas is solved. In the feature extraction part, ResNeXt is introduced to learn unitary feature extraction, and the ASPP (Atrous Spatial Pyramid Pooling) module is trained to extract multiscale spatial feature information. The feature fusion module is designed to effectively fuse the feature information of different scales to construct the matching cost volume. The improved 3D CNN uses the stacked encoding and decoding structure to further regularize the matching cost volume and obtain the corresponding relationship between feature points under different parallax conditions. Finally, the disparity map is obtained by a regression. We evaluate our method on the Scene Flow, KITTI 2012, and KITTI 2015 stereo datasets. The experiments show that the proposed stereo matching network achieves a comparable prediction accuracy and much faster running speed compared with PSMNet.
format Online
Article
Text
id pubmed-8376026
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83760262021-08-20 A stereo matching algorithm based on the improved PSMNet Huang, Zedong Gu, Jinan Li, Jing Yu, Xuefei PLoS One Research Article Deep learning based on a convolutional neural network (CNN) has been successfully applied to stereo matching. Compared with the traditional method, the speed and accuracy of this method have been greatly improved. However, the existing stereo matching framework based on a CNN often encounters two problems. First, the existing stereo matching network has many parameters, which leads to the matching running time being too long. Second, the disparity estimation is inadequate in some regions where reflections, repeated textures, and fine structures may lead to ill-posed problems. Through the lightweight improvement of the PSMNet (Pyramid Stereo Matching Network) model, the common matching effect of ill-conditioned areas such as repeated texture areas and weak texture areas is solved. In the feature extraction part, ResNeXt is introduced to learn unitary feature extraction, and the ASPP (Atrous Spatial Pyramid Pooling) module is trained to extract multiscale spatial feature information. The feature fusion module is designed to effectively fuse the feature information of different scales to construct the matching cost volume. The improved 3D CNN uses the stacked encoding and decoding structure to further regularize the matching cost volume and obtain the corresponding relationship between feature points under different parallax conditions. Finally, the disparity map is obtained by a regression. We evaluate our method on the Scene Flow, KITTI 2012, and KITTI 2015 stereo datasets. The experiments show that the proposed stereo matching network achieves a comparable prediction accuracy and much faster running speed compared with PSMNet. Public Library of Science 2021-08-19 /pmc/articles/PMC8376026/ /pubmed/34411098 http://dx.doi.org/10.1371/journal.pone.0251657 Text en © 2021 Huang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Huang, Zedong
Gu, Jinan
Li, Jing
Yu, Xuefei
A stereo matching algorithm based on the improved PSMNet
title A stereo matching algorithm based on the improved PSMNet
title_full A stereo matching algorithm based on the improved PSMNet
title_fullStr A stereo matching algorithm based on the improved PSMNet
title_full_unstemmed A stereo matching algorithm based on the improved PSMNet
title_short A stereo matching algorithm based on the improved PSMNet
title_sort stereo matching algorithm based on the improved psmnet
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8376026/
https://www.ncbi.nlm.nih.gov/pubmed/34411098
http://dx.doi.org/10.1371/journal.pone.0251657
work_keys_str_mv AT huangzedong astereomatchingalgorithmbasedontheimprovedpsmnet
AT gujinan astereomatchingalgorithmbasedontheimprovedpsmnet
AT lijing astereomatchingalgorithmbasedontheimprovedpsmnet
AT yuxuefei astereomatchingalgorithmbasedontheimprovedpsmnet
AT huangzedong stereomatchingalgorithmbasedontheimprovedpsmnet
AT gujinan stereomatchingalgorithmbasedontheimprovedpsmnet
AT lijing stereomatchingalgorithmbasedontheimprovedpsmnet
AT yuxuefei stereomatchingalgorithmbasedontheimprovedpsmnet