
ModuleNet: A Convolutional Neural Network for Stereo Vision

Convolutional neural networks (CNNs) have gained much attention for solving numerous vision problems, including disparity calculation in stereo vision systems. In this paper, we present a CNN-based solution for disparity estimation that builds upon a basic module (BM) with a limited range of...


Bibliographic Details
Main authors: Renteria-Vidales, O. I., Cuevas-Tello, J. C., Reyes-Figueroa, A., Rivera, M.
Format: Online Article Text
Language: English
Published: 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7297576/
http://dx.doi.org/10.1007/978-3-030-49076-8_21
_version_ 1783547035025145856
author Renteria-Vidales, O. I.
Cuevas-Tello, J. C.
Reyes-Figueroa, A.
Rivera, M.
collection PubMed
description Convolutional neural networks (CNNs) have gained much attention for solving numerous vision problems, including disparity calculation in stereo vision systems. In this paper, we present a CNN-based solution for disparity estimation that builds upon a basic module (BM) with a limited range of disparities, which can be extended by using several BMs in parallel. Our BM can be understood as a segmentation by disparity: it produces an output channel with the memberships for each disparity candidate and, additionally, a channel with the out-of-range disparity regions. This extra channel allows us to run several BMs in parallel and to deal with their respective responsibilities. We train our model with the MPI Sintel dataset. The results show that ModuleNet, our modular CNN model, outperforms the baseline algorithm Efficient Large-scale Stereo Matching (ELAS) and FlowNetC, achieving an improvement of about 80%.
format Online
Article
Text
id pubmed-7297576
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-7297576 2020-06-17 ModuleNet: A Convolutional Neural Network for Stereo Vision Renteria-Vidales, O. I. Cuevas-Tello, J. C. Reyes-Figueroa, A. Rivera, M. Pattern Recognition Article Convolutional neural networks (CNNs) have gained much attention for solving numerous vision problems, including disparity calculation in stereo vision systems. In this paper, we present a CNN-based solution for disparity estimation that builds upon a basic module (BM) with a limited range of disparities, which can be extended by using several BMs in parallel. Our BM can be understood as a segmentation by disparity: it produces an output channel with the memberships for each disparity candidate and, additionally, a channel with the out-of-range disparity regions. This extra channel allows us to run several BMs in parallel and to deal with their respective responsibilities. We train our model with the MPI Sintel dataset. The results show that ModuleNet, our modular CNN model, outperforms the baseline algorithm Efficient Large-scale Stereo Matching (ELAS) and FlowNetC, achieving an improvement of about 80%. 2020-04-29 /pmc/articles/PMC7297576/ http://dx.doi.org/10.1007/978-3-030-49076-8_21 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
title ModuleNet: A Convolutional Neural Network for Stereo Vision
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7297576/
http://dx.doi.org/10.1007/978-3-030-49076-8_21
work_keys_str_mv AT renteriavidalesoi modulenetaconvolutionalneuralnetworkforstereovision
AT cuevastellojc modulenetaconvolutionalneuralnetworkforstereovision
AT reyesfigueroaa modulenetaconvolutionalneuralnetworkforstereovision
AT riveram modulenetaconvolutionalneuralnetworkforstereovision
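
The description above says that each basic module (BM) covers a limited disparity range and also reports an out-of-range channel, so that several BMs can run in parallel. The following NumPy sketch shows one way such outputs might be fused into a single disparity map. It is not the authors' implementation: the function name `fuse_modules`, the array layout (one membership channel per disparity candidate followed by one out-of-range channel), and the fusion rule (down-weight each module by its out-of-range membership, then take the best-scoring candidate) are all assumptions made for illustration.

```python
import numpy as np


def fuse_modules(outputs, offsets):
    """Fuse the outputs of parallel basic modules into one disparity map.

    Hypothetical layout (an assumption, not taken from the paper):
      outputs[k] has shape (H, W, D_k + 1); the first D_k channels are
      memberships for disparities offsets[k] .. offsets[k] + D_k - 1,
      and the last channel is the out-of-range membership.
    """
    scores = []
    disparities = []
    for out, offset in zip(outputs, offsets):
        memberships = out[..., :-1]        # (H, W, D_k)
        in_range = 1.0 - out[..., -1:]     # (H, W, 1), high when the module is responsible
        scores.append(memberships * in_range)
        disparities.append(offset + np.arange(memberships.shape[-1]))

    all_scores = np.concatenate(scores, axis=-1)   # (H, W, sum of D_k)
    all_disparities = np.concatenate(disparities)  # (sum of D_k,)

    # For every pixel, pick the candidate with the highest weighted membership.
    best = np.argmax(all_scores, axis=-1)          # (H, W)
    return all_disparities[best]
```

With this rule, a module that flags a pixel as out of range contributes almost nothing at that pixel, so the module whose range contains the true disparity decides the result.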