Cargando…

Self-supervised recurrent depth estimation with attention mechanisms

Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not requi...

Descripción completa

Detalles Bibliográficos
Autores principales: Makarov, Ilya, Bakhanova, Maria, Nikolenko, Sergey, Gerasimova, Olga
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044223/
https://www.ncbi.nlm.nih.gov/pubmed/35494794
http://dx.doi.org/10.7717/peerj-cs.865
_version_ 1784695058331598848
author Makarov, Ilya
Bakhanova, Maria
Nikolenko, Sergey
Gerasimova, Olga
author_facet Makarov, Ilya
Bakhanova, Maria
Nikolenko, Sergey
Gerasimova, Olga
author_sort Makarov, Ilya
collection PubMed
description Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced to make self-supervised depth prediction more precise. However, most existing approaches still focus on single-frame depth estimation, even in the self-supervised setting. Since most methods can operate with frame sequences, we believe that the quality of current models can be significantly improved with the help of information about previous frames. In this work, we study different ways of integrating recurrent blocks and attention mechanisms into a common self-supervised depth estimation pipeline. We propose a set of modifications that utilize temporal information from previous frames and provide new neural network architectures for monocular depth estimation in a self-supervised manner. Our experiments on the KITTI dataset show that proposed modifications can be an effective tool for exploiting temporal information in a depth prediction pipeline.
format Online
Article
Text
id pubmed-9044223
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-90442232022-04-28 Self-supervised recurrent depth estimation with attention mechanisms Makarov, Ilya Bakhanova, Maria Nikolenko, Sergey Gerasimova, Olga PeerJ Comput Sci Artificial Intelligence Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced to make self-supervised depth prediction more precise. However, most existing approaches still focus on single-frame depth estimation, even in the self-supervised setting. Since most methods can operate with frame sequences, we believe that the quality of current models can be significantly improved with the help of information about previous frames. In this work, we study different ways of integrating recurrent blocks and attention mechanisms into a common self-supervised depth estimation pipeline. We propose a set of modifications that utilize temporal information from previous frames and provide new neural network architectures for monocular depth estimation in a self-supervised manner. Our experiments on the KITTI dataset show that proposed modifications can be an effective tool for exploiting temporal information in a depth prediction pipeline. PeerJ Inc. 2022-01-31 /pmc/articles/PMC9044223/ /pubmed/35494794 http://dx.doi.org/10.7717/peerj-cs.865 Text en ©2022 Makarov et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Makarov, Ilya
Bakhanova, Maria
Nikolenko, Sergey
Gerasimova, Olga
Self-supervised recurrent depth estimation with attention mechanisms
title Self-supervised recurrent depth estimation with attention mechanisms
title_full Self-supervised recurrent depth estimation with attention mechanisms
title_fullStr Self-supervised recurrent depth estimation with attention mechanisms
title_full_unstemmed Self-supervised recurrent depth estimation with attention mechanisms
title_short Self-supervised recurrent depth estimation with attention mechanisms
title_sort self-supervised recurrent depth estimation with attention mechanisms
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044223/
https://www.ncbi.nlm.nih.gov/pubmed/35494794
http://dx.doi.org/10.7717/peerj-cs.865
work_keys_str_mv AT makarovilya selfsupervisedrecurrentdepthestimationwithattentionmechanisms
AT bakhanovamaria selfsupervisedrecurrentdepthestimationwithattentionmechanisms
AT nikolenkosergey selfsupervisedrecurrentdepthestimationwithattentionmechanisms
AT gerasimovaolga selfsupervisedrecurrentdepthestimationwithattentionmechanisms