Cargando…
Self-supervised recurrent depth estimation with attention mechanisms
Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not requi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044223/ https://www.ncbi.nlm.nih.gov/pubmed/35494794 http://dx.doi.org/10.7717/peerj-cs.865 |
_version_ | 1784695058331598848 |
---|---|
author | Makarov, Ilya Bakhanova, Maria Nikolenko, Sergey Gerasimova, Olga |
author_facet | Makarov, Ilya Bakhanova, Maria Nikolenko, Sergey Gerasimova, Olga |
author_sort | Makarov, Ilya |
collection | PubMed |
description | Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced to make self-supervised depth prediction more precise. However, most existing approaches still focus on single-frame depth estimation, even in the self-supervised setting. Since most methods can operate with frame sequences, we believe that the quality of current models can be significantly improved with the help of information about previous frames. In this work, we study different ways of integrating recurrent blocks and attention mechanisms into a common self-supervised depth estimation pipeline. We propose a set of modifications that utilize temporal information from previous frames and provide new neural network architectures for monocular depth estimation in a self-supervised manner. Our experiments on the KITTI dataset show that proposed modifications can be an effective tool for exploiting temporal information in a depth prediction pipeline. |
format | Online Article Text |
id | pubmed-9044223 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90442232022-04-28 Self-supervised recurrent depth estimation with attention mechanisms Makarov, Ilya Bakhanova, Maria Nikolenko, Sergey Gerasimova, Olga PeerJ Comput Sci Artificial Intelligence Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced to make self-supervised depth prediction more precise. However, most existing approaches still focus on single-frame depth estimation, even in the self-supervised setting. Since most methods can operate with frame sequences, we believe that the quality of current models can be significantly improved with the help of information about previous frames. In this work, we study different ways of integrating recurrent blocks and attention mechanisms into a common self-supervised depth estimation pipeline. We propose a set of modifications that utilize temporal information from previous frames and provide new neural network architectures for monocular depth estimation in a self-supervised manner. Our experiments on the KITTI dataset show that proposed modifications can be an effective tool for exploiting temporal information in a depth prediction pipeline. PeerJ Inc. 2022-01-31 /pmc/articles/PMC9044223/ /pubmed/35494794 http://dx.doi.org/10.7717/peerj-cs.865 Text en ©2022 Makarov et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Artificial Intelligence Makarov, Ilya Bakhanova, Maria Nikolenko, Sergey Gerasimova, Olga Self-supervised recurrent depth estimation with attention mechanisms |
title | Self-supervised recurrent depth estimation with attention mechanisms |
title_full | Self-supervised recurrent depth estimation with attention mechanisms |
title_fullStr | Self-supervised recurrent depth estimation with attention mechanisms |
title_full_unstemmed | Self-supervised recurrent depth estimation with attention mechanisms |
title_short | Self-supervised recurrent depth estimation with attention mechanisms |
title_sort | self-supervised recurrent depth estimation with attention mechanisms |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044223/ https://www.ncbi.nlm.nih.gov/pubmed/35494794 http://dx.doi.org/10.7717/peerj-cs.865 |
work_keys_str_mv | AT makarovilya selfsupervisedrecurrentdepthestimationwithattentionmechanisms AT bakhanovamaria selfsupervisedrecurrentdepthestimationwithattentionmechanisms AT nikolenkosergey selfsupervisedrecurrentdepthestimationwithattentionmechanisms AT gerasimovaolga selfsupervisedrecurrentdepthestimationwithattentionmechanisms |