Cargando…

A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences

Recently, deep convolutional neural networks (CNN) have become popular for indoor visual localisation, where the networks learn to regress the camera pose from images directly. However, these approaches perform a 3D image-based reconstruction of the indoor spaces beforehand to determine camera poses...

Descripción completa

Detalles Bibliográficos
Autores principales: Acharya, Debaditya, Singha Roy, Sesa, Khoshelham, Kourosh, Winter, Stephan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582800/
https://www.ncbi.nlm.nih.gov/pubmed/32992742
http://dx.doi.org/10.3390/s20195492
_version_ 1783599274520477696
author Acharya, Debaditya
Singha Roy, Sesa
Khoshelham, Kourosh
Winter, Stephan
author_facet Acharya, Debaditya
Singha Roy, Sesa
Khoshelham, Kourosh
Winter, Stephan
author_sort Acharya, Debaditya
collection PubMed
description Recently, deep convolutional neural networks (CNN) have become popular for indoor visual localisation, where the networks learn to regress the camera pose from images directly. However, these approaches perform a 3D image-based reconstruction of the indoor spaces beforehand to determine camera poses, which is a challenge for large indoor spaces. Synthetic images derived from 3D indoor models have been used to eliminate the requirement of 3D reconstruction. A limitation of the approach is the low accuracy that occurs as a result of estimating the pose of each image frame independently. In this article, a visual localisation approach is proposed that exploits the spatio-temporal information from synthetic image sequences to improve localisation accuracy. A deep Bayesian recurrent CNN is fine-tuned using synthetic image sequences obtained from a building information model (BIM) to regress the pose of real image sequences. The results of the experiments indicate that the proposed approach estimates a smoother trajectory with smaller inter-frame error as compared to existing methods. The achievable accuracy with the proposed approach is 1.6 m, which is an improvement of approximately thirty per cent compared to the existing approaches. A Keras implementation can be found in our Github repository.
format Online
Article
Text
id pubmed-7582800
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75828002020-10-28 A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences Acharya, Debaditya Singha Roy, Sesa Khoshelham, Kourosh Winter, Stephan Sensors (Basel) Article Recently, deep convolutional neural networks (CNN) have become popular for indoor visual localisation, where the networks learn to regress the camera pose from images directly. However, these approaches perform a 3D image-based reconstruction of the indoor spaces beforehand to determine camera poses, which is a challenge for large indoor spaces. Synthetic images derived from 3D indoor models have been used to eliminate the requirement of 3D reconstruction. A limitation of the approach is the low accuracy that occurs as a result of estimating the pose of each image frame independently. In this article, a visual localisation approach is proposed that exploits the spatio-temporal information from synthetic image sequences to improve localisation accuracy. A deep Bayesian recurrent CNN is fine-tuned using synthetic image sequences obtained from a building information model (BIM) to regress the pose of real image sequences. The results of the experiments indicate that the proposed approach estimates a smoother trajectory with smaller inter-frame error as compared to existing methods. The achievable accuracy with the proposed approach is 1.6 m, which is an improvement of approximately thirty per cent compared to the existing approaches. A Keras implementation can be found in our Github repository. MDPI 2020-09-25 /pmc/articles/PMC7582800/ /pubmed/32992742 http://dx.doi.org/10.3390/s20195492 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Acharya, Debaditya
Singha Roy, Sesa
Khoshelham, Kourosh
Winter, Stephan
A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences
title A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences
title_full A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences
title_fullStr A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences
title_full_unstemmed A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences
title_short A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences
title_sort recurrent deep network for estimating the pose of real indoor images from synthetic image sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582800/
https://www.ncbi.nlm.nih.gov/pubmed/32992742
http://dx.doi.org/10.3390/s20195492
work_keys_str_mv AT acharyadebaditya arecurrentdeepnetworkforestimatingtheposeofrealindoorimagesfromsyntheticimagesequences
AT singharoysesa arecurrentdeepnetworkforestimatingtheposeofrealindoorimagesfromsyntheticimagesequences
AT khoshelhamkourosh arecurrentdeepnetworkforestimatingtheposeofrealindoorimagesfromsyntheticimagesequences
AT winterstephan arecurrentdeepnetworkforestimatingtheposeofrealindoorimagesfromsyntheticimagesequences
AT acharyadebaditya recurrentdeepnetworkforestimatingtheposeofrealindoorimagesfromsyntheticimagesequences
AT singharoysesa recurrentdeepnetworkforestimatingtheposeofrealindoorimagesfromsyntheticimagesequences
AT khoshelhamkourosh recurrentdeepnetworkforestimatingtheposeofrealindoorimagesfromsyntheticimagesequences
AT winterstephan recurrentdeepnetworkforestimatingtheposeofrealindoorimagesfromsyntheticimagesequences