Cargando…
Efficient and Scalable Object Localization in 3D on Mobile Device
Two-Dimensional (2D) object detection has been an intensely discussed and researched field of computer vision. With numerous advancements made in the field over the years, we still need to identify a robust approach to efficiently conduct classification and localization of objects in our environment...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9323171/ https://www.ncbi.nlm.nih.gov/pubmed/35877632 http://dx.doi.org/10.3390/jimaging8070188 |
_version_ | 1784756485005246464 |
---|---|
author | Gupta, Neetika Khan, Naimul Mefraz |
author_facet | Gupta, Neetika Khan, Naimul Mefraz |
author_sort | Gupta, Neetika |
collection | PubMed |
description | Two-Dimensional (2D) object detection has been an intensely discussed and researched field of computer vision. With numerous advancements made in the field over the years, we still need to identify a robust approach to efficiently conduct classification and localization of objects in our environment by just using our mobile devices. Moreover, 2D object detection limits the overall understanding of the detected object and does not provide any additional information in terms of its size and position in the real world. This work proposes an object localization solution in Three-Dimension (3D) for mobile devices using a novel approach. The proposed method works by combining a 2D object detection Convolutional Neural Network (CNN) model with Augmented Reality (AR) technologies to recognize objects in the environment and determine their real-world coordinates. We leverage the in-built Simultaneous Localization and Mapping (SLAM) capability of Google’s ARCore to detect planes and know the camera information for generating cuboid proposals from an object’s 2D bounding box. The proposed method is fast and efficient for identifying everyday objects in real-world space and, unlike mobile offloading techniques, the method is well designed to work with limited resources of a mobile device. |
format | Online Article Text |
id | pubmed-9323171 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-93231712022-07-27 Efficient and Scalable Object Localization in 3D on Mobile Device Gupta, Neetika Khan, Naimul Mefraz J Imaging Article Two-Dimensional (2D) object detection has been an intensely discussed and researched field of computer vision. With numerous advancements made in the field over the years, we still need to identify a robust approach to efficiently conduct classification and localization of objects in our environment by just using our mobile devices. Moreover, 2D object detection limits the overall understanding of the detected object and does not provide any additional information in terms of its size and position in the real world. This work proposes an object localization solution in Three-Dimension (3D) for mobile devices using a novel approach. The proposed method works by combining a 2D object detection Convolutional Neural Network (CNN) model with Augmented Reality (AR) technologies to recognize objects in the environment and determine their real-world coordinates. We leverage the in-built Simultaneous Localization and Mapping (SLAM) capability of Google’s ARCore to detect planes and know the camera information for generating cuboid proposals from an object’s 2D bounding box. The proposed method is fast and efficient for identifying everyday objects in real-world space and, unlike mobile offloading techniques, the method is well designed to work with limited resources of a mobile device. MDPI 2022-07-08 /pmc/articles/PMC9323171/ /pubmed/35877632 http://dx.doi.org/10.3390/jimaging8070188 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Gupta, Neetika Khan, Naimul Mefraz Efficient and Scalable Object Localization in 3D on Mobile Device |
title | Efficient and Scalable Object Localization in 3D on Mobile Device |
title_full | Efficient and Scalable Object Localization in 3D on Mobile Device |
title_fullStr | Efficient and Scalable Object Localization in 3D on Mobile Device |
title_full_unstemmed | Efficient and Scalable Object Localization in 3D on Mobile Device |
title_short | Efficient and Scalable Object Localization in 3D on Mobile Device |
title_sort | efficient and scalable object localization in 3d on mobile device |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9323171/ https://www.ncbi.nlm.nih.gov/pubmed/35877632 http://dx.doi.org/10.3390/jimaging8070188 |
work_keys_str_mv | AT guptaneetika efficientandscalableobjectlocalizationin3donmobiledevice AT khannaimulmefraz efficientandscalableobjectlocalizationin3donmobiledevice |