Grid Based Spherical CNN for Object Detection from Panoramic Images

Recently proposed spherical convolutional neural networks (SCNNs) have shown advantages over conventional planar CNNs on classifying spherical images. However, two factors hamper their application in an object detection task. First, a convolution in S2 (a two-dimensional sphere in three-dimension...

Bibliographic Details
Main Authors:	Yu, Dawen; Ji, Shunping
Format:	Online Article Text
Language:	English
Published:	MDPI 2019
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6603645/ https://www.ncbi.nlm.nih.gov/pubmed/31181854 http://dx.doi.org/10.3390/s19112622

_version_	1783431553012989952
author	Yu, Dawen Ji, Shunping
author_facet	Yu, Dawen Ji, Shunping
author_sort	Yu, Dawen
collection	PubMed
description	Recently proposed spherical convolutional neural networks (SCNNs) have shown advantages over conventional planar CNNs on classifying spherical images. However, two factors hamper their application in an object detection task. First, a convolution in S2 (a two-dimensional sphere in three-dimensional space) or SO(3) (three-dimensional special orthogonal group) space results in the loss of an object’s location. Second, overlarge bandwidth is required to preserve a small object’s information on a sphere because the S2/SO(3) convolution must be performed on the whole sphere, instead of a local image patch. In this study, we propose a novel grid-based spherical CNN (G-SCNN) for detecting objects from spherical images. According to input bandwidth, a sphere image is transformed to a conformal grid map to be the input of the S2/SO(3) convolution, and an object’s bounding box is scaled to cover an adequate area of the grid map. This solves the second problem. For the first problem, we utilize a planar region proposal network (RPN) with a data augmentation strategy that increases rotation invariance. We have also created a dataset including 600 street view panoramic images captured from a vehicle-borne panoramic camera. The dataset contains 5636 objects of interest annotated with class and bounding box and is named the WHU (Wuhan University) panoramic dataset. Results on the dataset showed that our grid-based method is far better than the original SCNN in detecting objects from spherical images, and it outperformed several mainstream object detection networks, such as Faster R-CNN and SSD.
format	Online Article Text
id	pubmed-6603645
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-66036452019-07-17 Grid Based Spherical CNN for Object Detection from Panoramic Images Yu, Dawen Ji, Shunping Sensors (Basel) Article Recently proposed spherical convolutional neural networks (SCNNs) have shown advantages over conventional planar CNNs on classifying spherical images. However, two factors hamper their application in an object detection task. First, a convolution in S2 (a two-dimensional sphere in three-dimensional space) or SO(3) (three-dimensional special orthogonal group) space results in the loss of an object’s location. Second, overlarge bandwidth is required to preserve a small object’s information on a sphere because the S2/SO(3) convolution must be performed on the whole sphere, instead of a local image patch. In this study, we propose a novel grid-based spherical CNN (G-SCNN) for detecting objects from spherical images. According to input bandwidth, a sphere image is transformed to a conformal grid map to be the input of the S2/SO(3) convolution, and an object’s bounding box is scaled to cover an adequate area of the grid map. This solves the second problem. For the first problem, we utilize a planar region proposal network (RPN) with a data augmentation strategy that increases rotation invariance. We have also created a dataset including 600 street view panoramic images captured from a vehicle-borne panoramic camera. The dataset contains 5636 objects of interest annotated with class and bounding box and is named the WHU (Wuhan University) panoramic dataset. Results on the dataset showed that our grid-based method is far better than the original SCNN in detecting objects from spherical images, and it outperformed several mainstream object detection networks, such as Faster R-CNN and SSD. MDPI 2019-06-09 /pmc/articles/PMC6603645/ /pubmed/31181854 http://dx.doi.org/10.3390/s19112622 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. 
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Yu, Dawen Ji, Shunping Grid Based Spherical CNN for Object Detection from Panoramic Images
title	Grid Based Spherical CNN for Object Detection from Panoramic Images
title_full	Grid Based Spherical CNN for Object Detection from Panoramic Images
title_fullStr	Grid Based Spherical CNN for Object Detection from Panoramic Images
title_full_unstemmed	Grid Based Spherical CNN for Object Detection from Panoramic Images
title_short	Grid Based Spherical CNN for Object Detection from Panoramic Images
title_sort	grid based spherical cnn for object detection from panoramic images
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6603645/ https://www.ncbi.nlm.nih.gov/pubmed/31181854 http://dx.doi.org/10.3390/s19112622
work_keys_str_mv	AT yudawen gridbasedsphericalcnnforobjectdetectionfrompanoramicimages AT jishunping gridbasedsphericalcnnforobjectdetectionfrompanoramicimages
Similar Items