
Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection

Downsampling input images is a simple trick to speed up visual object-detection algorithms, especially on robotic vision and applied mobile vision systems. However, this trick comes with a significant decline in accuracy. In this paper, dual-resolution dual-path Convolutional Neural Networks (CNNs),...


Bibliographic Details
Main Authors:	Pan, Jing; Sun, Hanqing; Song, Zhanjie; Han, Jungong
Format:	Online Article Text
Language:	English
Published:	MDPI 2019
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6679249/ https://www.ncbi.nlm.nih.gov/pubmed/31337121 http://dx.doi.org/10.3390/s19143111

_version_	1783441294953021440
author	Pan, Jing; Sun, Hanqing; Song, Zhanjie; Han, Jungong
author_facet	Pan, Jing; Sun, Hanqing; Song, Zhanjie; Han, Jungong
author_sort	Pan, Jing
collection	PubMed
description	Downsampling input images is a simple trick to speed up visual object-detection algorithms, especially on robotic vision and applied mobile vision systems. However, this trick comes with a significant decline in accuracy. In this paper, dual-resolution dual-path Convolutional Neural Networks (CNNs), named DualNets, are proposed to bump up the accuracy of those detection applications. In contrast to previous methods that simply downsample the input images, DualNets explicitly take dual inputs in different resolutions and extract complementary visual features from these using dual CNN paths. The two paths in a DualNet are a backbone path and an auxiliary path that accepts larger inputs and then rapidly downsamples them to relatively small feature maps. With the help of the carefully designed auxiliary CNN paths in DualNets, auxiliary features are extracted from the larger input with controllable computation. Auxiliary features are then fused with the backbone features using a proposed progressive residual fusion strategy to enrich feature representation. This architecture, as the feature extractor, is further integrated with the Single Shot Detector (SSD) to accomplish latency-sensitive visual object-detection tasks. We evaluate the resulting detection pipeline on Pascal VOC and MS COCO benchmarks. Results show that the proposed DualNets can raise the accuracy of those CNN detection applications that are sensitive to computation payloads.
format	Online Article Text
id	pubmed-6679249
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-6679249 2019-08-19 Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection Pan, Jing; Sun, Hanqing; Song, Zhanjie; Han, Jungong Sensors (Basel) Article MDPI 2019-07-14 /pmc/articles/PMC6679249/ /pubmed/31337121 http://dx.doi.org/10.3390/s19143111 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Pan, Jing Sun, Hanqing Song, Zhanjie Han, Jungong Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection
title	Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection
title_full	Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection
title_fullStr	Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection
title_full_unstemmed	Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection
title_short	Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection
title_sort	dual-resolution dual-path convolutional neural networks for fast object detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6679249/ https://www.ncbi.nlm.nih.gov/pubmed/31337121 http://dx.doi.org/10.3390/s19143111
work_keys_str_mv	AT panjing dualresolutiondualpathconvolutionalneuralnetworksforfastobjectdetection AT sunhanqing dualresolutiondualpathconvolutionalneuralnetworksforfastobjectdetection AT songzhanjie dualresolutiondualpathconvolutionalneuralnetworksforfastobjectdetection AT hanjungong dualresolutiondualpathconvolutionalneuralnetworksforfastobjectdetection
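
The abstract describes two paths, a backbone path fed with a downsampled input and an auxiliary path that takes the larger input and rapidly downsamples it, whose features are then fused residually. The toy sketch below illustrates only that data flow in plain Python. It is not the authors' DualNet: the pooling operations stand in for convolutional stages, the fusion is a single element-wise addition rather than the progressive multi-stage strategy the paper proposes, and all function names and parameters (`pool`, `residual_fuse`, `dual_path_features`, `scale`, `stride`) are hypothetical.

```python
def pool(grid, k, reduce_fn):
    """Downsample a 2D list by applying reduce_fn to non-overlapping k x k blocks."""
    height, width = len(grid), len(grid[0])
    return [
        [
            reduce_fn([grid[i + di][j + dj] for di in range(k) for dj in range(k)])
            for j in range(0, width - k + 1, k)
        ]
        for i in range(0, height - k + 1, k)
    ]


def mean(values):
    return sum(values) / len(values)


def residual_fuse(backbone, auxiliary):
    """Element-wise addition of two feature maps of identical shape."""
    if len(backbone) != len(auxiliary) or len(backbone[0]) != len(auxiliary[0]):
        raise ValueError("feature maps must have the same shape")
    return [[b + a for b, a in zip(row_b, row_a)]
            for row_b, row_a in zip(backbone, auxiliary)]


def dual_path_features(large_input, scale=2, stride=2):
    """Toy dual-resolution, dual-path feature extractor (assumed structure)."""
    # Backbone path: sees only the downsampled (low-resolution) input.
    small_input = pool(large_input, scale, mean)
    backbone = pool(small_input, stride, mean)  # stand-in for backbone layers
    # Auxiliary path: takes the large input and downsamples it in one step
    # so that its output matches the backbone feature-map size.
    auxiliary = pool(large_input, scale * stride, max)  # stand-in for aux layers
    return residual_fuse(backbone, auxiliary)


if __name__ == "__main__":
    image = [[4 * r + c for c in range(4)] for r in range(4)]  # 4 x 4 ramp
    print(dual_path_features(image))
```

Max pooling is used in the auxiliary stand-in only so that the two paths produce different values in this toy; in a real network both paths would be learned convolutional layers.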
