Cargando…

A feature fusion deep-projection convolution neural network for vehicle detection in aerial images

With the rapid development of Unmanned Aerial Vehicles, vehicle detection in aerial images plays an important role in different applications. Comparing with general object detection problems, vehicle detection in aerial images is still a challenging research topic since it is plagued by various uniq...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Bin, Xu, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8104367/
https://www.ncbi.nlm.nih.gov/pubmed/33961655
http://dx.doi.org/10.1371/journal.pone.0250782
_version_ 1783689479940210688
author Wang, Bin
Xu, Bin
author_facet Wang, Bin
Xu, Bin
author_sort Wang, Bin
collection PubMed
description With the rapid development of Unmanned Aerial Vehicles, vehicle detection in aerial images plays an important role in different applications. Comparing with general object detection problems, vehicle detection in aerial images is still a challenging research topic since it is plagued by various unique factors, e.g. different camera angle, small vehicle size and complex background. In this paper, a Feature Fusion Deep-Projection Convolution Neural Network is proposed to enhance the ability to detect small vehicles in aerial images. The backbone of the proposed framework utilizes a novel residual block named stepwise res-block to explore high-level semantic features as well as conserve low-level detail features at the same time. A specially designed feature fusion module is adopted in the proposed framework to further balance the features obtained from different levels of the backbone. A deep-projection deconvolution module is used to minimize the impact of the information contamination introduced by down-sampling/up-sampling processes. The proposed framework has been evaluated by UCAS-AOD, VEDAI, and DOTA datasets. According to the evaluation results, the proposed framework outperforms other state-of-the-art vehicle detection algorithms for aerial images.
format Online
Article
Text
id pubmed-8104367
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-81043672021-05-18 A feature fusion deep-projection convolution neural network for vehicle detection in aerial images Wang, Bin Xu, Bin PLoS One Research Article With the rapid development of Unmanned Aerial Vehicles, vehicle detection in aerial images plays an important role in different applications. Comparing with general object detection problems, vehicle detection in aerial images is still a challenging research topic since it is plagued by various unique factors, e.g. different camera angle, small vehicle size and complex background. In this paper, a Feature Fusion Deep-Projection Convolution Neural Network is proposed to enhance the ability to detect small vehicles in aerial images. The backbone of the proposed framework utilizes a novel residual block named stepwise res-block to explore high-level semantic features as well as conserve low-level detail features at the same time. A specially designed feature fusion module is adopted in the proposed framework to further balance the features obtained from different levels of the backbone. A deep-projection deconvolution module is used to minimize the impact of the information contamination introduced by down-sampling/up-sampling processes. The proposed framework has been evaluated by UCAS-AOD, VEDAI, and DOTA datasets. According to the evaluation results, the proposed framework outperforms other state-of-the-art vehicle detection algorithms for aerial images. Public Library of Science 2021-05-07 /pmc/articles/PMC8104367/ /pubmed/33961655 http://dx.doi.org/10.1371/journal.pone.0250782 Text en © 2021 Wang, Xu https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Bin
Xu, Bin
A feature fusion deep-projection convolution neural network for vehicle detection in aerial images
title A feature fusion deep-projection convolution neural network for vehicle detection in aerial images
title_full A feature fusion deep-projection convolution neural network for vehicle detection in aerial images
title_fullStr A feature fusion deep-projection convolution neural network for vehicle detection in aerial images
title_full_unstemmed A feature fusion deep-projection convolution neural network for vehicle detection in aerial images
title_short A feature fusion deep-projection convolution neural network for vehicle detection in aerial images
title_sort feature fusion deep-projection convolution neural network for vehicle detection in aerial images
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8104367/
https://www.ncbi.nlm.nih.gov/pubmed/33961655
http://dx.doi.org/10.1371/journal.pone.0250782
work_keys_str_mv AT wangbin afeaturefusiondeepprojectionconvolutionneuralnetworkforvehicledetectioninaerialimages
AT xubin afeaturefusiondeepprojectionconvolutionneuralnetworkforvehicledetectioninaerialimages
AT wangbin featurefusiondeepprojectionconvolutionneuralnetworkforvehicledetectioninaerialimages
AT xubin featurefusiondeepprojectionconvolutionneuralnetworkforvehicledetectioninaerialimages