Cargando…
DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images
With the rapid development of deep learning and the wide usage of Unmanned Aerial Vehicles (UAVs), CNN-based algorithms of vehicle detection in aerial images have been widely studied in the past several years. As a downstream task of the general object detection, there are some differences between t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8642025/ https://www.ncbi.nlm.nih.gov/pubmed/34868295 http://dx.doi.org/10.1155/2021/6340823 |
_version_ | 1784609605376016384 |
---|---|
author | Li, Kaifeng Wang, Bin |
author_facet | Li, Kaifeng Wang, Bin |
author_sort | Li, Kaifeng |
collection | PubMed |
description | With the rapid development of deep learning and the wide usage of Unmanned Aerial Vehicles (UAVs), CNN-based algorithms of vehicle detection in aerial images have been widely studied in the past several years. As a downstream task of the general object detection, there are some differences between the vehicle detection in aerial images and the general object detection in ground view images, e.g., larger image areas, smaller target sizes, and more complex background. In this paper, to improve the performance of this task, a Dense Attentional Residual Network (DAR-Net) is proposed. The proposed network employs a novel dense waterfall residual block (DW res-block) to effectively preserve the spatial information and extract high-level semantic information at the same time. A multiscale receptive field attention (MRFA) module is also designed to select the informative feature from the feature maps and enhance the ability of multiscale perception. Based on the DW res-block and MRFA module, to protect the spatial information, the proposed framework adopts a new backbone that only downsamples the feature map 3 times; i.e., the total downsampling ratio of the proposed backbone is 8. These designs could alleviate the degradation problem, improve the information flow, and strengthen the feature reuse. In addition, deep-projection units are used to reduce the impact of information loss caused by downsampling operations, and the identity mapping is applied to each stage of the proposed backbone to further improve the information flow. The proposed DAR-Net is evaluated on VEDAI, UCAS-AOD, and DOTA datasets. The experimental results demonstrate that the proposed framework outperforms other state-of-the-art algorithms. |
format | Online Article Text |
id | pubmed-8642025 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-86420252021-12-04 DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images Li, Kaifeng Wang, Bin Comput Intell Neurosci Research Article With the rapid development of deep learning and the wide usage of Unmanned Aerial Vehicles (UAVs), CNN-based algorithms of vehicle detection in aerial images have been widely studied in the past several years. As a downstream task of the general object detection, there are some differences between the vehicle detection in aerial images and the general object detection in ground view images, e.g., larger image areas, smaller target sizes, and more complex background. In this paper, to improve the performance of this task, a Dense Attentional Residual Network (DAR-Net) is proposed. The proposed network employs a novel dense waterfall residual block (DW res-block) to effectively preserve the spatial information and extract high-level semantic information at the same time. A multiscale receptive field attention (MRFA) module is also designed to select the informative feature from the feature maps and enhance the ability of multiscale perception. Based on the DW res-block and MRFA module, to protect the spatial information, the proposed framework adopts a new backbone that only downsamples the feature map 3 times; i.e., the total downsampling ratio of the proposed backbone is 8. These designs could alleviate the degradation problem, improve the information flow, and strengthen the feature reuse. In addition, deep-projection units are used to reduce the impact of information loss caused by downsampling operations, and the identity mapping is applied to each stage of the proposed backbone to further improve the information flow. The proposed DAR-Net is evaluated on VEDAI, UCAS-AOD, and DOTA datasets. The experimental results demonstrate that the proposed framework outperforms other state-of-the-art algorithms. Hindawi 2021-11-26 /pmc/articles/PMC8642025/ /pubmed/34868295 http://dx.doi.org/10.1155/2021/6340823 Text en Copyright © 2021 Kaifeng Li and Bin Wang. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Li, Kaifeng Wang, Bin DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images |
title | DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images |
title_full | DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images |
title_fullStr | DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images |
title_full_unstemmed | DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images |
title_short | DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images |
title_sort | dar-net: dense attentional residual network for vehicle detection in aerial images |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8642025/ https://www.ncbi.nlm.nih.gov/pubmed/34868295 http://dx.doi.org/10.1155/2021/6340823 |
work_keys_str_mv | AT likaifeng darnetdenseattentionalresidualnetworkforvehicledetectioninaerialimages AT wangbin darnetdenseattentionalresidualnetworkforvehicledetectioninaerialimages |