Cargando…
Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
Accurate and robust detection of multi-class objects in very high resolution (VHR) aerial images has been playing a significant role in many real-world applications. The traditional detection methods have made remarkable progresses with horizontal bounding boxes (HBBs) due to CNNs. However, HBB dete...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146516/ https://www.ncbi.nlm.nih.gov/pubmed/32197365 http://dx.doi.org/10.3390/s20061686 |
_version_ | 1783520221057777664 |
---|---|
author | Yang, Feng Li, Wentong Hu, Haiwei Li, Wanyi Wang, Peng |
author_facet | Yang, Feng Li, Wentong Hu, Haiwei Li, Wanyi Wang, Peng |
author_sort | Yang, Feng |
collection | PubMed |
description | Accurate and robust detection of multi-class objects in very high resolution (VHR) aerial images has been playing a significant role in many real-world applications. The traditional detection methods have made remarkable progresses with horizontal bounding boxes (HBBs) due to CNNs. However, HBB detection methods still exhibit limitations including the missed detection and the redundant detection regions, especially for densely-distributed and strip-like objects. Besides, large scale variations and diverse background also bring in many challenges. Aiming to address these problems, an effective region-based object detection framework named Multi-scale Feature Integration Attention Rotation Network (MFIAR-Net) is proposed for aerial images with oriented bounding boxes (OBBs), which promotes the integration of the inherent multi-scale pyramid features to generate a discriminative feature map. Meanwhile, the double-path feature attention network supervised by the mask information of ground truth is introduced to guide the network to focus on object regions and suppress the irrelevant noise. To boost the rotation regression and classification performance, we present a robust Rotation Detection Network, which can generate efficient OBB representation. Extensive experiments and comprehensive evaluations on two publicly available datasets demonstrate the effectiveness of the proposed framework. |
format | Online Article Text |
id | pubmed-7146516 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-71465162020-04-20 Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images Yang, Feng Li, Wentong Hu, Haiwei Li, Wanyi Wang, Peng Sensors (Basel) Article Accurate and robust detection of multi-class objects in very high resolution (VHR) aerial images has been playing a significant role in many real-world applications. The traditional detection methods have made remarkable progresses with horizontal bounding boxes (HBBs) due to CNNs. However, HBB detection methods still exhibit limitations including the missed detection and the redundant detection regions, especially for densely-distributed and strip-like objects. Besides, large scale variations and diverse background also bring in many challenges. Aiming to address these problems, an effective region-based object detection framework named Multi-scale Feature Integration Attention Rotation Network (MFIAR-Net) is proposed for aerial images with oriented bounding boxes (OBBs), which promotes the integration of the inherent multi-scale pyramid features to generate a discriminative feature map. Meanwhile, the double-path feature attention network supervised by the mask information of ground truth is introduced to guide the network to focus on object regions and suppress the irrelevant noise. To boost the rotation regression and classification performance, we present a robust Rotation Detection Network, which can generate efficient OBB representation. Extensive experiments and comprehensive evaluations on two publicly available datasets demonstrate the effectiveness of the proposed framework. MDPI 2020-03-18 /pmc/articles/PMC7146516/ /pubmed/32197365 http://dx.doi.org/10.3390/s20061686 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Yang, Feng Li, Wentong Hu, Haiwei Li, Wanyi Wang, Peng Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images |
title | Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images |
title_full | Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images |
title_fullStr | Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images |
title_full_unstemmed | Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images |
title_short | Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images |
title_sort | multi-scale feature integrated attention-based rotation network for object detection in vhr aerial images |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146516/ https://www.ncbi.nlm.nih.gov/pubmed/32197365 http://dx.doi.org/10.3390/s20061686 |
work_keys_str_mv | AT yangfeng multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages AT liwentong multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages AT huhaiwei multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages AT liwanyi multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages AT wangpeng multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages |