Cargando…

Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images

Accurate and robust detection of multi-class objects in very high resolution (VHR) aerial images has been playing a significant role in many real-world applications. The traditional detection methods have made remarkable progresses with horizontal bounding boxes (HBBs) due to CNNs. However, HBB dete...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Feng, Li, Wentong, Hu, Haiwei, Li, Wanyi, Wang, Peng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146516/
https://www.ncbi.nlm.nih.gov/pubmed/32197365
http://dx.doi.org/10.3390/s20061686
_version_ 1783520221057777664
author Yang, Feng
Li, Wentong
Hu, Haiwei
Li, Wanyi
Wang, Peng
author_facet Yang, Feng
Li, Wentong
Hu, Haiwei
Li, Wanyi
Wang, Peng
author_sort Yang, Feng
collection PubMed
description Accurate and robust detection of multi-class objects in very high resolution (VHR) aerial images has been playing a significant role in many real-world applications. The traditional detection methods have made remarkable progresses with horizontal bounding boxes (HBBs) due to CNNs. However, HBB detection methods still exhibit limitations including the missed detection and the redundant detection regions, especially for densely-distributed and strip-like objects. Besides, large scale variations and diverse background also bring in many challenges. Aiming to address these problems, an effective region-based object detection framework named Multi-scale Feature Integration Attention Rotation Network (MFIAR-Net) is proposed for aerial images with oriented bounding boxes (OBBs), which promotes the integration of the inherent multi-scale pyramid features to generate a discriminative feature map. Meanwhile, the double-path feature attention network supervised by the mask information of ground truth is introduced to guide the network to focus on object regions and suppress the irrelevant noise. To boost the rotation regression and classification performance, we present a robust Rotation Detection Network, which can generate efficient OBB representation. Extensive experiments and comprehensive evaluations on two publicly available datasets demonstrate the effectiveness of the proposed framework.
format Online
Article
Text
id pubmed-7146516
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71465162020-04-20 Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images Yang, Feng Li, Wentong Hu, Haiwei Li, Wanyi Wang, Peng Sensors (Basel) Article Accurate and robust detection of multi-class objects in very high resolution (VHR) aerial images has been playing a significant role in many real-world applications. The traditional detection methods have made remarkable progresses with horizontal bounding boxes (HBBs) due to CNNs. However, HBB detection methods still exhibit limitations including the missed detection and the redundant detection regions, especially for densely-distributed and strip-like objects. Besides, large scale variations and diverse background also bring in many challenges. Aiming to address these problems, an effective region-based object detection framework named Multi-scale Feature Integration Attention Rotation Network (MFIAR-Net) is proposed for aerial images with oriented bounding boxes (OBBs), which promotes the integration of the inherent multi-scale pyramid features to generate a discriminative feature map. Meanwhile, the double-path feature attention network supervised by the mask information of ground truth is introduced to guide the network to focus on object regions and suppress the irrelevant noise. To boost the rotation regression and classification performance, we present a robust Rotation Detection Network, which can generate efficient OBB representation. Extensive experiments and comprehensive evaluations on two publicly available datasets demonstrate the effectiveness of the proposed framework. MDPI 2020-03-18 /pmc/articles/PMC7146516/ /pubmed/32197365 http://dx.doi.org/10.3390/s20061686 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yang, Feng
Li, Wentong
Hu, Haiwei
Li, Wanyi
Wang, Peng
Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
title Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
title_full Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
title_fullStr Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
title_full_unstemmed Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
title_short Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
title_sort multi-scale feature integrated attention-based rotation network for object detection in vhr aerial images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146516/
https://www.ncbi.nlm.nih.gov/pubmed/32197365
http://dx.doi.org/10.3390/s20061686
work_keys_str_mv AT yangfeng multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages
AT liwentong multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages
AT huhaiwei multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages
AT liwanyi multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages
AT wangpeng multiscalefeatureintegratedattentionbasedrotationnetworkforobjectdetectioninvhraerialimages