Cargando…

High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network

Semantic segmentation of high-resolution aerial images is of great importance in certain fields, but the increasing spatial resolution brings large intra-class variance and small inter-class differences that can lead to classification ambiguities. Based on high-level contextual features, the deep co...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Xuran, Gao, Lianru, Zhang, Bing, Yang, Fan, Liao, Wenzhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6263496/
https://www.ncbi.nlm.nih.gov/pubmed/30400591
http://dx.doi.org/10.3390/s18113774
_version_ 1783375306786078720
author Pan, Xuran
Gao, Lianru
Zhang, Bing
Yang, Fan
Liao, Wenzhi
author_facet Pan, Xuran
Gao, Lianru
Zhang, Bing
Yang, Fan
Liao, Wenzhi
author_sort Pan, Xuran
collection PubMed
description Semantic segmentation of high-resolution aerial images is of great importance in certain fields, but the increasing spatial resolution brings large intra-class variance and small inter-class differences that can lead to classification ambiguities. Based on high-level contextual features, the deep convolutional neural network (DCNN) is an effective method to deal with semantic segmentation of high-resolution aerial imagery. In this work, a novel dense pyramid network (DPN) is proposed for semantic segmentation. The network starts with group convolutions to deal with multi-sensor data in channel wise to extract feature maps of each channel separately; by doing so, more information from each channel can be preserved. This process is followed by the channel shuffle operation to enhance the representation ability of the network. Then, four densely connected convolutional blocks are utilized to both extract and take full advantage of features. The pyramid pooling module combined with two convolutional layers are set to fuse multi-resolution and multi-sensor features through an effective global scenery prior manner, producing the probability graph for each class. Moreover, the median frequency balanced focal loss is proposed to replace the standard cross entropy loss in the training phase to deal with the class imbalance problem. We evaluate the dense pyramid network on the International Society for Photogrammetry and Remote Sensing (ISPRS) Vaihingen and Potsdam 2D semantic labeling dataset, and the results demonstrate that the proposed framework exhibits better performances, compared to the state of the art baseline.
format Online
Article
Text
id pubmed-6263496
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-62634962018-12-12 High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network Pan, Xuran Gao, Lianru Zhang, Bing Yang, Fan Liao, Wenzhi Sensors (Basel) Article Semantic segmentation of high-resolution aerial images is of great importance in certain fields, but the increasing spatial resolution brings large intra-class variance and small inter-class differences that can lead to classification ambiguities. Based on high-level contextual features, the deep convolutional neural network (DCNN) is an effective method to deal with semantic segmentation of high-resolution aerial imagery. In this work, a novel dense pyramid network (DPN) is proposed for semantic segmentation. The network starts with group convolutions to deal with multi-sensor data in channel wise to extract feature maps of each channel separately; by doing so, more information from each channel can be preserved. This process is followed by the channel shuffle operation to enhance the representation ability of the network. Then, four densely connected convolutional blocks are utilized to both extract and take full advantage of features. The pyramid pooling module combined with two convolutional layers are set to fuse multi-resolution and multi-sensor features through an effective global scenery prior manner, producing the probability graph for each class. Moreover, the median frequency balanced focal loss is proposed to replace the standard cross entropy loss in the training phase to deal with the class imbalance problem. We evaluate the dense pyramid network on the International Society for Photogrammetry and Remote Sensing (ISPRS) Vaihingen and Potsdam 2D semantic labeling dataset, and the results demonstrate that the proposed framework exhibits better performances, compared to the state of the art baseline. MDPI 2018-11-05 /pmc/articles/PMC6263496/ /pubmed/30400591 http://dx.doi.org/10.3390/s18113774 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pan, Xuran
Gao, Lianru
Zhang, Bing
Yang, Fan
Liao, Wenzhi
High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network
title High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network
title_full High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network
title_fullStr High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network
title_full_unstemmed High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network
title_short High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network
title_sort high-resolution aerial imagery semantic labeling with dense pyramid network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6263496/
https://www.ncbi.nlm.nih.gov/pubmed/30400591
http://dx.doi.org/10.3390/s18113774
work_keys_str_mv AT panxuran highresolutionaerialimagerysemanticlabelingwithdensepyramidnetwork
AT gaolianru highresolutionaerialimagerysemanticlabelingwithdensepyramidnetwork
AT zhangbing highresolutionaerialimagerysemanticlabelingwithdensepyramidnetwork
AT yangfan highresolutionaerialimagerysemanticlabelingwithdensepyramidnetwork
AT liaowenzhi highresolutionaerialimagerysemanticlabelingwithdensepyramidnetwork