Cargando…

EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation

This paper proposes an encoding–decoding network with a pyramidal representation module, which will be referred to as EDPNet, and is designed for efficient semantic image segmentation. On the one hand, during the encoding process of the proposed EDPNet, the enhancement of the Xception network, i.e.,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Dong, Li, Xianghong, Hu, Fan, Mathiopoulos, P. Takis, Di, Shaoning, Sui, Mingming, Peethambaran, Jiju
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10058792/ https://www.ncbi.nlm.nih.gov/pubmed/36991916 http://dx.doi.org/10.3390/s23063205

_version_	1785016719681519616
author	Chen, Dong Li, Xianghong Hu, Fan Mathiopoulos, P. Takis Di, Shaoning Sui, Mingming Peethambaran, Jiju
author_facet	Chen, Dong Li, Xianghong Hu, Fan Mathiopoulos, P. Takis Di, Shaoning Sui, Mingming Peethambaran, Jiju
author_sort	Chen, Dong
collection	PubMed
description	This paper proposes an encoding–decoding network with a pyramidal representation module, which will be referred to as EDPNet, and is designed for efficient semantic image segmentation. On the one hand, during the encoding process of the proposed EDPNet, the enhancement of the Xception network, i.e., Xception+ is employed as a backbone to learn the discriminative feature maps. The obtained discriminative features are then fed into the pyramidal representation module, from which the context-augmented features are learned and optimized by leveraging a multi-level feature representation and aggregation process. On the other hand, during the image restoration decoding process, the encoded semantic-rich features are progressively recovered with the assistance of a simplified skip connection mechanism, which performs channel concatenation between high-level encoded features with rich semantic information and low-level features with spatial detail information. The proposed hybrid representation employing the proposed encoding–decoding and pyramidal structures has a global-aware perception and captures fine-grained contours of various geographical objects very well with high computational efficiency. The performance of the proposed EDPNet has been compared against PSPNet, DeepLabv3, and U-Net, employing four benchmark datasets, namely eTRIMS, Cityscapes, PASCAL VOC2012, and CamVid. EDPNet acquired the highest accuracy of 83.6% and 73.8% mIoUs on eTRIMS and PASCAL VOC2012 datasets, while its accuracy on the other two datasets was comparable to that of PSPNet, DeepLabv3, and U-Net models. EDPNet achieved the highest efficiency among the compared models on all datasets.
format	Online Article Text
id	pubmed-10058792
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100587922023-03-30 EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation Chen, Dong Li, Xianghong Hu, Fan Mathiopoulos, P. Takis Di, Shaoning Sui, Mingming Peethambaran, Jiju Sensors (Basel) Article This paper proposes an encoding–decoding network with a pyramidal representation module, which will be referred to as EDPNet, and is designed for efficient semantic image segmentation. On the one hand, during the encoding process of the proposed EDPNet, the enhancement of the Xception network, i.e., Xception+ is employed as a backbone to learn the discriminative feature maps. The obtained discriminative features are then fed into the pyramidal representation module, from which the context-augmented features are learned and optimized by leveraging a multi-level feature representation and aggregation process. On the other hand, during the image restoration decoding process, the encoded semantic-rich features are progressively recovered with the assistance of a simplified skip connection mechanism, which performs channel concatenation between high-level encoded features with rich semantic information and low-level features with spatial detail information. The proposed hybrid representation employing the proposed encoding–decoding and pyramidal structures has a global-aware perception and captures fine-grained contours of various geographical objects very well with high computational efficiency. The performance of the proposed EDPNet has been compared against PSPNet, DeepLabv3, and U-Net, employing four benchmark datasets, namely eTRIMS, Cityscapes, PASCAL VOC2012, and CamVid. EDPNet acquired the highest accuracy of 83.6% and 73.8% mIoUs on eTRIMS and PASCAL VOC2012 datasets, while its accuracy on the other two datasets was comparable to that of PSPNet, DeepLabv3, and U-Net models. EDPNet achieved the highest efficiency among the compared models on all datasets. MDPI 2023-03-17 /pmc/articles/PMC10058792/ /pubmed/36991916 http://dx.doi.org/10.3390/s23063205 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Chen, Dong Li, Xianghong Hu, Fan Mathiopoulos, P. Takis Di, Shaoning Sui, Mingming Peethambaran, Jiju EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation
title	EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation
title_full	EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation
title_fullStr	EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation
title_full_unstemmed	EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation
title_short	EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation
title_sort	edpnet: an encoding–decoding network with pyramidal representation for semantic image segmentation
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10058792/ https://www.ncbi.nlm.nih.gov/pubmed/36991916 http://dx.doi.org/10.3390/s23063205
work_keys_str_mv	AT chendong edpnetanencodingdecodingnetworkwithpyramidalrepresentationforsemanticimagesegmentation AT lixianghong edpnetanencodingdecodingnetworkwithpyramidalrepresentationforsemanticimagesegmentation AT hufan edpnetanencodingdecodingnetworkwithpyramidalrepresentationforsemanticimagesegmentation AT mathiopoulosptakis edpnetanencodingdecodingnetworkwithpyramidalrepresentationforsemanticimagesegmentation AT dishaoning edpnetanencodingdecodingnetworkwithpyramidalrepresentationforsemanticimagesegmentation AT suimingming edpnetanencodingdecodingnetworkwithpyramidalrepresentationforsemanticimagesegmentation AT peethambaranjiju edpnetanencodingdecodingnetworkwithpyramidalrepresentationforsemanticimagesegmentation

EDPNet: An Encoding–Decoding Network with Pyramidal Representation for Semantic Image Segmentation

Ejemplares similares