Cargando…

Unified DeepLabV3+ for Semi-Dark Image Semantic Segmentation

Semantic segmentation for accurate visual perception is a critical task in computer vision. In principle, the automatic classification of dynamic visual scenes using predefined object classes remains unresolved. The challenging problems of learning deep convolution neural networks, specifically ResN...

Descripción completa

Detalles Bibliográficos
Autores principales: Memon, Mehak Maqbool, Hashmani, Manzoor Ahmed, Junejo, Aisha Zahid, Rizvi, Syed Sajjad, Raza, Kamran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9324997/
https://www.ncbi.nlm.nih.gov/pubmed/35890992
http://dx.doi.org/10.3390/s22145312
_version_ 1784756940577964032
author Memon, Mehak Maqbool
Hashmani, Manzoor Ahmed
Junejo, Aisha Zahid
Rizvi, Syed Sajjad
Raza, Kamran
author_facet Memon, Mehak Maqbool
Hashmani, Manzoor Ahmed
Junejo, Aisha Zahid
Rizvi, Syed Sajjad
Raza, Kamran
author_sort Memon, Mehak Maqbool
collection PubMed
description Semantic segmentation for accurate visual perception is a critical task in computer vision. In principle, the automatic classification of dynamic visual scenes using predefined object classes remains unresolved. The challenging problems of learning deep convolution neural networks, specifically ResNet-based DeepLabV3+ (the most recent version), are threefold. The problems arise due to (1) biased centric exploitations of filter masks, (2) lower representational power of residual networks due to identity shortcuts, and (3) a loss of spatial relationship by using per-pixel primitives. To solve these problems, we present a proficient approach based on DeepLabV3+, along with an added evaluation metric, namely, Unified DeepLabV3+ and [Formula: see text] , respectively. The presented unified version reduced the effect of biased exploitations via additional dilated convolution layers with customized dilation rates. We further tackled the problem of representational power by introducing non-linear group normalization shortcuts to solve the focused problem of semi-dark images. Meanwhile, to keep track of the spatial relationships in terms of the global and local contexts, geometrically bunched pixel cues were used. We accumulated all the proposed variants of DeepLabV3+ to propose Unified DeepLabV3+ for accurate visual decisions. Finally, the proposed [Formula: see text] evaluation metric was based on the weighted combination of three different accuracy measures, i.e., the pixel accuracy, IoU (intersection over union), and Mean BFScore, as robust identification criteria. Extensive experimental analysis performed over a CamVid dataset confirmed the applicability of the proposed solution for autonomous vehicles and robotics for outdoor settings. The experimental analysis showed that the proposed Unified DeepLabV3+ outperformed DeepLabV3+ by a margin of 3% in terms of the class-wise pixel accuracy, along with a higher [Formula: see text] , depicting the effectiveness of the proposed approach.
format Online
Article
Text
id pubmed-9324997
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93249972022-07-27 Unified DeepLabV3+ for Semi-Dark Image Semantic Segmentation Memon, Mehak Maqbool Hashmani, Manzoor Ahmed Junejo, Aisha Zahid Rizvi, Syed Sajjad Raza, Kamran Sensors (Basel) Article Semantic segmentation for accurate visual perception is a critical task in computer vision. In principle, the automatic classification of dynamic visual scenes using predefined object classes remains unresolved. The challenging problems of learning deep convolution neural networks, specifically ResNet-based DeepLabV3+ (the most recent version), are threefold. The problems arise due to (1) biased centric exploitations of filter masks, (2) lower representational power of residual networks due to identity shortcuts, and (3) a loss of spatial relationship by using per-pixel primitives. To solve these problems, we present a proficient approach based on DeepLabV3+, along with an added evaluation metric, namely, Unified DeepLabV3+ and [Formula: see text] , respectively. The presented unified version reduced the effect of biased exploitations via additional dilated convolution layers with customized dilation rates. We further tackled the problem of representational power by introducing non-linear group normalization shortcuts to solve the focused problem of semi-dark images. Meanwhile, to keep track of the spatial relationships in terms of the global and local contexts, geometrically bunched pixel cues were used. We accumulated all the proposed variants of DeepLabV3+ to propose Unified DeepLabV3+ for accurate visual decisions. Finally, the proposed [Formula: see text] evaluation metric was based on the weighted combination of three different accuracy measures, i.e., the pixel accuracy, IoU (intersection over union), and Mean BFScore, as robust identification criteria. Extensive experimental analysis performed over a CamVid dataset confirmed the applicability of the proposed solution for autonomous vehicles and robotics for outdoor settings. The experimental analysis showed that the proposed Unified DeepLabV3+ outperformed DeepLabV3+ by a margin of 3% in terms of the class-wise pixel accuracy, along with a higher [Formula: see text] , depicting the effectiveness of the proposed approach. MDPI 2022-07-15 /pmc/articles/PMC9324997/ /pubmed/35890992 http://dx.doi.org/10.3390/s22145312 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Memon, Mehak Maqbool
Hashmani, Manzoor Ahmed
Junejo, Aisha Zahid
Rizvi, Syed Sajjad
Raza, Kamran
Unified DeepLabV3+ for Semi-Dark Image Semantic Segmentation
title Unified DeepLabV3+ for Semi-Dark Image Semantic Segmentation
title_full Unified DeepLabV3+ for Semi-Dark Image Semantic Segmentation
title_fullStr Unified DeepLabV3+ for Semi-Dark Image Semantic Segmentation
title_full_unstemmed Unified DeepLabV3+ for Semi-Dark Image Semantic Segmentation
title_short Unified DeepLabV3+ for Semi-Dark Image Semantic Segmentation
title_sort unified deeplabv3+ for semi-dark image semantic segmentation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9324997/
https://www.ncbi.nlm.nih.gov/pubmed/35890992
http://dx.doi.org/10.3390/s22145312
work_keys_str_mv AT memonmehakmaqbool unifieddeeplabv3forsemidarkimagesemanticsegmentation
AT hashmanimanzoorahmed unifieddeeplabv3forsemidarkimagesemanticsegmentation
AT junejoaishazahid unifieddeeplabv3forsemidarkimagesemanticsegmentation
AT rizvisyedsajjad unifieddeeplabv3forsemidarkimagesemanticsegmentation
AT razakamran unifieddeeplabv3forsemidarkimagesemanticsegmentation