Cargando…
A top-down manner-based DCNN architecture for semantic image segmentation
Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5365135/ https://www.ncbi.nlm.nih.gov/pubmed/28339486 http://dx.doi.org/10.1371/journal.pone.0174508 |
_version_ | 1782517463087316992 |
---|---|
author | Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin |
author_facet | Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin |
author_sort | Qiao, Kai |
collection | PubMed |
description | Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism, we conclude that DCNNs in a bottom-up manner are not enough, because semantic image segmentation task requires not only recognition but also visual attention capability. In the study, superpixels containing visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results of the PASCAL VOC segmentation task qualitatively show that coarse edges and error segmentation results are well improved. We also quantitatively obtain about 2%-3% intersection over union (IOU) accuracy improvement on the PASCAL VOC 2011 and 2012 test sets. |
format | Online Article Text |
id | pubmed-5365135 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53651352017-04-06 A top-down manner-based DCNN architecture for semantic image segmentation Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin PLoS One Research Article Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism, we conclude that DCNNs in a bottom-up manner are not enough, because semantic image segmentation task requires not only recognition but also visual attention capability. In the study, superpixels containing visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results of the PASCAL VOC segmentation task qualitatively show that coarse edges and error segmentation results are well improved. We also quantitatively obtain about 2%-3% intersection over union (IOU) accuracy improvement on the PASCAL VOC 2011 and 2012 test sets. Public Library of Science 2017-03-24 /pmc/articles/PMC5365135/ /pubmed/28339486 http://dx.doi.org/10.1371/journal.pone.0174508 Text en © 2017 Qiao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin A top-down manner-based DCNN architecture for semantic image segmentation |
title | A top-down manner-based DCNN architecture for semantic image segmentation |
title_full | A top-down manner-based DCNN architecture for semantic image segmentation |
title_fullStr | A top-down manner-based DCNN architecture for semantic image segmentation |
title_full_unstemmed | A top-down manner-based DCNN architecture for semantic image segmentation |
title_short | A top-down manner-based DCNN architecture for semantic image segmentation |
title_sort | top-down manner-based dcnn architecture for semantic image segmentation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5365135/ https://www.ncbi.nlm.nih.gov/pubmed/28339486 http://dx.doi.org/10.1371/journal.pone.0174508 |
work_keys_str_mv | AT qiaokai atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT chenjian atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT wanglinyuan atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT zenglei atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT yanbin atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT qiaokai topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT chenjian topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT wanglinyuan topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT zenglei topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT yanbin topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation |