Cargando…

A top-down manner-based DCNN architecture for semantic image segmentation

Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiao, Kai, Chen, Jian, Wang, Linyuan, Zeng, Lei, Yan, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5365135/
https://www.ncbi.nlm.nih.gov/pubmed/28339486
http://dx.doi.org/10.1371/journal.pone.0174508
_version_ 1782517463087316992
author Qiao, Kai
Chen, Jian
Wang, Linyuan
Zeng, Lei
Yan, Bin
author_facet Qiao, Kai
Chen, Jian
Wang, Linyuan
Zeng, Lei
Yan, Bin
author_sort Qiao, Kai
collection PubMed
description Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism, we conclude that DCNNs in a bottom-up manner are not enough, because semantic image segmentation task requires not only recognition but also visual attention capability. In the study, superpixels containing visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results of the PASCAL VOC segmentation task qualitatively show that coarse edges and error segmentation results are well improved. We also quantitatively obtain about 2%-3% intersection over union (IOU) accuracy improvement on the PASCAL VOC 2011 and 2012 test sets.
format Online
Article
Text
id pubmed-5365135
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53651352017-04-06 A top-down manner-based DCNN architecture for semantic image segmentation Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin PLoS One Research Article Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism, we conclude that DCNNs in a bottom-up manner are not enough, because semantic image segmentation task requires not only recognition but also visual attention capability. In the study, superpixels containing visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results of the PASCAL VOC segmentation task qualitatively show that coarse edges and error segmentation results are well improved. We also quantitatively obtain about 2%-3% intersection over union (IOU) accuracy improvement on the PASCAL VOC 2011 and 2012 test sets. Public Library of Science 2017-03-24 /pmc/articles/PMC5365135/ /pubmed/28339486 http://dx.doi.org/10.1371/journal.pone.0174508 Text en © 2017 Qiao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Qiao, Kai
Chen, Jian
Wang, Linyuan
Zeng, Lei
Yan, Bin
A top-down manner-based DCNN architecture for semantic image segmentation
title A top-down manner-based DCNN architecture for semantic image segmentation
title_full A top-down manner-based DCNN architecture for semantic image segmentation
title_fullStr A top-down manner-based DCNN architecture for semantic image segmentation
title_full_unstemmed A top-down manner-based DCNN architecture for semantic image segmentation
title_short A top-down manner-based DCNN architecture for semantic image segmentation
title_sort top-down manner-based dcnn architecture for semantic image segmentation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5365135/
https://www.ncbi.nlm.nih.gov/pubmed/28339486
http://dx.doi.org/10.1371/journal.pone.0174508
work_keys_str_mv AT qiaokai atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT chenjian atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT wanglinyuan atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT zenglei atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT yanbin atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT qiaokai topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT chenjian topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT wanglinyuan topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT zenglei topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation
AT yanbin topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation