Cargando…

A top-down manner-based DCNN architecture for semantic image segmentation

Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism...

Descripción completa

Detalles Bibliográficos
Autores principales:	Qiao, Kai, Chen, Jian, Wang, Linyuan, Zeng, Lei, Yan, Bin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5365135/ https://www.ncbi.nlm.nih.gov/pubmed/28339486 http://dx.doi.org/10.1371/journal.pone.0174508

_version_	1782517463087316992
author	Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin
author_facet	Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin
author_sort	Qiao, Kai
collection	PubMed
description	Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism, we conclude that DCNNs in a bottom-up manner are not enough, because semantic image segmentation task requires not only recognition but also visual attention capability. In the study, superpixels containing visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results of the PASCAL VOC segmentation task qualitatively show that coarse edges and error segmentation results are well improved. We also quantitatively obtain about 2%-3% intersection over union (IOU) accuracy improvement on the PASCAL VOC 2011 and 2012 test sets.
format	Online Article Text
id	pubmed-5365135
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-53651352017-04-06 A top-down manner-based DCNN architecture for semantic image segmentation Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin PLoS One Research Article Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism, we conclude that DCNNs in a bottom-up manner are not enough, because semantic image segmentation task requires not only recognition but also visual attention capability. In the study, superpixels containing visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results of the PASCAL VOC segmentation task qualitatively show that coarse edges and error segmentation results are well improved. We also quantitatively obtain about 2%-3% intersection over union (IOU) accuracy improvement on the PASCAL VOC 2011 and 2012 test sets. Public Library of Science 2017-03-24 /pmc/articles/PMC5365135/ /pubmed/28339486 http://dx.doi.org/10.1371/journal.pone.0174508 Text en © 2017 Qiao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Qiao, Kai Chen, Jian Wang, Linyuan Zeng, Lei Yan, Bin A top-down manner-based DCNN architecture for semantic image segmentation
title	A top-down manner-based DCNN architecture for semantic image segmentation
title_full	A top-down manner-based DCNN architecture for semantic image segmentation
title_fullStr	A top-down manner-based DCNN architecture for semantic image segmentation
title_full_unstemmed	A top-down manner-based DCNN architecture for semantic image segmentation
title_short	A top-down manner-based DCNN architecture for semantic image segmentation
title_sort	top-down manner-based dcnn architecture for semantic image segmentation
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5365135/ https://www.ncbi.nlm.nih.gov/pubmed/28339486 http://dx.doi.org/10.1371/journal.pone.0174508
work_keys_str_mv	AT qiaokai atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT chenjian atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT wanglinyuan atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT zenglei atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT yanbin atopdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT qiaokai topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT chenjian topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT wanglinyuan topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT zenglei topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation AT yanbin topdownmannerbaseddcnnarchitectureforsemanticimagesegmentation

A top-down manner-based DCNN architecture for semantic image segmentation

Ejemplares similares