Cargando…

Visual Saliency Models for Text Detection in Real World

This paper evaluates the degree of saliency of texts in natural scenes using visual saliency models. A large scale scene image database with pixel level ground truth is created for this purpose. Using this scene image database and five state-of-the-art models, visual saliency maps that represent the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gao, Renwu, Uchida, Seiichi, Shahab, Asif, Shafait, Faisal, Frinken, Volkmar
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4262416/ https://www.ncbi.nlm.nih.gov/pubmed/25494196 http://dx.doi.org/10.1371/journal.pone.0114539

_version_	1782348428706054144
author	Gao, Renwu Uchida, Seiichi Shahab, Asif Shafait, Faisal Frinken, Volkmar
author_facet	Gao, Renwu Uchida, Seiichi Shahab, Asif Shafait, Faisal Frinken, Volkmar
author_sort	Gao, Renwu
collection	PubMed
description	This paper evaluates the degree of saliency of texts in natural scenes using visual saliency models. A large scale scene image database with pixel level ground truth is created for this purpose. Using this scene image database and five state-of-the-art models, visual saliency maps that represent the degree of saliency of the objects are calculated. The receiver operating characteristic curve is employed in order to evaluate the saliency of scene texts, which is calculated by visual saliency models. A visualization of the distribution of scene texts and non-texts in the space constructed by three kinds of saliency maps, which are calculated using Itti's visual saliency model with intensity, color and orientation features, is given. This visualization of distribution indicates that text characters are more salient than their non-text neighbors, and can be captured from the background. Therefore, scene texts can be extracted from the scene images. With this in mind, a new visual saliency architecture, named hierarchical visual saliency model, is proposed. Hierarchical visual saliency model is based on Itti's model and consists of two stages. In the first stage, Itti's model is used to calculate the saliency map, and Otsu's global thresholding algorithm is applied to extract the salient region that we are interested in. In the second stage, Itti's model is applied to the salient region to calculate the final saliency map. An experimental evaluation demonstrates that the proposed model outperforms Itti's model in terms of captured scene texts.
format	Online Article Text
id	pubmed-4262416
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-42624162014-12-15 Visual Saliency Models for Text Detection in Real World Gao, Renwu Uchida, Seiichi Shahab, Asif Shafait, Faisal Frinken, Volkmar PLoS One Research Article This paper evaluates the degree of saliency of texts in natural scenes using visual saliency models. A large scale scene image database with pixel level ground truth is created for this purpose. Using this scene image database and five state-of-the-art models, visual saliency maps that represent the degree of saliency of the objects are calculated. The receiver operating characteristic curve is employed in order to evaluate the saliency of scene texts, which is calculated by visual saliency models. A visualization of the distribution of scene texts and non-texts in the space constructed by three kinds of saliency maps, which are calculated using Itti's visual saliency model with intensity, color and orientation features, is given. This visualization of distribution indicates that text characters are more salient than their non-text neighbors, and can be captured from the background. Therefore, scene texts can be extracted from the scene images. With this in mind, a new visual saliency architecture, named hierarchical visual saliency model, is proposed. Hierarchical visual saliency model is based on Itti's model and consists of two stages. In the first stage, Itti's model is used to calculate the saliency map, and Otsu's global thresholding algorithm is applied to extract the salient region that we are interested in. In the second stage, Itti's model is applied to the salient region to calculate the final saliency map. An experimental evaluation demonstrates that the proposed model outperforms Itti's model in terms of captured scene texts. Public Library of Science 2014-12-10 /pmc/articles/PMC4262416/ /pubmed/25494196 http://dx.doi.org/10.1371/journal.pone.0114539 Text en © 2014 Gao et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Gao, Renwu Uchida, Seiichi Shahab, Asif Shafait, Faisal Frinken, Volkmar Visual Saliency Models for Text Detection in Real World
title	Visual Saliency Models for Text Detection in Real World
title_full	Visual Saliency Models for Text Detection in Real World
title_fullStr	Visual Saliency Models for Text Detection in Real World
title_full_unstemmed	Visual Saliency Models for Text Detection in Real World
title_short	Visual Saliency Models for Text Detection in Real World
title_sort	visual saliency models for text detection in real world
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4262416/ https://www.ncbi.nlm.nih.gov/pubmed/25494196 http://dx.doi.org/10.1371/journal.pone.0114539
work_keys_str_mv	AT gaorenwu visualsaliencymodelsfortextdetectioninrealworld AT uchidaseiichi visualsaliencymodelsfortextdetectioninrealworld AT shahabasif visualsaliencymodelsfortextdetectioninrealworld AT shafaitfaisal visualsaliencymodelsfortextdetectioninrealworld AT frinkenvolkmar visualsaliencymodelsfortextdetectioninrealworld

Visual Saliency Models for Text Detection in Real World

Ejemplares similares