Cargando…

Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images

Texts in natural scenes carry rich semantic information, which can be used to assist a wide range of applications, such as object recognition, image/video retrieval, mapping/navigation, and human computer interaction. However, most existing systems are designed to detect and recognize horizontal (or...

Descripción completa

Detalles Bibliográficos
Autores principales: Yao, Cong, Zhang, Xin, Bai, Xiang, Liu, Wenyu, Ma, Yi, Tu, Zhuowen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3734103/
https://www.ncbi.nlm.nih.gov/pubmed/23940544
http://dx.doi.org/10.1371/journal.pone.0070173
_version_ 1782279473246240768
author Yao, Cong
Zhang, Xin
Bai, Xiang
Liu, Wenyu
Ma, Yi
Tu, Zhuowen
author_facet Yao, Cong
Zhang, Xin
Bai, Xiang
Liu, Wenyu
Ma, Yi
Tu, Zhuowen
author_sort Yao, Cong
collection PubMed
description Texts in natural scenes carry rich semantic information, which can be used to assist a wide range of applications, such as object recognition, image/video retrieval, mapping/navigation, and human computer interaction. However, most existing systems are designed to detect and recognize horizontal (or near-horizontal) texts. Due to the increasing popularity of mobile-computing devices and applications, detecting texts of varying orientations from natural images under less controlled conditions has become an important but challenging task. In this paper, we propose a new algorithm to detect texts of varying orientations. Our algorithm is based on a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts. To better evaluate the proposed method and compare it with the competing algorithms, we generate a comprehensive dataset with various types of texts in diverse real-world scenes. We also propose a new evaluation protocol, which is more suitable for benchmarking algorithms for detecting texts in varying orientations. Experiments on benchmark datasets demonstrate that our system compares favorably with the state-of-the-art algorithms when handling horizontal texts and achieves significantly enhanced performance on variant texts in complex natural scenes.
format Online
Article
Text
id pubmed-3734103
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37341032013-08-12 Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images Yao, Cong Zhang, Xin Bai, Xiang Liu, Wenyu Ma, Yi Tu, Zhuowen PLoS One Research Article Texts in natural scenes carry rich semantic information, which can be used to assist a wide range of applications, such as object recognition, image/video retrieval, mapping/navigation, and human computer interaction. However, most existing systems are designed to detect and recognize horizontal (or near-horizontal) texts. Due to the increasing popularity of mobile-computing devices and applications, detecting texts of varying orientations from natural images under less controlled conditions has become an important but challenging task. In this paper, we propose a new algorithm to detect texts of varying orientations. Our algorithm is based on a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts. To better evaluate the proposed method and compare it with the competing algorithms, we generate a comprehensive dataset with various types of texts in diverse real-world scenes. We also propose a new evaluation protocol, which is more suitable for benchmarking algorithms for detecting texts in varying orientations. Experiments on benchmark datasets demonstrate that our system compares favorably with the state-of-the-art algorithms when handling horizontal texts and achieves significantly enhanced performance on variant texts in complex natural scenes. Public Library of Science 2013-08-05 /pmc/articles/PMC3734103/ /pubmed/23940544 http://dx.doi.org/10.1371/journal.pone.0070173 Text en © 2013 Yao et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Yao, Cong
Zhang, Xin
Bai, Xiang
Liu, Wenyu
Ma, Yi
Tu, Zhuowen
Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images
title Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images
title_full Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images
title_fullStr Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images
title_full_unstemmed Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images
title_short Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images
title_sort rotation-invariant features for multi-oriented text detection in natural images
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3734103/
https://www.ncbi.nlm.nih.gov/pubmed/23940544
http://dx.doi.org/10.1371/journal.pone.0070173
work_keys_str_mv AT yaocong rotationinvariantfeaturesformultiorientedtextdetectioninnaturalimages
AT zhangxin rotationinvariantfeaturesformultiorientedtextdetectioninnaturalimages
AT baixiang rotationinvariantfeaturesformultiorientedtextdetectioninnaturalimages
AT liuwenyu rotationinvariantfeaturesformultiorientedtextdetectioninnaturalimages
AT mayi rotationinvariantfeaturesformultiorientedtextdetectioninnaturalimages
AT tuzhuowen rotationinvariantfeaturesformultiorientedtextdetectioninnaturalimages