Cargando…

Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction

Text line segmentation is an essential stage in off-line optical character recognition (OCR) systems. It is a key because inaccurately segmented text lines will lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwri...

Descripción completa

Detalles Bibliográficos
Autores principales: Brodić, Darko, Milivojević, Dragan R., Milivojević, Zoran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Molecular Diversity Preservation International (MDPI) 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3292172/
https://www.ncbi.nlm.nih.gov/pubmed/22399932
http://dx.doi.org/10.3390/s100505263
_version_ 1782225249249525760
author Brodić, Darko
Milivojević, Dragan R.
Milivojević, Zoran
author_facet Brodić, Darko
Milivojević, Dragan R.
Milivojević, Zoran
author_sort Brodić, Darko
collection PubMed
description Text line segmentation is an essential stage in off-line optical character recognition (OCR) systems. It is a key because inaccurately segmented text lines will lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwriting. Hence, text line segmentation is a leading challenge in handwritten document image processing. Due to inconsistencies in measurement and evaluation of text segmentation algorithm quality, some basic set of measurement methods is required. Currently, there is no commonly accepted one and all algorithm evaluation is custom oriented. In this paper, a basic test framework for the evaluation of text feature extraction algorithms is proposed. This test framework consists of a few experiments primarily linked to text line segmentation, skew rate and reference text line evaluation. Although they are mutually independent, the results obtained are strongly cross linked. In the end, its suitability for different types of letters and languages as well as its adaptability are its main advantages. Thus, the paper presents an efficient evaluation method for text analysis algorithms.
format Online
Article
Text
id pubmed-3292172
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Molecular Diversity Preservation International (MDPI)
record_format MEDLINE/PubMed
spelling pubmed-32921722012-03-07 Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction Brodić, Darko Milivojević, Dragan R. Milivojević, Zoran Sensors (Basel) Article Text line segmentation is an essential stage in off-line optical character recognition (OCR) systems. It is a key because inaccurately segmented text lines will lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwriting. Hence, text line segmentation is a leading challenge in handwritten document image processing. Due to inconsistencies in measurement and evaluation of text segmentation algorithm quality, some basic set of measurement methods is required. Currently, there is no commonly accepted one and all algorithm evaluation is custom oriented. In this paper, a basic test framework for the evaluation of text feature extraction algorithms is proposed. This test framework consists of a few experiments primarily linked to text line segmentation, skew rate and reference text line evaluation. Although they are mutually independent, the results obtained are strongly cross linked. In the end, its suitability for different types of letters and languages as well as its adaptability are its main advantages. Thus, the paper presents an efficient evaluation method for text analysis algorithms. Molecular Diversity Preservation International (MDPI) 2010-05-25 /pmc/articles/PMC3292172/ /pubmed/22399932 http://dx.doi.org/10.3390/s100505263 Text en © 2010 by the authors; licensee MDPI, Basel, Switzerland. This article is an Open Access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Brodić, Darko
Milivojević, Dragan R.
Milivojević, Zoran
Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction
title Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction
title_full Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction
title_fullStr Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction
title_full_unstemmed Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction
title_short Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction
title_sort basic test framework for the evaluation of text line segmentation and text parameter extraction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3292172/
https://www.ncbi.nlm.nih.gov/pubmed/22399932
http://dx.doi.org/10.3390/s100505263
work_keys_str_mv AT brodicdarko basictestframeworkfortheevaluationoftextlinesegmentationandtextparameterextraction
AT milivojevicdraganr basictestframeworkfortheevaluationoftextlinesegmentationandtextparameterextraction
AT milivojeviczoran basictestframeworkfortheevaluationoftextlinesegmentationandtextparameterextraction