Cargando…

An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms

The paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databa...

Descripción completa

Detalles Bibliográficos
Autores principales: Brodic, Darko, Milivojevic, Dragan R., Milivojevic, Zoran N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Molecular Diversity Preservation International (MDPI) 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3231474/
https://www.ncbi.nlm.nih.gov/pubmed/22164106
http://dx.doi.org/10.3390/s110908782
_version_ 1782218230270525440
author Brodic, Darko
Milivojevic, Dragan R.
Milivojevic, Zoran N.
author_facet Brodic, Darko
Milivojevic, Dragan R.
Milivojevic, Zoran N.
author_sort Brodic, Darko
collection PubMed
description The paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databases as reference templates. Because of the mismatch, the reliable testing framework is required. Hence, a new approach to a comprehensive experimental framework for the evaluation of text line segmentation algorithms is proposed. It consists of synthetic multi-like text samples and real handwritten text as well. Although the tests are mutually independent, the results are cross-linked. The proposed method can be used for different types of scripts and languages. Furthermore, two different procedures for the evaluation of algorithm efficiency based on the obtained error type classification are proposed. The first is based on the segmentation line error description, while the second one incorporates well-known signal detection theory. Each of them has different capabilities and convenience, but they can be used as supplements to make the evaluation process efficient. Overall the proposed procedure based on the segmentation line error description has some advantages, characterized by five measures that describe measurement procedures.
format Online
Article
Text
id pubmed-3231474
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Molecular Diversity Preservation International (MDPI)
record_format MEDLINE/PubMed
spelling pubmed-32314742011-12-07 An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms Brodic, Darko Milivojevic, Dragan R. Milivojevic, Zoran N. Sensors (Basel) Article The paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databases as reference templates. Because of the mismatch, the reliable testing framework is required. Hence, a new approach to a comprehensive experimental framework for the evaluation of text line segmentation algorithms is proposed. It consists of synthetic multi-like text samples and real handwritten text as well. Although the tests are mutually independent, the results are cross-linked. The proposed method can be used for different types of scripts and languages. Furthermore, two different procedures for the evaluation of algorithm efficiency based on the obtained error type classification are proposed. The first is based on the segmentation line error description, while the second one incorporates well-known signal detection theory. Each of them has different capabilities and convenience, but they can be used as supplements to make the evaluation process efficient. Overall the proposed procedure based on the segmentation line error description has some advantages, characterized by five measures that describe measurement procedures. Molecular Diversity Preservation International (MDPI) 2011-09-13 /pmc/articles/PMC3231474/ /pubmed/22164106 http://dx.doi.org/10.3390/s110908782 Text en © 2011 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Brodic, Darko
Milivojevic, Dragan R.
Milivojevic, Zoran N.
An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_full An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_fullStr An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_full_unstemmed An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_short An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_sort approach to a comprehensive test framework for analysis and evaluation of text line segmentation algorithms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3231474/
https://www.ncbi.nlm.nih.gov/pubmed/22164106
http://dx.doi.org/10.3390/s110908782
work_keys_str_mv AT brodicdarko anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT milivojevicdraganr anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT milivojeviczorann anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT brodicdarko approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT milivojevicdraganr approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT milivojeviczorann approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms