Cargando…

A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents

This paper proposes a performance model for estimating the user time needed to transcribe small collections of handwritten documents using a keyword spotting system (KWS) that provides a number of possible transcriptions for each word image. The model assumes that only information obtained from a sm...

Descripción completa

Detalles Bibliográficos
Autores principales: Marcelli, Angelo, De Gregorio, Giuseppe, Santoro, Adolfo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321178/
https://www.ncbi.nlm.nih.gov/pubmed/34460561
http://dx.doi.org/10.3390/jimaging6110117
_version_ 1783730789181030400
author Marcelli, Angelo
De Gregorio, Giuseppe
Santoro, Adolfo
author_facet Marcelli, Angelo
De Gregorio, Giuseppe
Santoro, Adolfo
author_sort Marcelli, Angelo
collection PubMed
description This paper proposes a performance model for estimating the user time needed to transcribe small collections of handwritten documents using a keyword spotting system (KWS) that provides a number of possible transcriptions for each word image. The model assumes that only information obtained from a small training set is available, and establishes the constraints on the performance measures to achieve a reduction of the time for transcribing the content with respect to the time required by human experts. The model is complemented with a procedure for computing the parameters of the model and eventually estimating the improvement of the time to achieve a complete and error-free transcription of the documents.
format Online
Article
Text
id pubmed-8321178
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83211782021-08-26 A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents Marcelli, Angelo De Gregorio, Giuseppe Santoro, Adolfo J Imaging Article This paper proposes a performance model for estimating the user time needed to transcribe small collections of handwritten documents using a keyword spotting system (KWS) that provides a number of possible transcriptions for each word image. The model assumes that only information obtained from a small training set is available, and establishes the constraints on the performance measures to achieve a reduction of the time for transcribing the content with respect to the time required by human experts. The model is complemented with a procedure for computing the parameters of the model and eventually estimating the improvement of the time to achieve a complete and error-free transcription of the documents. MDPI 2020-11-03 /pmc/articles/PMC8321178/ /pubmed/34460561 http://dx.doi.org/10.3390/jimaging6110117 Text en © 2020 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Article
Marcelli, Angelo
De Gregorio, Giuseppe
Santoro, Adolfo
A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents
title A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents
title_full A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents
title_fullStr A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents
title_full_unstemmed A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents
title_short A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents
title_sort model for evaluating the performance of a multiple keywords spotting system for the transcription of historical handwritten documents
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321178/
https://www.ncbi.nlm.nih.gov/pubmed/34460561
http://dx.doi.org/10.3390/jimaging6110117
work_keys_str_mv AT marcelliangelo amodelforevaluatingtheperformanceofamultiplekeywordsspottingsystemforthetranscriptionofhistoricalhandwrittendocuments
AT degregoriogiuseppe amodelforevaluatingtheperformanceofamultiplekeywordsspottingsystemforthetranscriptionofhistoricalhandwrittendocuments
AT santoroadolfo amodelforevaluatingtheperformanceofamultiplekeywordsspottingsystemforthetranscriptionofhistoricalhandwrittendocuments
AT marcelliangelo modelforevaluatingtheperformanceofamultiplekeywordsspottingsystemforthetranscriptionofhistoricalhandwrittendocuments
AT degregoriogiuseppe modelforevaluatingtheperformanceofamultiplekeywordsspottingsystemforthetranscriptionofhistoricalhandwrittendocuments
AT santoroadolfo modelforevaluatingtheperformanceofamultiplekeywordsspottingsystemforthetranscriptionofhistoricalhandwrittendocuments