
Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions

In this era of digitization, most hardcopy documents are being transformed into digital formats. In the process of transformation, large quantities of documents are stored and preserved through electronic scanning. These documents are available from various sources such as ancient documentation, old...


Bibliographic Details
Main Authors: Sulaiman, Alaa, Omar, Khairuddin, Nasrudin, Mohammad F.
Format: Online Article Text
Language: English
Published: MDPI 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8320943/
https://www.ncbi.nlm.nih.gov/pubmed/34460486
http://dx.doi.org/10.3390/jimaging5040048
author Sulaiman, Alaa
Omar, Khairuddin
Nasrudin, Mohammad F.
collection PubMed
description In this era of digitization, most hardcopy documents are being transformed into digital formats. In the process of transformation, large quantities of documents are stored and preserved through electronic scanning. These documents are available from various sources such as ancient documentation, old legal records, medical reports, music scores, palm leaf, and reports on security-related issues. In particular, ancient and historical documents are hard to read due to their degradation in terms of low contrast and the existence of corrupted artefacts. In recent times, degraded document binarization has been studied widely and several approaches have been developed to deal with issues and challenges in document binarization. In this paper, a comprehensive review is conducted on the issues and challenges faced during the image binarization process, followed by insights on various methods used for image binarization. This paper also discusses the advanced methods used for the enhancement of degraded documents that improve the quality of documents during the binarization process. Further discussions are made on the effectiveness and robustness of existing methods, and there is still scope to develop a hybrid approach that can deal with degraded document binarization more effectively.
format Online
Article
Text
id pubmed-8320943
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-8320943 2021-08-26 Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions Sulaiman, Alaa Omar, Khairuddin Nasrudin, Mohammad F. J Imaging Review MDPI 2019-04-12 /pmc/articles/PMC8320943/ /pubmed/34460486 http://dx.doi.org/10.3390/jimaging5040048 Text en © 2019 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions
topic Review