Cargando…

Digital Hebrew Paleography: Script Types and Modes

Paleography is the study of ancient and medieval handwriting. It is essential for understanding, authenticating, and dating historical texts. Across many archives and libraries, many handwritten manuscripts are yet to be classified. Human experts can process a limited number of manuscripts; therefor...

Descripción completa

Detalles Bibliográficos
Autores principales: Droby, Ahmad, Rabaev, Irina, Shapira, Daria Vasyutinsky, Kurar Barakat, Berat, El-Sana, Jihad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9146803/
https://www.ncbi.nlm.nih.gov/pubmed/35621907
http://dx.doi.org/10.3390/jimaging8050143
_version_ 1784716652564185088
author Droby, Ahmad
Rabaev, Irina
Shapira, Daria Vasyutinsky
Kurar Barakat, Berat
El-Sana, Jihad
author_facet Droby, Ahmad
Rabaev, Irina
Shapira, Daria Vasyutinsky
Kurar Barakat, Berat
El-Sana, Jihad
author_sort Droby, Ahmad
collection PubMed
description Paleography is the study of ancient and medieval handwriting. It is essential for understanding, authenticating, and dating historical texts. Across many archives and libraries, many handwritten manuscripts are yet to be classified. Human experts can process a limited number of manuscripts; therefore, there is a need for an automatic tool for script type classification. In this study, we utilize a deep-learning methodology to classify medieval Hebrew manuscripts into 14 classes based on their script style and mode. Hebrew paleography recognizes six regional styles and three graphical modes of scripts. We experiment with several input image representations and network architectures to determine the appropriate ones and explore several approaches for script classification. We obtained the highest accuracy using hierarchical classification approach. At the first level, the regional style of the script is classified. Then, the patch is passed to the corresponding model at the second level to determine the graphical mode. In addition, we explore the use of soft labels to define a value we call squareness value that indicates the squareness/cursiveness of the script. We show how the graphical mode labels can be redefined using the squareness value. This redefinition increases the classification accuracy significantly. Finally, we show that the automatic classification is on-par with a human expert paleographer.
format Online
Article
Text
id pubmed-9146803
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91468032022-05-29 Digital Hebrew Paleography: Script Types and Modes Droby, Ahmad Rabaev, Irina Shapira, Daria Vasyutinsky Kurar Barakat, Berat El-Sana, Jihad J Imaging Article Paleography is the study of ancient and medieval handwriting. It is essential for understanding, authenticating, and dating historical texts. Across many archives and libraries, many handwritten manuscripts are yet to be classified. Human experts can process a limited number of manuscripts; therefore, there is a need for an automatic tool for script type classification. In this study, we utilize a deep-learning methodology to classify medieval Hebrew manuscripts into 14 classes based on their script style and mode. Hebrew paleography recognizes six regional styles and three graphical modes of scripts. We experiment with several input image representations and network architectures to determine the appropriate ones and explore several approaches for script classification. We obtained the highest accuracy using hierarchical classification approach. At the first level, the regional style of the script is classified. Then, the patch is passed to the corresponding model at the second level to determine the graphical mode. In addition, we explore the use of soft labels to define a value we call squareness value that indicates the squareness/cursiveness of the script. We show how the graphical mode labels can be redefined using the squareness value. This redefinition increases the classification accuracy significantly. Finally, we show that the automatic classification is on-par with a human expert paleographer. MDPI 2022-05-21 /pmc/articles/PMC9146803/ /pubmed/35621907 http://dx.doi.org/10.3390/jimaging8050143 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Droby, Ahmad
Rabaev, Irina
Shapira, Daria Vasyutinsky
Kurar Barakat, Berat
El-Sana, Jihad
Digital Hebrew Paleography: Script Types and Modes
title Digital Hebrew Paleography: Script Types and Modes
title_full Digital Hebrew Paleography: Script Types and Modes
title_fullStr Digital Hebrew Paleography: Script Types and Modes
title_full_unstemmed Digital Hebrew Paleography: Script Types and Modes
title_short Digital Hebrew Paleography: Script Types and Modes
title_sort digital hebrew paleography: script types and modes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9146803/
https://www.ncbi.nlm.nih.gov/pubmed/35621907
http://dx.doi.org/10.3390/jimaging8050143
work_keys_str_mv AT drobyahmad digitalhebrewpaleographyscripttypesandmodes
AT rabaevirina digitalhebrewpaleographyscripttypesandmodes
AT shapiradariavasyutinsky digitalhebrewpaleographyscripttypesandmodes
AT kurarbarakatberat digitalhebrewpaleographyscripttypesandmodes
AT elsanajihad digitalhebrewpaleographyscripttypesandmodes