Cargando…

Writer verification of partially damaged handwritten Arabic documents based on individual character shapes

Author verification of handwritten text is required in several application domains and has drawn a lot of attention within the research community due to its importance. Though, several approaches have been proposed for the text-independent writer verification of handwritten text, none of these have...

Descripción completa

Detalles Bibliográficos
Autores principales: Khan, Majid A., Mohammad, Nazeeruddin, Ben Brahim, Ghassen, Bashar, Abul, Latif, Ghazanfar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044228/
https://www.ncbi.nlm.nih.gov/pubmed/35494816
http://dx.doi.org/10.7717/peerj-cs.955
_version_ 1784695059573112832
author Khan, Majid A.
Mohammad, Nazeeruddin
Ben Brahim, Ghassen
Bashar, Abul
Latif, Ghazanfar
author_facet Khan, Majid A.
Mohammad, Nazeeruddin
Ben Brahim, Ghassen
Bashar, Abul
Latif, Ghazanfar
author_sort Khan, Majid A.
collection PubMed
description Author verification of handwritten text is required in several application domains and has drawn a lot of attention within the research community due to its importance. Though, several approaches have been proposed for the text-independent writer verification of handwritten text, none of these have addressed the problem domain where author verification is sought based on partially-damaged handwritten documents (e.g., during forensic analysis). In this paper, we propose an approach for offline text-independent writer verification of handwritten Arabic text based on individual character shapes (within the Arabic alphabet). The proposed approach enables writer verification for partially damaged documents where certain handwritten characters can still be extracted from the damaged document. We also provide a mechanism to identify which Arabic characters are more effective during the writer verification process. We have collected a new dataset, Arabic Handwritten Alphabet, Words and Paragraphs Per User (AHAWP), for this purpose in a classroom setting with 82 different users. The dataset consists of 53,199 user-written isolated Arabic characters, 8,144 Arabic words, 10,780 characters extracted from these words. Convolutional neural network (CNN) based models are developed for verification of writers based on individual characters with an accuracy of 94% for isolated character shapes and 90% for extracted character shapes. Our proposed approach provided up to 95% writer verification accuracy for partially damaged documents.
format Online
Article
Text
id pubmed-9044228
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-90442282022-04-28 Writer verification of partially damaged handwritten Arabic documents based on individual character shapes Khan, Majid A. Mohammad, Nazeeruddin Ben Brahim, Ghassen Bashar, Abul Latif, Ghazanfar PeerJ Comput Sci Artificial Intelligence Author verification of handwritten text is required in several application domains and has drawn a lot of attention within the research community due to its importance. Though, several approaches have been proposed for the text-independent writer verification of handwritten text, none of these have addressed the problem domain where author verification is sought based on partially-damaged handwritten documents (e.g., during forensic analysis). In this paper, we propose an approach for offline text-independent writer verification of handwritten Arabic text based on individual character shapes (within the Arabic alphabet). The proposed approach enables writer verification for partially damaged documents where certain handwritten characters can still be extracted from the damaged document. We also provide a mechanism to identify which Arabic characters are more effective during the writer verification process. We have collected a new dataset, Arabic Handwritten Alphabet, Words and Paragraphs Per User (AHAWP), for this purpose in a classroom setting with 82 different users. The dataset consists of 53,199 user-written isolated Arabic characters, 8,144 Arabic words, 10,780 characters extracted from these words. Convolutional neural network (CNN) based models are developed for verification of writers based on individual characters with an accuracy of 94% for isolated character shapes and 90% for extracted character shapes. Our proposed approach provided up to 95% writer verification accuracy for partially damaged documents. PeerJ Inc. 2022-04-20 /pmc/articles/PMC9044228/ /pubmed/35494816 http://dx.doi.org/10.7717/peerj-cs.955 Text en ©2022 Khan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Khan, Majid A.
Mohammad, Nazeeruddin
Ben Brahim, Ghassen
Bashar, Abul
Latif, Ghazanfar
Writer verification of partially damaged handwritten Arabic documents based on individual character shapes
title Writer verification of partially damaged handwritten Arabic documents based on individual character shapes
title_full Writer verification of partially damaged handwritten Arabic documents based on individual character shapes
title_fullStr Writer verification of partially damaged handwritten Arabic documents based on individual character shapes
title_full_unstemmed Writer verification of partially damaged handwritten Arabic documents based on individual character shapes
title_short Writer verification of partially damaged handwritten Arabic documents based on individual character shapes
title_sort writer verification of partially damaged handwritten arabic documents based on individual character shapes
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044228/
https://www.ncbi.nlm.nih.gov/pubmed/35494816
http://dx.doi.org/10.7717/peerj-cs.955
work_keys_str_mv AT khanmajida writerverificationofpartiallydamagedhandwrittenarabicdocumentsbasedonindividualcharactershapes
AT mohammadnazeeruddin writerverificationofpartiallydamagedhandwrittenarabicdocumentsbasedonindividualcharactershapes
AT benbrahimghassen writerverificationofpartiallydamagedhandwrittenarabicdocumentsbasedonindividualcharactershapes
AT basharabul writerverificationofpartiallydamagedhandwrittenarabicdocumentsbasedonindividualcharactershapes
AT latifghazanfar writerverificationofpartiallydamagedhandwrittenarabicdocumentsbasedonindividualcharactershapes