Cargando…

Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition

Training deep learning based handwritten text recognition systems needs a lot of data in terms of text images and their corresponding annotations. One way to deal with this issue is to use data augmentation techniques to increase the amount of training data. Generative Adversarial Networks (GANs) ba...

Descripción completa

Detalles Bibliográficos
Autores principales: Eltay, Mohamed, Zidouri, Abdelmalek, Ahmad, Irfan, Elarian, Yousef
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8802770/
https://www.ncbi.nlm.nih.gov/pubmed/35174276
http://dx.doi.org/10.7717/peerj-cs.861
_version_ 1784642738880249856
author Eltay, Mohamed
Zidouri, Abdelmalek
Ahmad, Irfan
Elarian, Yousef
author_facet Eltay, Mohamed
Zidouri, Abdelmalek
Ahmad, Irfan
Elarian, Yousef
author_sort Eltay, Mohamed
collection PubMed
description Training deep learning based handwritten text recognition systems needs a lot of data in terms of text images and their corresponding annotations. One way to deal with this issue is to use data augmentation techniques to increase the amount of training data. Generative Adversarial Networks (GANs) based data augmentation techniques are popular in literature especially in tasks related to images. However, specific challenges need to be addressed in order to effectively use GANs for data augmentation in the domain of text recognition. Text data is inherently imbalanced in terms of frequency of different characters appearing in training samples and the training data as a whole. GANs trained on the imbalanced dataset leads to augmented data that does not represent the minority characters well. In this paper, we present an adaptive data augmentation technique using GANs that deals with the issue of class imbalance arising in text recognition problems. We show, using experimental evaluations on two publicly available datasets for handwritten Arabic text recognition, that the GANs trained using the presented technique is effective in dealing with class imbalanced problem by generating augmented data that is balanced in terms of character frequencies. The resulting text recognition systems trained on the balanced augmented data improves the text recognition accuracy as compared to the systems trained using standard techniques.
format Online
Article
Text
id pubmed-8802770
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-88027702022-02-15 Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition Eltay, Mohamed Zidouri, Abdelmalek Ahmad, Irfan Elarian, Yousef PeerJ Comput Sci Artificial Intelligence Training deep learning based handwritten text recognition systems needs a lot of data in terms of text images and their corresponding annotations. One way to deal with this issue is to use data augmentation techniques to increase the amount of training data. Generative Adversarial Networks (GANs) based data augmentation techniques are popular in literature especially in tasks related to images. However, specific challenges need to be addressed in order to effectively use GANs for data augmentation in the domain of text recognition. Text data is inherently imbalanced in terms of frequency of different characters appearing in training samples and the training data as a whole. GANs trained on the imbalanced dataset leads to augmented data that does not represent the minority characters well. In this paper, we present an adaptive data augmentation technique using GANs that deals with the issue of class imbalance arising in text recognition problems. We show, using experimental evaluations on two publicly available datasets for handwritten Arabic text recognition, that the GANs trained using the presented technique is effective in dealing with class imbalanced problem by generating augmented data that is balanced in terms of character frequencies. The resulting text recognition systems trained on the balanced augmented data improves the text recognition accuracy as compared to the systems trained using standard techniques. PeerJ Inc. 2022-01-25 /pmc/articles/PMC8802770/ /pubmed/35174276 http://dx.doi.org/10.7717/peerj-cs.861 Text en ©2022 Eltay et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Eltay, Mohamed
Zidouri, Abdelmalek
Ahmad, Irfan
Elarian, Yousef
Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
title Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
title_full Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
title_fullStr Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
title_full_unstemmed Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
title_short Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
title_sort generative adversarial network based adaptive data augmentation for handwritten arabic text recognition
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8802770/
https://www.ncbi.nlm.nih.gov/pubmed/35174276
http://dx.doi.org/10.7717/peerj-cs.861
work_keys_str_mv AT eltaymohamed generativeadversarialnetworkbasedadaptivedataaugmentationforhandwrittenarabictextrecognition
AT zidouriabdelmalek generativeadversarialnetworkbasedadaptivedataaugmentationforhandwrittenarabictextrecognition
AT ahmadirfan generativeadversarialnetworkbasedadaptivedataaugmentationforhandwrittenarabictextrecognition
AT elarianyousef generativeadversarialnetworkbasedadaptivedataaugmentationforhandwrittenarabictextrecognition