Cargando…
Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
Training deep learning based handwritten text recognition systems needs a lot of data in terms of text images and their corresponding annotations. One way to deal with this issue is to use data augmentation techniques to increase the amount of training data. Generative Adversarial Networks (GANs) ba...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8802770/ https://www.ncbi.nlm.nih.gov/pubmed/35174276 http://dx.doi.org/10.7717/peerj-cs.861 |
_version_ | 1784642738880249856 |
---|---|
author | Eltay, Mohamed Zidouri, Abdelmalek Ahmad, Irfan Elarian, Yousef |
author_facet | Eltay, Mohamed Zidouri, Abdelmalek Ahmad, Irfan Elarian, Yousef |
author_sort | Eltay, Mohamed |
collection | PubMed |
description | Training deep learning based handwritten text recognition systems needs a lot of data in terms of text images and their corresponding annotations. One way to deal with this issue is to use data augmentation techniques to increase the amount of training data. Generative Adversarial Networks (GANs) based data augmentation techniques are popular in literature especially in tasks related to images. However, specific challenges need to be addressed in order to effectively use GANs for data augmentation in the domain of text recognition. Text data is inherently imbalanced in terms of frequency of different characters appearing in training samples and the training data as a whole. GANs trained on the imbalanced dataset leads to augmented data that does not represent the minority characters well. In this paper, we present an adaptive data augmentation technique using GANs that deals with the issue of class imbalance arising in text recognition problems. We show, using experimental evaluations on two publicly available datasets for handwritten Arabic text recognition, that the GANs trained using the presented technique is effective in dealing with class imbalanced problem by generating augmented data that is balanced in terms of character frequencies. The resulting text recognition systems trained on the balanced augmented data improves the text recognition accuracy as compared to the systems trained using standard techniques. |
format | Online Article Text |
id | pubmed-8802770 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88027702022-02-15 Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition Eltay, Mohamed Zidouri, Abdelmalek Ahmad, Irfan Elarian, Yousef PeerJ Comput Sci Artificial Intelligence Training deep learning based handwritten text recognition systems needs a lot of data in terms of text images and their corresponding annotations. One way to deal with this issue is to use data augmentation techniques to increase the amount of training data. Generative Adversarial Networks (GANs) based data augmentation techniques are popular in literature especially in tasks related to images. However, specific challenges need to be addressed in order to effectively use GANs for data augmentation in the domain of text recognition. Text data is inherently imbalanced in terms of frequency of different characters appearing in training samples and the training data as a whole. GANs trained on the imbalanced dataset leads to augmented data that does not represent the minority characters well. In this paper, we present an adaptive data augmentation technique using GANs that deals with the issue of class imbalance arising in text recognition problems. We show, using experimental evaluations on two publicly available datasets for handwritten Arabic text recognition, that the GANs trained using the presented technique is effective in dealing with class imbalanced problem by generating augmented data that is balanced in terms of character frequencies. The resulting text recognition systems trained on the balanced augmented data improves the text recognition accuracy as compared to the systems trained using standard techniques. PeerJ Inc. 2022-01-25 /pmc/articles/PMC8802770/ /pubmed/35174276 http://dx.doi.org/10.7717/peerj-cs.861 Text en ©2022 Eltay et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Artificial Intelligence Eltay, Mohamed Zidouri, Abdelmalek Ahmad, Irfan Elarian, Yousef Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition |
title | Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition |
title_full | Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition |
title_fullStr | Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition |
title_full_unstemmed | Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition |
title_short | Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition |
title_sort | generative adversarial network based adaptive data augmentation for handwritten arabic text recognition |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8802770/ https://www.ncbi.nlm.nih.gov/pubmed/35174276 http://dx.doi.org/10.7717/peerj-cs.861 |
work_keys_str_mv | AT eltaymohamed generativeadversarialnetworkbasedadaptivedataaugmentationforhandwrittenarabictextrecognition AT zidouriabdelmalek generativeadversarialnetworkbasedadaptivedataaugmentationforhandwrittenarabictextrecognition AT ahmadirfan generativeadversarialnetworkbasedadaptivedataaugmentationforhandwrittenarabictextrecognition AT elarianyousef generativeadversarialnetworkbasedadaptivedataaugmentationforhandwrittenarabictextrecognition |