Cargando…
Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text
This article considers the task of handwritten text recognition using attention-based encoder–decoder networks trained in the Kazakh and Russian languages. We have developed a novel deep neural network model based on a fully gated CNN, supported by multiple bidirectional gated recurrent unit (BGRU)...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321198/ https://www.ncbi.nlm.nih.gov/pubmed/34460538 http://dx.doi.org/10.3390/jimaging6120141 |
_version_ | 1783730793853485056 |
---|---|
author | Abdallah, Abdelrahman Hamada, Mohamed Nurseitov, Daniyar |
author_facet | Abdallah, Abdelrahman Hamada, Mohamed Nurseitov, Daniyar |
author_sort | Abdallah, Abdelrahman |
collection | PubMed |
description | This article considers the task of handwritten text recognition using attention-based encoder–decoder networks trained in the Kazakh and Russian languages. We have developed a novel deep neural network model based on a fully gated CNN, supported by multiple bidirectional gated recurrent unit (BGRU) and attention mechanisms to manipulate sophisticated features that achieve 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER), and 0.253 Sequence Error Rate (SER) for the first test dataset and 0.064 CER, 0.24 WER and 0.361 SER for the second test dataset. Our proposed model is the first work to handle handwriting recognition models in Kazakh and Russian languages. Our results confirm the importance of our proposed Attention-Gated-CNN-BGRU approach for training handwriting text recognition and indicate that it can lead to statistically significant improvements (p-value < 0.05) in the sensitivity (recall) over the tests dataset. The proposed method’s performance was evaluated using handwritten text databases of three languages: English, Russian, and Kazakh. It demonstrates better results on the Handwritten Kazakh and Russian (HKR) dataset than the other well-known models. |
format | Online Article Text |
id | pubmed-8321198 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83211982021-08-26 Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text Abdallah, Abdelrahman Hamada, Mohamed Nurseitov, Daniyar J Imaging Article This article considers the task of handwritten text recognition using attention-based encoder–decoder networks trained in the Kazakh and Russian languages. We have developed a novel deep neural network model based on a fully gated CNN, supported by multiple bidirectional gated recurrent unit (BGRU) and attention mechanisms to manipulate sophisticated features that achieve 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER), and 0.253 Sequence Error Rate (SER) for the first test dataset and 0.064 CER, 0.24 WER and 0.361 SER for the second test dataset. Our proposed model is the first work to handle handwriting recognition models in Kazakh and Russian languages. Our results confirm the importance of our proposed Attention-Gated-CNN-BGRU approach for training handwriting text recognition and indicate that it can lead to statistically significant improvements (p-value < 0.05) in the sensitivity (recall) over the tests dataset. The proposed method’s performance was evaluated using handwritten text databases of three languages: English, Russian, and Kazakh. It demonstrates better results on the Handwritten Kazakh and Russian (HKR) dataset than the other well-known models. MDPI 2020-12-18 /pmc/articles/PMC8321198/ /pubmed/34460538 http://dx.doi.org/10.3390/jimaging6120141 Text en © 2020 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ). |
spellingShingle | Article Abdallah, Abdelrahman Hamada, Mohamed Nurseitov, Daniyar Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text |
title | Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text |
title_full | Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text |
title_fullStr | Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text |
title_full_unstemmed | Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text |
title_short | Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text |
title_sort | attention-based fully gated cnn-bgru for russian handwritten text |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321198/ https://www.ncbi.nlm.nih.gov/pubmed/34460538 http://dx.doi.org/10.3390/jimaging6120141 |
work_keys_str_mv | AT abdallahabdelrahman attentionbasedfullygatedcnnbgruforrussianhandwrittentext AT hamadamohamed attentionbasedfullygatedcnnbgruforrussianhandwrittentext AT nurseitovdaniyar attentionbasedfullygatedcnnbgruforrussianhandwrittentext |