
Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text

This article considers the task of handwritten text recognition using attention-based encoder–decoder networks trained on the Kazakh and Russian languages. We have developed a novel deep neural network model based on a fully gated CNN, supported by multiple bidirectional gated recurrent units (BGRU)...


Bibliographic Details
Main Authors: Abdallah, Abdelrahman, Hamada, Mohamed, Nurseitov, Daniyar
Format: Online Article Text
Language: English
Published: MDPI 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321198/
https://www.ncbi.nlm.nih.gov/pubmed/34460538
http://dx.doi.org/10.3390/jimaging6120141
author Abdallah, Abdelrahman
Hamada, Mohamed
Nurseitov, Daniyar
collection PubMed
description This article considers the task of handwritten text recognition using attention-based encoder–decoder networks trained on the Kazakh and Russian languages. We have developed a novel deep neural network model based on a fully gated CNN, supported by multiple bidirectional gated recurrent unit (BGRU) layers and attention mechanisms to manipulate sophisticated features. The model achieves 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER), and 0.253 Sequence Error Rate (SER) on the first test dataset and 0.064 CER, 0.24 WER, and 0.361 SER on the second test dataset. Our proposed model is the first to address handwriting recognition in the Kazakh and Russian languages. Our results confirm the importance of the proposed Attention-Gated-CNN-BGRU approach for training handwritten text recognition models and indicate that it can lead to statistically significant improvements (p-value < 0.05) in sensitivity (recall) over the test datasets. The proposed method’s performance was evaluated using handwritten text databases in three languages: English, Russian, and Kazakh. It demonstrates better results on the Handwritten Kazakh and Russian (HKR) dataset than other well-known models.
format Online
Article
Text
id pubmed-8321198
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-8321198 2021-08-26 Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text Abdallah, Abdelrahman Hamada, Mohamed Nurseitov, Daniyar J Imaging Article MDPI 2020-12-18 /pmc/articles/PMC8321198/ /pubmed/34460538 http://dx.doi.org/10.3390/jimaging6120141 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text
topic Article
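
The description reports results as Character Error Rate (CER), Word Error Rate (WER), and Sequence Error Rate (SER). The record does not say how the authors computed them, so the following is only a minimal sketch of the common definitions: CER and WER as Levenshtein edit distance divided by reference length (over characters and whitespace-separated words respectively), and SER as the fraction of lines not recognized exactly. The function names `edit_distance` and `error_rates`, the corpus-level normalization, and the sample strings are assumptions made for illustration, not taken from the article.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rates(references, hypotheses):
    """Return (CER, WER, SER) over paired reference/hypothesis text lines.

    Assumption: errors are summed over the whole set and divided by the
    total reference length; per-line averaging is another common choice.
    """
    char_errors = char_total = 0
    word_errors = word_total = 0
    wrong_lines = 0
    for ref, hyp in zip(references, hypotheses):
        char_errors += edit_distance(ref, hyp)
        char_total += len(ref)
        ref_words, hyp_words = ref.split(), hyp.split()
        word_errors += edit_distance(ref_words, hyp_words)
        word_total += len(ref_words)
        wrong_lines += ref != hyp
    return (char_errors / char_total,
            word_errors / word_total,
            wrong_lines / len(references))


if __name__ == "__main__":
    # Hypothetical ground-truth lines and recognizer outputs.
    refs = ["привет мир", "hello world"]
    hyps = ["привет мир", "hallo world"]
    cer, wer, ser = error_rates(refs, hyps)
    print(f"CER={cer:.3f} WER={wer:.3f} SER={ser:.3f}")
```

Because SER counts a line as wrong on any single-character mistake, it is always at least as pessimistic as the other two, which is consistent with the ordering of the figures quoted in the description (CER < WER < SER on both test sets).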