Cargando…

Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network

Deep learning is a subfield of artificial intelligence that allows the computer to adopt and learn some new rules. Deep learning algorithms can identify images, objects, observations, texts, and other structures. In recent years, scene text recognition has inspired many researchers from the computer...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kantipudi, MVV Prasad, Kumar, Sandeep, Kumar Jha, Ashish
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8632382/ https://www.ncbi.nlm.nih.gov/pubmed/34858492 http://dx.doi.org/10.1155/2021/2676780

_version_	1784607743804440576
author	Kantipudi, MVV Prasad Kumar, Sandeep Kumar Jha, Ashish
author_facet	Kantipudi, MVV Prasad Kumar, Sandeep Kumar Jha, Ashish
author_sort	Kantipudi, MVV Prasad
collection	PubMed
description	Deep learning is a subfield of artificial intelligence that allows the computer to adopt and learn some new rules. Deep learning algorithms can identify images, objects, observations, texts, and other structures. In recent years, scene text recognition has inspired many researchers from the computer vision community, and still, it needs improvement because of the poor performance of existing scene recognition algorithms. This research paper proposed a novel approach for scene text recognition that integrates bidirectional LSTM and deep convolution neural networks. In the proposed method, first, the contour of the image is identified and then it is fed into the CNN. CNN is used to generate the ordered sequence of the features from the contoured image. The sequence of features is now coded using the Bi-LSTM. Bi-LSTM is a handy tool for extracting the features from the sequence of words. Hence, this paper combines the two powerful mechanisms for extracting the features from the image, and contour-based input image makes the recognition process faster, which makes this technique better compared to existing methods. The results of the proposed methodology are evaluated on MSRATD 50 dataset, SVHN dataset, vehicle number plate dataset, SVT dataset, and random datasets, and the accuracy is 95.22%, 92.25%, 96.69%, 94.58%, and 98.12%, respectively. According to quantitative and qualitative analysis, this approach is more promising in terms of accuracy and precision rate.
format	Online Article Text
id	pubmed-8632382
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-86323822021-12-01 Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network Kantipudi, MVV Prasad Kumar, Sandeep Kumar Jha, Ashish Comput Intell Neurosci Research Article Deep learning is a subfield of artificial intelligence that allows the computer to adopt and learn some new rules. Deep learning algorithms can identify images, objects, observations, texts, and other structures. In recent years, scene text recognition has inspired many researchers from the computer vision community, and still, it needs improvement because of the poor performance of existing scene recognition algorithms. This research paper proposed a novel approach for scene text recognition that integrates bidirectional LSTM and deep convolution neural networks. In the proposed method, first, the contour of the image is identified and then it is fed into the CNN. CNN is used to generate the ordered sequence of the features from the contoured image. The sequence of features is now coded using the Bi-LSTM. Bi-LSTM is a handy tool for extracting the features from the sequence of words. Hence, this paper combines the two powerful mechanisms for extracting the features from the image, and contour-based input image makes the recognition process faster, which makes this technique better compared to existing methods. The results of the proposed methodology are evaluated on MSRATD 50 dataset, SVHN dataset, vehicle number plate dataset, SVT dataset, and random datasets, and the accuracy is 95.22%, 92.25%, 96.69%, 94.58%, and 98.12%, respectively. According to quantitative and qualitative analysis, this approach is more promising in terms of accuracy and precision rate. Hindawi 2021-11-23 /pmc/articles/PMC8632382/ /pubmed/34858492 http://dx.doi.org/10.1155/2021/2676780 Text en Copyright © 2021 MVV Prasad Kantipudi et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Kantipudi, MVV Prasad Kumar, Sandeep Kumar Jha, Ashish Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
title	Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
title_full	Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
title_fullStr	Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
title_full_unstemmed	Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
title_short	Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
title_sort	scene text recognition based on bidirectional lstm and deep neural network
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8632382/ https://www.ncbi.nlm.nih.gov/pubmed/34858492 http://dx.doi.org/10.1155/2021/2676780
work_keys_str_mv	AT kantipudimvvprasad scenetextrecognitionbasedonbidirectionallstmanddeepneuralnetwork AT kumarsandeep scenetextrecognitionbasedonbidirectionallstmanddeepneuralnetwork AT kumarjhaashish scenetextrecognitionbasedonbidirectionallstmanddeepneuralnetwork

Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network

Ejemplares similares