Cargando…

Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification

Blind people can benefit greatly from a system capable of localising and reading comprehension text embedded in natural scenes and providing useful information that boosts their self-esteem and autonomy in everyday situations. Regardless of the fact that existing optical character recognition progra...

Descripción completa

Detalles Bibliográficos
Autores principales: Pandey, Binay Kumar, Pandey, Digvijay, Wariya, Subodh, Aggarwal, Gaurav, Rastogi, Rahul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Singapore 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553597/
http://dx.doi.org/10.1007/s41133-021-00051-5
_version_ 1784591614524522496
author Pandey, Binay Kumar
Pandey, Digvijay
Wariya, Subodh
Aggarwal, Gaurav
Rastogi, Rahul
author_facet Pandey, Binay Kumar
Pandey, Digvijay
Wariya, Subodh
Aggarwal, Gaurav
Rastogi, Rahul
author_sort Pandey, Binay Kumar
collection PubMed
description Blind people can benefit greatly from a system capable of localising and reading comprehension text embedded in natural scenes and providing useful information that boosts their self-esteem and autonomy in everyday situations. Regardless of the fact that existing optical character recognition programmes seem to be quick and effective, the majority of them are not able to correctly recognise text embedded in usual panorama images. The methodology described in this paper is to localise textual image regions and pre-process them using the naïve Bayesian algorithm. A weighted reading technique is used to generate the correct text data from the complicated image regions. Usually, images hold some disturbance as a result of the fact that filtration is proposed during the early pre-processing step. To restore the image's quality, the input image is processed employing gradient and contrast image methods. Following that, the contrast of the source images would be enhanced using an adaptive image map. The stroke width transform, Gabor’s transform, and weighted naïve Bayesian classifier methodologies have been used in complicated degraded images to segment, feature extraction, and detect textual and non-textual elements. Finally, to identify categorised textual data, the confluence of deep neural networks and particle swarm optimisation is being used. The text in the image is transformed into an acoustic output after identification. The dataset IIIT5K is used for the development portion, and the performance of the suggested come up is evaluated using parameters such as accuracy, recall, precision, and F1-score.
format Online
Article
Text
id pubmed-8553597
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Singapore
record_format MEDLINE/PubMed
spelling pubmed-85535972021-10-29 Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification Pandey, Binay Kumar Pandey, Digvijay Wariya, Subodh Aggarwal, Gaurav Rastogi, Rahul Augment Hum Res Original Paper Blind people can benefit greatly from a system capable of localising and reading comprehension text embedded in natural scenes and providing useful information that boosts their self-esteem and autonomy in everyday situations. Regardless of the fact that existing optical character recognition programmes seem to be quick and effective, the majority of them are not able to correctly recognise text embedded in usual panorama images. The methodology described in this paper is to localise textual image regions and pre-process them using the naïve Bayesian algorithm. A weighted reading technique is used to generate the correct text data from the complicated image regions. Usually, images hold some disturbance as a result of the fact that filtration is proposed during the early pre-processing step. To restore the image's quality, the input image is processed employing gradient and contrast image methods. Following that, the contrast of the source images would be enhanced using an adaptive image map. The stroke width transform, Gabor’s transform, and weighted naïve Bayesian classifier methodologies have been used in complicated degraded images to segment, feature extraction, and detect textual and non-textual elements. Finally, to identify categorised textual data, the confluence of deep neural networks and particle swarm optimisation is being used. The text in the image is transformed into an acoustic output after identification. The dataset IIIT5K is used for the development portion, and the performance of the suggested come up is evaluated using parameters such as accuracy, recall, precision, and F1-score. Springer Singapore 2021-10-29 2021 /pmc/articles/PMC8553597/ http://dx.doi.org/10.1007/s41133-021-00051-5 Text en © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Paper
Pandey, Binay Kumar
Pandey, Digvijay
Wariya, Subodh
Aggarwal, Gaurav
Rastogi, Rahul
Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification
title Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification
title_full Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification
title_fullStr Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification
title_full_unstemmed Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification
title_short Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification
title_sort deep learning and particle swarm optimisation-based techniques for visually impaired humans' text recognition and identification
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553597/
http://dx.doi.org/10.1007/s41133-021-00051-5
work_keys_str_mv AT pandeybinaykumar deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification
AT pandeydigvijay deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification
AT wariyasubodh deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification
AT aggarwalgaurav deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification
AT rastogirahul deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification