Cargando…
Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification
Blind people can benefit greatly from a system capable of localising and reading comprehension text embedded in natural scenes and providing useful information that boosts their self-esteem and autonomy in everyday situations. Regardless of the fact that existing optical character recognition progra...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Singapore
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553597/ http://dx.doi.org/10.1007/s41133-021-00051-5 |
_version_ | 1784591614524522496 |
---|---|
author | Pandey, Binay Kumar Pandey, Digvijay Wariya, Subodh Aggarwal, Gaurav Rastogi, Rahul |
author_facet | Pandey, Binay Kumar Pandey, Digvijay Wariya, Subodh Aggarwal, Gaurav Rastogi, Rahul |
author_sort | Pandey, Binay Kumar |
collection | PubMed |
description | Blind people can benefit greatly from a system capable of localising and reading comprehension text embedded in natural scenes and providing useful information that boosts their self-esteem and autonomy in everyday situations. Regardless of the fact that existing optical character recognition programmes seem to be quick and effective, the majority of them are not able to correctly recognise text embedded in usual panorama images. The methodology described in this paper is to localise textual image regions and pre-process them using the naïve Bayesian algorithm. A weighted reading technique is used to generate the correct text data from the complicated image regions. Usually, images hold some disturbance as a result of the fact that filtration is proposed during the early pre-processing step. To restore the image's quality, the input image is processed employing gradient and contrast image methods. Following that, the contrast of the source images would be enhanced using an adaptive image map. The stroke width transform, Gabor’s transform, and weighted naïve Bayesian classifier methodologies have been used in complicated degraded images to segment, feature extraction, and detect textual and non-textual elements. Finally, to identify categorised textual data, the confluence of deep neural networks and particle swarm optimisation is being used. The text in the image is transformed into an acoustic output after identification. The dataset IIIT5K is used for the development portion, and the performance of the suggested come up is evaluated using parameters such as accuracy, recall, precision, and F1-score. |
format | Online Article Text |
id | pubmed-8553597 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Singapore |
record_format | MEDLINE/PubMed |
spelling | pubmed-85535972021-10-29 Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification Pandey, Binay Kumar Pandey, Digvijay Wariya, Subodh Aggarwal, Gaurav Rastogi, Rahul Augment Hum Res Original Paper Blind people can benefit greatly from a system capable of localising and reading comprehension text embedded in natural scenes and providing useful information that boosts their self-esteem and autonomy in everyday situations. Regardless of the fact that existing optical character recognition programmes seem to be quick and effective, the majority of them are not able to correctly recognise text embedded in usual panorama images. The methodology described in this paper is to localise textual image regions and pre-process them using the naïve Bayesian algorithm. A weighted reading technique is used to generate the correct text data from the complicated image regions. Usually, images hold some disturbance as a result of the fact that filtration is proposed during the early pre-processing step. To restore the image's quality, the input image is processed employing gradient and contrast image methods. Following that, the contrast of the source images would be enhanced using an adaptive image map. The stroke width transform, Gabor’s transform, and weighted naïve Bayesian classifier methodologies have been used in complicated degraded images to segment, feature extraction, and detect textual and non-textual elements. Finally, to identify categorised textual data, the confluence of deep neural networks and particle swarm optimisation is being used. The text in the image is transformed into an acoustic output after identification. The dataset IIIT5K is used for the development portion, and the performance of the suggested come up is evaluated using parameters such as accuracy, recall, precision, and F1-score. Springer Singapore 2021-10-29 2021 /pmc/articles/PMC8553597/ http://dx.doi.org/10.1007/s41133-021-00051-5 Text en © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Paper Pandey, Binay Kumar Pandey, Digvijay Wariya, Subodh Aggarwal, Gaurav Rastogi, Rahul Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification |
title | Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification |
title_full | Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification |
title_fullStr | Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification |
title_full_unstemmed | Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification |
title_short | Deep Learning and Particle Swarm Optimisation-Based Techniques for Visually Impaired Humans' Text Recognition and Identification |
title_sort | deep learning and particle swarm optimisation-based techniques for visually impaired humans' text recognition and identification |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553597/ http://dx.doi.org/10.1007/s41133-021-00051-5 |
work_keys_str_mv | AT pandeybinaykumar deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification AT pandeydigvijay deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification AT wariyasubodh deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification AT aggarwalgaurav deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification AT rastogirahul deeplearningandparticleswarmoptimisationbasedtechniquesforvisuallyimpairedhumanstextrecognitionandidentification |