Cargando…

Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition

Sign language recognition, an essential interface between the hearing and deaf-mute communities, faces challenges with high false positive rates and computational costs, even with the use of advanced deep learning techniques. Our proposed solution is a stacked encoded model, combining artificial int...

Descripción completa

Detalles Bibliográficos
Autores principales: Islam, Muhammad, Aloraini, Mohammed, Aladhadh, Suliman, Habib, Shabana, Khan, Asma, Alabdulatif, Abduatif, Alanazi, Turki M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674804/
https://www.ncbi.nlm.nih.gov/pubmed/38005455
http://dx.doi.org/10.3390/s23229068
_version_ 1785140914728992768
author Islam, Muhammad
Aloraini, Mohammed
Aladhadh, Suliman
Habib, Shabana
Khan, Asma
Alabdulatif, Abduatif
Alanazi, Turki M.
author_facet Islam, Muhammad
Aloraini, Mohammed
Aladhadh, Suliman
Habib, Shabana
Khan, Asma
Alabdulatif, Abduatif
Alanazi, Turki M.
author_sort Islam, Muhammad
collection PubMed
description Sign language recognition, an essential interface between the hearing and deaf-mute communities, faces challenges with high false positive rates and computational costs, even with the use of advanced deep learning techniques. Our proposed solution is a stacked encoded model, combining artificial intelligence (AI) with the Internet of Things (IoT), which refines feature extraction and classification to overcome these challenges. We leverage a lightweight backbone model for preliminary feature extraction and use stacked autoencoders to further refine these features. Our approach harnesses the scalability of big data, showing notable improvement in accuracy, precision, recall, F1-score, and complexity analysis. Our model’s effectiveness is demonstrated through testing on the ArSL2018 benchmark dataset, showcasing superior performance compared to state-of-the-art approaches. Additional validation through an ablation study with pre-trained convolutional neural network (CNN) models affirms our model’s efficacy across all evaluation metrics. Our work paves the way for the sustainable development of high-performing, IoT-based sign-language-recognition applications.
format Online
Article
Text
id pubmed-10674804
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106748042023-11-09 Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition Islam, Muhammad Aloraini, Mohammed Aladhadh, Suliman Habib, Shabana Khan, Asma Alabdulatif, Abduatif Alanazi, Turki M. Sensors (Basel) Article Sign language recognition, an essential interface between the hearing and deaf-mute communities, faces challenges with high false positive rates and computational costs, even with the use of advanced deep learning techniques. Our proposed solution is a stacked encoded model, combining artificial intelligence (AI) with the Internet of Things (IoT), which refines feature extraction and classification to overcome these challenges. We leverage a lightweight backbone model for preliminary feature extraction and use stacked autoencoders to further refine these features. Our approach harnesses the scalability of big data, showing notable improvement in accuracy, precision, recall, F1-score, and complexity analysis. Our model’s effectiveness is demonstrated through testing on the ArSL2018 benchmark dataset, showcasing superior performance compared to state-of-the-art approaches. Additional validation through an ablation study with pre-trained convolutional neural network (CNN) models affirms our model’s efficacy across all evaluation metrics. Our work paves the way for the sustainable development of high-performing, IoT-based sign-language-recognition applications. MDPI 2023-11-09 /pmc/articles/PMC10674804/ /pubmed/38005455 http://dx.doi.org/10.3390/s23229068 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Islam, Muhammad
Aloraini, Mohammed
Aladhadh, Suliman
Habib, Shabana
Khan, Asma
Alabdulatif, Abduatif
Alanazi, Turki M.
Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition
title Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition
title_full Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition
title_fullStr Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition
title_full_unstemmed Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition
title_short Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition
title_sort toward a vision-based intelligent system: a stacked encoded deep learning framework for sign language recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674804/
https://www.ncbi.nlm.nih.gov/pubmed/38005455
http://dx.doi.org/10.3390/s23229068
work_keys_str_mv AT islammuhammad towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition
AT alorainimohammed towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition
AT aladhadhsuliman towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition
AT habibshabana towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition
AT khanasma towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition
AT alabdulatifabduatif towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition
AT alanaziturkim towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition