Cargando…
Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition
Sign language recognition, an essential interface between the hearing and deaf-mute communities, faces challenges with high false positive rates and computational costs, even with the use of advanced deep learning techniques. Our proposed solution is a stacked encoded model, combining artificial int...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674804/ https://www.ncbi.nlm.nih.gov/pubmed/38005455 http://dx.doi.org/10.3390/s23229068 |
_version_ | 1785140914728992768 |
---|---|
author | Islam, Muhammad Aloraini, Mohammed Aladhadh, Suliman Habib, Shabana Khan, Asma Alabdulatif, Abduatif Alanazi, Turki M. |
author_facet | Islam, Muhammad Aloraini, Mohammed Aladhadh, Suliman Habib, Shabana Khan, Asma Alabdulatif, Abduatif Alanazi, Turki M. |
author_sort | Islam, Muhammad |
collection | PubMed |
description | Sign language recognition, an essential interface between the hearing and deaf-mute communities, faces challenges with high false positive rates and computational costs, even with the use of advanced deep learning techniques. Our proposed solution is a stacked encoded model, combining artificial intelligence (AI) with the Internet of Things (IoT), which refines feature extraction and classification to overcome these challenges. We leverage a lightweight backbone model for preliminary feature extraction and use stacked autoencoders to further refine these features. Our approach harnesses the scalability of big data, showing notable improvement in accuracy, precision, recall, F1-score, and complexity analysis. Our model’s effectiveness is demonstrated through testing on the ArSL2018 benchmark dataset, showcasing superior performance compared to state-of-the-art approaches. Additional validation through an ablation study with pre-trained convolutional neural network (CNN) models affirms our model’s efficacy across all evaluation metrics. Our work paves the way for the sustainable development of high-performing, IoT-based sign-language-recognition applications. |
format | Online Article Text |
id | pubmed-10674804 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-106748042023-11-09 Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition Islam, Muhammad Aloraini, Mohammed Aladhadh, Suliman Habib, Shabana Khan, Asma Alabdulatif, Abduatif Alanazi, Turki M. Sensors (Basel) Article Sign language recognition, an essential interface between the hearing and deaf-mute communities, faces challenges with high false positive rates and computational costs, even with the use of advanced deep learning techniques. Our proposed solution is a stacked encoded model, combining artificial intelligence (AI) with the Internet of Things (IoT), which refines feature extraction and classification to overcome these challenges. We leverage a lightweight backbone model for preliminary feature extraction and use stacked autoencoders to further refine these features. Our approach harnesses the scalability of big data, showing notable improvement in accuracy, precision, recall, F1-score, and complexity analysis. Our model’s effectiveness is demonstrated through testing on the ArSL2018 benchmark dataset, showcasing superior performance compared to state-of-the-art approaches. Additional validation through an ablation study with pre-trained convolutional neural network (CNN) models affirms our model’s efficacy across all evaluation metrics. Our work paves the way for the sustainable development of high-performing, IoT-based sign-language-recognition applications. MDPI 2023-11-09 /pmc/articles/PMC10674804/ /pubmed/38005455 http://dx.doi.org/10.3390/s23229068 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Islam, Muhammad Aloraini, Mohammed Aladhadh, Suliman Habib, Shabana Khan, Asma Alabdulatif, Abduatif Alanazi, Turki M. Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition |
title | Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition |
title_full | Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition |
title_fullStr | Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition |
title_full_unstemmed | Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition |
title_short | Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition |
title_sort | toward a vision-based intelligent system: a stacked encoded deep learning framework for sign language recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674804/ https://www.ncbi.nlm.nih.gov/pubmed/38005455 http://dx.doi.org/10.3390/s23229068 |
work_keys_str_mv | AT islammuhammad towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition AT alorainimohammed towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition AT aladhadhsuliman towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition AT habibshabana towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition AT khanasma towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition AT alabdulatifabduatif towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition AT alanaziturkim towardavisionbasedintelligentsystemastackedencodeddeeplearningframeworkforsignlanguagerecognition |