Cargando…

Vision-based Pakistani sign language recognition using bag-of-words and support vector machines

In order to perform their daily activities, a person is required to communicating with others. This can be a major obstacle for the deaf population of the world, who communicate using sign languages (SL). Pakistani Sign Language (PSL) is used by more than 250,000 deaf Pakistanis. Developing a SL rec...

Descripción completa

Detalles Bibliográficos
Autores principales: Mirza, Muhammad Shaheer, Munaf, Sheikh Muhammad, Azim, Fahad, Ali, Shahid, Khan, Saad Jawaid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9734649/
https://www.ncbi.nlm.nih.gov/pubmed/36494382
http://dx.doi.org/10.1038/s41598-022-15864-6
_version_ 1784846619470987264
author Mirza, Muhammad Shaheer
Munaf, Sheikh Muhammad
Azim, Fahad
Ali, Shahid
Khan, Saad Jawaid
author_facet Mirza, Muhammad Shaheer
Munaf, Sheikh Muhammad
Azim, Fahad
Ali, Shahid
Khan, Saad Jawaid
author_sort Mirza, Muhammad Shaheer
collection PubMed
description In order to perform their daily activities, a person is required to communicating with others. This can be a major obstacle for the deaf population of the world, who communicate using sign languages (SL). Pakistani Sign Language (PSL) is used by more than 250,000 deaf Pakistanis. Developing a SL recognition system would greatly facilitate these people. This study aimed to collect data of static and dynamic PSL alphabets and to develop a vision-based system for their recognition using Bag-of-Words (BoW) and Support Vector Machine (SVM) techniques. A total of 5120 images for 36 static PSL alphabet signs and 353 videos with 45,224 frames for 3 dynamic PSL alphabet signs were collected from 10 native signers of PSL. The developed system used the collected data as input, resized the data to various scales and converted the RGB images into grayscale. The resized grayscale images were segmented using Thresholding technique and features were extracted using Speeded Up Robust Feature (SURF). The obtained SURF descriptors were clustered using K-means clustering. A BoW was obtained by computing the Euclidean distance between the SURF descriptors and the clustered data. The codebooks were divided into training and testing using fivefold cross validation. The highest overall classification accuracy for static PSL signs was 97.80% at 750 × 750 image dimensions and 500 Bags. For dynamic PSL signs a 96.53% accuracy was obtained at 480 × 270 video resolution and 200 Bags.
format Online
Article
Text
id pubmed-9734649
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97346492022-12-11 Vision-based Pakistani sign language recognition using bag-of-words and support vector machines Mirza, Muhammad Shaheer Munaf, Sheikh Muhammad Azim, Fahad Ali, Shahid Khan, Saad Jawaid Sci Rep Article In order to perform their daily activities, a person is required to communicating with others. This can be a major obstacle for the deaf population of the world, who communicate using sign languages (SL). Pakistani Sign Language (PSL) is used by more than 250,000 deaf Pakistanis. Developing a SL recognition system would greatly facilitate these people. This study aimed to collect data of static and dynamic PSL alphabets and to develop a vision-based system for their recognition using Bag-of-Words (BoW) and Support Vector Machine (SVM) techniques. A total of 5120 images for 36 static PSL alphabet signs and 353 videos with 45,224 frames for 3 dynamic PSL alphabet signs were collected from 10 native signers of PSL. The developed system used the collected data as input, resized the data to various scales and converted the RGB images into grayscale. The resized grayscale images were segmented using Thresholding technique and features were extracted using Speeded Up Robust Feature (SURF). The obtained SURF descriptors were clustered using K-means clustering. A BoW was obtained by computing the Euclidean distance between the SURF descriptors and the clustered data. The codebooks were divided into training and testing using fivefold cross validation. The highest overall classification accuracy for static PSL signs was 97.80% at 750 × 750 image dimensions and 500 Bags. For dynamic PSL signs a 96.53% accuracy was obtained at 480 × 270 video resolution and 200 Bags. Nature Publishing Group UK 2022-12-09 /pmc/articles/PMC9734649/ /pubmed/36494382 http://dx.doi.org/10.1038/s41598-022-15864-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Mirza, Muhammad Shaheer
Munaf, Sheikh Muhammad
Azim, Fahad
Ali, Shahid
Khan, Saad Jawaid
Vision-based Pakistani sign language recognition using bag-of-words and support vector machines
title Vision-based Pakistani sign language recognition using bag-of-words and support vector machines
title_full Vision-based Pakistani sign language recognition using bag-of-words and support vector machines
title_fullStr Vision-based Pakistani sign language recognition using bag-of-words and support vector machines
title_full_unstemmed Vision-based Pakistani sign language recognition using bag-of-words and support vector machines
title_short Vision-based Pakistani sign language recognition using bag-of-words and support vector machines
title_sort vision-based pakistani sign language recognition using bag-of-words and support vector machines
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9734649/
https://www.ncbi.nlm.nih.gov/pubmed/36494382
http://dx.doi.org/10.1038/s41598-022-15864-6
work_keys_str_mv AT mirzamuhammadshaheer visionbasedpakistanisignlanguagerecognitionusingbagofwordsandsupportvectormachines
AT munafsheikhmuhammad visionbasedpakistanisignlanguagerecognitionusingbagofwordsandsupportvectormachines
AT azimfahad visionbasedpakistanisignlanguagerecognitionusingbagofwordsandsupportvectormachines
AT alishahid visionbasedpakistanisignlanguagerecognitionusingbagofwordsandsupportvectormachines
AT khansaadjawaid visionbasedpakistanisignlanguagerecognitionusingbagofwordsandsupportvectormachines