Cargando…

British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language

In this work, we show that a late fusion approach to multimodality in sign language recognition improves the overall ability of the model in comparison to the singular approaches of image classification (88.14%) and Leap Motion data classification (72.73%). With a large synchronous dataset of 18 BSL...

Descripción completa

Detalles Bibliográficos
Autores principales: Bird, Jordan J., Ekárt, Anikó, Faria, Diego R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571093/
https://www.ncbi.nlm.nih.gov/pubmed/32917024
http://dx.doi.org/10.3390/s20185151
_version_ 1783597097073770496
author Bird, Jordan J.
Ekárt, Anikó
Faria, Diego R.
author_facet Bird, Jordan J.
Ekárt, Anikó
Faria, Diego R.
author_sort Bird, Jordan J.
collection PubMed
description In this work, we show that a late fusion approach to multimodality in sign language recognition improves the overall ability of the model in comparison to the singular approaches of image classification (88.14%) and Leap Motion data classification (72.73%). With a large synchronous dataset of 18 BSL gestures collected from multiple subjects, two deep neural networks are benchmarked and compared to derive a best topology for each. The Vision model is implemented by a Convolutional Neural Network and optimised Artificial Neural Network, and the Leap Motion model is implemented by an evolutionary search of Artificial Neural Network topology. Next, the two best networks are fused for synchronised processing, which results in a better overall result (94.44%) as complementary features are learnt in addition to the original task. The hypothesis is further supported by application of the three models to a set of completely unseen data where a multimodality approach achieves the best results relative to the single sensor method. When transfer learning with the weights trained via British Sign Language, all three models outperform standard random weight distribution when classifying American Sign Language (ASL), and the best model overall for ASL classification was the transfer learning multimodality approach, which scored 82.55% accuracy.
format Online
Article
Text
id pubmed-7571093
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75710932020-10-28 British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language Bird, Jordan J. Ekárt, Anikó Faria, Diego R. Sensors (Basel) Article In this work, we show that a late fusion approach to multimodality in sign language recognition improves the overall ability of the model in comparison to the singular approaches of image classification (88.14%) and Leap Motion data classification (72.73%). With a large synchronous dataset of 18 BSL gestures collected from multiple subjects, two deep neural networks are benchmarked and compared to derive a best topology for each. The Vision model is implemented by a Convolutional Neural Network and optimised Artificial Neural Network, and the Leap Motion model is implemented by an evolutionary search of Artificial Neural Network topology. Next, the two best networks are fused for synchronised processing, which results in a better overall result (94.44%) as complementary features are learnt in addition to the original task. The hypothesis is further supported by application of the three models to a set of completely unseen data where a multimodality approach achieves the best results relative to the single sensor method. When transfer learning with the weights trained via British Sign Language, all three models outperform standard random weight distribution when classifying American Sign Language (ASL), and the best model overall for ASL classification was the transfer learning multimodality approach, which scored 82.55% accuracy. MDPI 2020-09-09 /pmc/articles/PMC7571093/ /pubmed/32917024 http://dx.doi.org/10.3390/s20185151 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bird, Jordan J.
Ekárt, Anikó
Faria, Diego R.
British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language
title British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language
title_full British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language
title_fullStr British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language
title_full_unstemmed British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language
title_short British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language
title_sort british sign language recognition via late fusion of computer vision and leap motion with transfer learning to american sign language
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571093/
https://www.ncbi.nlm.nih.gov/pubmed/32917024
http://dx.doi.org/10.3390/s20185151
work_keys_str_mv AT birdjordanj britishsignlanguagerecognitionvialatefusionofcomputervisionandleapmotionwithtransferlearningtoamericansignlanguage
AT ekartaniko britishsignlanguagerecognitionvialatefusionofcomputervisionandleapmotionwithtransferlearningtoamericansignlanguage
AT fariadiegor britishsignlanguagerecognitionvialatefusionofcomputervisionandleapmotionwithtransferlearningtoamericansignlanguage