Cargando…

FluentSigners-50: A signer independent benchmark dataset for sign language processing

This paper presents a new large-scale signer independent dataset for Kazakh-Russian Sign Language (KRSL) for the purposes of Sign Language Processing. We envision it to serve as a new benchmark dataset for performance evaluations of Continuous Sign Language Recognition (CSLR) and Translation (CSLT)...

Descripción completa

Detalles Bibliográficos
Autores principales: Mukushev, Medet, Ubingazhibov, Aidyn, Kydyrbekova, Aigerim, Imashev, Alfarabi, Kimmelman, Vadim, Sandygulova, Anara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9467305/
https://www.ncbi.nlm.nih.gov/pubmed/36094924
http://dx.doi.org/10.1371/journal.pone.0273649
_version_ 1784788163321921536
author Mukushev, Medet
Ubingazhibov, Aidyn
Kydyrbekova, Aigerim
Imashev, Alfarabi
Kimmelman, Vadim
Sandygulova, Anara
author_facet Mukushev, Medet
Ubingazhibov, Aidyn
Kydyrbekova, Aigerim
Imashev, Alfarabi
Kimmelman, Vadim
Sandygulova, Anara
author_sort Mukushev, Medet
collection PubMed
description This paper presents a new large-scale signer independent dataset for Kazakh-Russian Sign Language (KRSL) for the purposes of Sign Language Processing. We envision it to serve as a new benchmark dataset for performance evaluations of Continuous Sign Language Recognition (CSLR) and Translation (CSLT) tasks. The proposed FluentSigners-50 dataset consists of 173 sentences performed by 50 KRSL signers resulting in 43,250 video samples. Dataset contributors recorded videos in real-life settings on a wide variety of backgrounds using various devices such as smartphones and web cameras. Therefore, distance to the camera, camera angles and aspect ratio, video quality, and frame rates varied for each dataset contributor. Additionally, the proposed dataset contains a high degree of linguistic and inter-signer variability and thus is a better training set for recognizing a real-life sign language. FluentSigners-50 baseline is established using two state-of-the-art methods, Stochastic CSLR and TSPNet. To this end, we carefully prepared three benchmark train-test splits for models’ evaluations in terms of: signer independence, age independence, and unseen sentences. FluentSigners-50 is publicly available at https://krslproject.github.io/FluentSigners-50/
format Online
Article
Text
id pubmed-9467305
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-94673052022-09-13 FluentSigners-50: A signer independent benchmark dataset for sign language processing Mukushev, Medet Ubingazhibov, Aidyn Kydyrbekova, Aigerim Imashev, Alfarabi Kimmelman, Vadim Sandygulova, Anara PLoS One Research Article This paper presents a new large-scale signer independent dataset for Kazakh-Russian Sign Language (KRSL) for the purposes of Sign Language Processing. We envision it to serve as a new benchmark dataset for performance evaluations of Continuous Sign Language Recognition (CSLR) and Translation (CSLT) tasks. The proposed FluentSigners-50 dataset consists of 173 sentences performed by 50 KRSL signers resulting in 43,250 video samples. Dataset contributors recorded videos in real-life settings on a wide variety of backgrounds using various devices such as smartphones and web cameras. Therefore, distance to the camera, camera angles and aspect ratio, video quality, and frame rates varied for each dataset contributor. Additionally, the proposed dataset contains a high degree of linguistic and inter-signer variability and thus is a better training set for recognizing a real-life sign language. FluentSigners-50 baseline is established using two state-of-the-art methods, Stochastic CSLR and TSPNet. To this end, we carefully prepared three benchmark train-test splits for models’ evaluations in terms of: signer independence, age independence, and unseen sentences. FluentSigners-50 is publicly available at https://krslproject.github.io/FluentSigners-50/ Public Library of Science 2022-09-12 /pmc/articles/PMC9467305/ /pubmed/36094924 http://dx.doi.org/10.1371/journal.pone.0273649 Text en © 2022 Mukushev et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mukushev, Medet
Ubingazhibov, Aidyn
Kydyrbekova, Aigerim
Imashev, Alfarabi
Kimmelman, Vadim
Sandygulova, Anara
FluentSigners-50: A signer independent benchmark dataset for sign language processing
title FluentSigners-50: A signer independent benchmark dataset for sign language processing
title_full FluentSigners-50: A signer independent benchmark dataset for sign language processing
title_fullStr FluentSigners-50: A signer independent benchmark dataset for sign language processing
title_full_unstemmed FluentSigners-50: A signer independent benchmark dataset for sign language processing
title_short FluentSigners-50: A signer independent benchmark dataset for sign language processing
title_sort fluentsigners-50: a signer independent benchmark dataset for sign language processing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9467305/
https://www.ncbi.nlm.nih.gov/pubmed/36094924
http://dx.doi.org/10.1371/journal.pone.0273649
work_keys_str_mv AT mukushevmedet fluentsigners50asignerindependentbenchmarkdatasetforsignlanguageprocessing
AT ubingazhibovaidyn fluentsigners50asignerindependentbenchmarkdatasetforsignlanguageprocessing
AT kydyrbekovaaigerim fluentsigners50asignerindependentbenchmarkdatasetforsignlanguageprocessing
AT imashevalfarabi fluentsigners50asignerindependentbenchmarkdatasetforsignlanguageprocessing
AT kimmelmanvadim fluentsigners50asignerindependentbenchmarkdatasetforsignlanguageprocessing
AT sandygulovaanara fluentsigners50asignerindependentbenchmarkdatasetforsignlanguageprocessing