Cargando…

Pushing the limits of remote RF sensing by reading lips under the face mask

The problem of Lip-reading has become an important research challenge in recent years. The goal is to recognise speech from lip movements. Most of the Lip-reading technologies developed so far are camera-based, which require video recording of the target. However, these technologies have well-known...

Descripción completa

Detalles Bibliográficos
Autores principales: Hameed, Hira, Usman, Muhammad, Tahir, Ahsen, Hussain, Amir, Abbas, Hasan, Cui, Tie Jun, Imran, Muhammad Ali, Abbasi, Qammer H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9452506/
https://www.ncbi.nlm.nih.gov/pubmed/36071056
http://dx.doi.org/10.1038/s41467-022-32231-1
_version_ 1784784924392292352
author Hameed, Hira
Usman, Muhammad
Tahir, Ahsen
Hussain, Amir
Abbas, Hasan
Cui, Tie Jun
Imran, Muhammad Ali
Abbasi, Qammer H.
author_facet Hameed, Hira
Usman, Muhammad
Tahir, Ahsen
Hussain, Amir
Abbas, Hasan
Cui, Tie Jun
Imran, Muhammad Ali
Abbasi, Qammer H.
author_sort Hameed, Hira
collection PubMed
description The problem of Lip-reading has become an important research challenge in recent years. The goal is to recognise speech from lip movements. Most of the Lip-reading technologies developed so far are camera-based, which require video recording of the target. However, these technologies have well-known limitations of occlusion and ambient lighting with serious privacy concerns. Furthermore, vision-based technologies are not useful for multi-modal hearing aids in the coronavirus (COVID-19) environment, where face masks have become a norm. This paper aims to solve the fundamental limitations of camera-based systems by proposing a radio frequency (RF) based Lip-reading framework, having an ability to read lips under face masks. The framework employs Wi-Fi and radar technologies as enablers of RF sensing based Lip-reading. A dataset comprising of vowels A, E, I, O, U and empty (static/closed lips) is collected using both technologies, with a face mask. The collected data is used to train machine learning (ML) and deep learning (DL) models. A high classification accuracy of 95% is achieved on the Wi-Fi data utilising neural network (NN) models. Moreover, similar accuracy is achieved by VGG16 deep learning model on the collected radar-based dataset.
format Online
Article
Text
id pubmed-9452506
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-94525062022-09-09 Pushing the limits of remote RF sensing by reading lips under the face mask Hameed, Hira Usman, Muhammad Tahir, Ahsen Hussain, Amir Abbas, Hasan Cui, Tie Jun Imran, Muhammad Ali Abbasi, Qammer H. Nat Commun Article The problem of Lip-reading has become an important research challenge in recent years. The goal is to recognise speech from lip movements. Most of the Lip-reading technologies developed so far are camera-based, which require video recording of the target. However, these technologies have well-known limitations of occlusion and ambient lighting with serious privacy concerns. Furthermore, vision-based technologies are not useful for multi-modal hearing aids in the coronavirus (COVID-19) environment, where face masks have become a norm. This paper aims to solve the fundamental limitations of camera-based systems by proposing a radio frequency (RF) based Lip-reading framework, having an ability to read lips under face masks. The framework employs Wi-Fi and radar technologies as enablers of RF sensing based Lip-reading. A dataset comprising of vowels A, E, I, O, U and empty (static/closed lips) is collected using both technologies, with a face mask. The collected data is used to train machine learning (ML) and deep learning (DL) models. A high classification accuracy of 95% is achieved on the Wi-Fi data utilising neural network (NN) models. Moreover, similar accuracy is achieved by VGG16 deep learning model on the collected radar-based dataset. Nature Publishing Group UK 2022-09-07 /pmc/articles/PMC9452506/ /pubmed/36071056 http://dx.doi.org/10.1038/s41467-022-32231-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Hameed, Hira
Usman, Muhammad
Tahir, Ahsen
Hussain, Amir
Abbas, Hasan
Cui, Tie Jun
Imran, Muhammad Ali
Abbasi, Qammer H.
Pushing the limits of remote RF sensing by reading lips under the face mask
title Pushing the limits of remote RF sensing by reading lips under the face mask
title_full Pushing the limits of remote RF sensing by reading lips under the face mask
title_fullStr Pushing the limits of remote RF sensing by reading lips under the face mask
title_full_unstemmed Pushing the limits of remote RF sensing by reading lips under the face mask
title_short Pushing the limits of remote RF sensing by reading lips under the face mask
title_sort pushing the limits of remote rf sensing by reading lips under the face mask
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9452506/
https://www.ncbi.nlm.nih.gov/pubmed/36071056
http://dx.doi.org/10.1038/s41467-022-32231-1
work_keys_str_mv AT hameedhira pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask
AT usmanmuhammad pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask
AT tahirahsen pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask
AT hussainamir pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask
AT abbashasan pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask
AT cuitiejun pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask
AT imranmuhammadali pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask
AT abbasiqammerh pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask