Cargando…
Pushing the limits of remote RF sensing by reading lips under the face mask
The problem of Lip-reading has become an important research challenge in recent years. The goal is to recognise speech from lip movements. Most of the Lip-reading technologies developed so far are camera-based, which require video recording of the target. However, these technologies have well-known...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9452506/ https://www.ncbi.nlm.nih.gov/pubmed/36071056 http://dx.doi.org/10.1038/s41467-022-32231-1 |
_version_ | 1784784924392292352 |
---|---|
author | Hameed, Hira Usman, Muhammad Tahir, Ahsen Hussain, Amir Abbas, Hasan Cui, Tie Jun Imran, Muhammad Ali Abbasi, Qammer H. |
author_facet | Hameed, Hira Usman, Muhammad Tahir, Ahsen Hussain, Amir Abbas, Hasan Cui, Tie Jun Imran, Muhammad Ali Abbasi, Qammer H. |
author_sort | Hameed, Hira |
collection | PubMed |
description | The problem of Lip-reading has become an important research challenge in recent years. The goal is to recognise speech from lip movements. Most of the Lip-reading technologies developed so far are camera-based, which require video recording of the target. However, these technologies have well-known limitations of occlusion and ambient lighting with serious privacy concerns. Furthermore, vision-based technologies are not useful for multi-modal hearing aids in the coronavirus (COVID-19) environment, where face masks have become a norm. This paper aims to solve the fundamental limitations of camera-based systems by proposing a radio frequency (RF) based Lip-reading framework, having an ability to read lips under face masks. The framework employs Wi-Fi and radar technologies as enablers of RF sensing based Lip-reading. A dataset comprising of vowels A, E, I, O, U and empty (static/closed lips) is collected using both technologies, with a face mask. The collected data is used to train machine learning (ML) and deep learning (DL) models. A high classification accuracy of 95% is achieved on the Wi-Fi data utilising neural network (NN) models. Moreover, similar accuracy is achieved by VGG16 deep learning model on the collected radar-based dataset. |
format | Online Article Text |
id | pubmed-9452506 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-94525062022-09-09 Pushing the limits of remote RF sensing by reading lips under the face mask Hameed, Hira Usman, Muhammad Tahir, Ahsen Hussain, Amir Abbas, Hasan Cui, Tie Jun Imran, Muhammad Ali Abbasi, Qammer H. Nat Commun Article The problem of Lip-reading has become an important research challenge in recent years. The goal is to recognise speech from lip movements. Most of the Lip-reading technologies developed so far are camera-based, which require video recording of the target. However, these technologies have well-known limitations of occlusion and ambient lighting with serious privacy concerns. Furthermore, vision-based technologies are not useful for multi-modal hearing aids in the coronavirus (COVID-19) environment, where face masks have become a norm. This paper aims to solve the fundamental limitations of camera-based systems by proposing a radio frequency (RF) based Lip-reading framework, having an ability to read lips under face masks. The framework employs Wi-Fi and radar technologies as enablers of RF sensing based Lip-reading. A dataset comprising of vowels A, E, I, O, U and empty (static/closed lips) is collected using both technologies, with a face mask. The collected data is used to train machine learning (ML) and deep learning (DL) models. A high classification accuracy of 95% is achieved on the Wi-Fi data utilising neural network (NN) models. Moreover, similar accuracy is achieved by VGG16 deep learning model on the collected radar-based dataset. Nature Publishing Group UK 2022-09-07 /pmc/articles/PMC9452506/ /pubmed/36071056 http://dx.doi.org/10.1038/s41467-022-32231-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Hameed, Hira Usman, Muhammad Tahir, Ahsen Hussain, Amir Abbas, Hasan Cui, Tie Jun Imran, Muhammad Ali Abbasi, Qammer H. Pushing the limits of remote RF sensing by reading lips under the face mask |
title | Pushing the limits of remote RF sensing by reading lips under the face mask |
title_full | Pushing the limits of remote RF sensing by reading lips under the face mask |
title_fullStr | Pushing the limits of remote RF sensing by reading lips under the face mask |
title_full_unstemmed | Pushing the limits of remote RF sensing by reading lips under the face mask |
title_short | Pushing the limits of remote RF sensing by reading lips under the face mask |
title_sort | pushing the limits of remote rf sensing by reading lips under the face mask |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9452506/ https://www.ncbi.nlm.nih.gov/pubmed/36071056 http://dx.doi.org/10.1038/s41467-022-32231-1 |
work_keys_str_mv | AT hameedhira pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask AT usmanmuhammad pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask AT tahirahsen pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask AT hussainamir pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask AT abbashasan pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask AT cuitiejun pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask AT imranmuhammadali pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask AT abbasiqammerh pushingthelimitsofremoterfsensingbyreadinglipsunderthefacemask |