Cargando…

COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features

We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilitie...

Descripción completa

Detalles Bibliográficos
Autores principales: Pahar, Madhurananda, Klopper, Marisa, Warren, Robin, Niesler, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8679499/
https://www.ncbi.nlm.nih.gov/pubmed/34954610
http://dx.doi.org/10.1016/j.compbiomed.2021.105153
_version_ 1784616536342790144
author Pahar, Madhurananda
Klopper, Marisa
Warren, Robin
Niesler, Thomas
author_facet Pahar, Madhurananda
Klopper, Marisa
Warren, Robin
Niesler, Thomas
author_sort Pahar, Madhurananda
collection PubMed
description We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilities and can be deployed on inexpensive consumer hardware such as a smartphone. We use datasets that contain cough, sneeze, speech and other noises, but do not contain COVID-19 labels, to pre-train three deep neural networks: a CNN, an LSTM and a Resnet50. These pre-trained networks are subsequently either fine-tuned using smaller datasets of coughing with COVID-19 labels in the process of transfer learning, or are used as bottleneck feature extractors. Results show that a Resnet50 classifier trained by this transfer learning process delivers optimal or near-optimal performance across all datasets achieving areas under the receiver operating characteristic (ROC AUC) of 0.98, 0.94 and 0.92 respectively for all three sound classes: coughs, breaths and speech. This indicates that coughs carry the strongest COVID-19 signature, followed by breath and speech. Our results also show that applying transfer learning and extracting bottleneck features using the larger datasets without COVID-19 labels led not only to improved performance, but also to a marked reduction in the standard deviation of the classifier AUCs measured over the outer folds during nested cross-validation, indicating better generalisation. We conclude that deep transfer learning and bottleneck feature extraction can improve COVID-19 cough, breath and speech audio classification, yielding automatic COVID-19 detection with a better and more consistent overall performance.
format Online
Article
Text
id pubmed-8679499
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-86794992021-12-17 COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas Comput Biol Med Article We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilities and can be deployed on inexpensive consumer hardware such as a smartphone. We use datasets that contain cough, sneeze, speech and other noises, but do not contain COVID-19 labels, to pre-train three deep neural networks: a CNN, an LSTM and a Resnet50. These pre-trained networks are subsequently either fine-tuned using smaller datasets of coughing with COVID-19 labels in the process of transfer learning, or are used as bottleneck feature extractors. Results show that a Resnet50 classifier trained by this transfer learning process delivers optimal or near-optimal performance across all datasets achieving areas under the receiver operating characteristic (ROC AUC) of 0.98, 0.94 and 0.92 respectively for all three sound classes: coughs, breaths and speech. This indicates that coughs carry the strongest COVID-19 signature, followed by breath and speech. Our results also show that applying transfer learning and extracting bottleneck features using the larger datasets without COVID-19 labels led not only to improved performance, but also to a marked reduction in the standard deviation of the classifier AUCs measured over the outer folds during nested cross-validation, indicating better generalisation. We conclude that deep transfer learning and bottleneck feature extraction can improve COVID-19 cough, breath and speech audio classification, yielding automatic COVID-19 detection with a better and more consistent overall performance. Elsevier Ltd. 2022-02 2021-12-17 /pmc/articles/PMC8679499/ /pubmed/34954610 http://dx.doi.org/10.1016/j.compbiomed.2021.105153 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Pahar, Madhurananda
Klopper, Marisa
Warren, Robin
Niesler, Thomas
COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_full COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_fullStr COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_full_unstemmed COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_short COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_sort covid-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8679499/
https://www.ncbi.nlm.nih.gov/pubmed/34954610
http://dx.doi.org/10.1016/j.compbiomed.2021.105153
work_keys_str_mv AT paharmadhurananda covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures
AT kloppermarisa covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures
AT warrenrobin covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures
AT nieslerthomas covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures