Cargando…
COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilitie...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8679499/ https://www.ncbi.nlm.nih.gov/pubmed/34954610 http://dx.doi.org/10.1016/j.compbiomed.2021.105153 |
_version_ | 1784616536342790144 |
---|---|
author | Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas |
author_facet | Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas |
author_sort | Pahar, Madhurananda |
collection | PubMed |
description | We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilities and can be deployed on inexpensive consumer hardware such as a smartphone. We use datasets that contain cough, sneeze, speech and other noises, but do not contain COVID-19 labels, to pre-train three deep neural networks: a CNN, an LSTM and a Resnet50. These pre-trained networks are subsequently either fine-tuned using smaller datasets of coughing with COVID-19 labels in the process of transfer learning, or are used as bottleneck feature extractors. Results show that a Resnet50 classifier trained by this transfer learning process delivers optimal or near-optimal performance across all datasets achieving areas under the receiver operating characteristic (ROC AUC) of 0.98, 0.94 and 0.92 respectively for all three sound classes: coughs, breaths and speech. This indicates that coughs carry the strongest COVID-19 signature, followed by breath and speech. Our results also show that applying transfer learning and extracting bottleneck features using the larger datasets without COVID-19 labels led not only to improved performance, but also to a marked reduction in the standard deviation of the classifier AUCs measured over the outer folds during nested cross-validation, indicating better generalisation. We conclude that deep transfer learning and bottleneck feature extraction can improve COVID-19 cough, breath and speech audio classification, yielding automatic COVID-19 detection with a better and more consistent overall performance. |
format | Online Article Text |
id | pubmed-8679499 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-86794992021-12-17 COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas Comput Biol Med Article We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilities and can be deployed on inexpensive consumer hardware such as a smartphone. We use datasets that contain cough, sneeze, speech and other noises, but do not contain COVID-19 labels, to pre-train three deep neural networks: a CNN, an LSTM and a Resnet50. These pre-trained networks are subsequently either fine-tuned using smaller datasets of coughing with COVID-19 labels in the process of transfer learning, or are used as bottleneck feature extractors. Results show that a Resnet50 classifier trained by this transfer learning process delivers optimal or near-optimal performance across all datasets achieving areas under the receiver operating characteristic (ROC AUC) of 0.98, 0.94 and 0.92 respectively for all three sound classes: coughs, breaths and speech. This indicates that coughs carry the strongest COVID-19 signature, followed by breath and speech. Our results also show that applying transfer learning and extracting bottleneck features using the larger datasets without COVID-19 labels led not only to improved performance, but also to a marked reduction in the standard deviation of the classifier AUCs measured over the outer folds during nested cross-validation, indicating better generalisation. We conclude that deep transfer learning and bottleneck feature extraction can improve COVID-19 cough, breath and speech audio classification, yielding automatic COVID-19 detection with a better and more consistent overall performance. Elsevier Ltd. 2022-02 2021-12-17 /pmc/articles/PMC8679499/ /pubmed/34954610 http://dx.doi.org/10.1016/j.compbiomed.2021.105153 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features |
title | COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features |
title_full | COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features |
title_fullStr | COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features |
title_full_unstemmed | COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features |
title_short | COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features |
title_sort | covid-19 detection in cough, breath and speech using deep transfer learning and bottleneck features |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8679499/ https://www.ncbi.nlm.nih.gov/pubmed/34954610 http://dx.doi.org/10.1016/j.compbiomed.2021.105153 |
work_keys_str_mv | AT paharmadhurananda covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures AT kloppermarisa covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures AT warrenrobin covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures AT nieslerthomas covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures |