Cargando…

COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features

We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilitie...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pahar, Madhurananda, Klopper, Marisa, Warren, Robin, Niesler, Thomas
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier Ltd. 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8679499/ https://www.ncbi.nlm.nih.gov/pubmed/34954610 http://dx.doi.org/10.1016/j.compbiomed.2021.105153

_version_	1784616536342790144
author	Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas
author_facet	Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas
author_sort	Pahar, Madhurananda
collection	PubMed
description	We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilities and can be deployed on inexpensive consumer hardware such as a smartphone. We use datasets that contain cough, sneeze, speech and other noises, but do not contain COVID-19 labels, to pre-train three deep neural networks: a CNN, an LSTM and a Resnet50. These pre-trained networks are subsequently either fine-tuned using smaller datasets of coughing with COVID-19 labels in the process of transfer learning, or are used as bottleneck feature extractors. Results show that a Resnet50 classifier trained by this transfer learning process delivers optimal or near-optimal performance across all datasets achieving areas under the receiver operating characteristic (ROC AUC) of 0.98, 0.94 and 0.92 respectively for all three sound classes: coughs, breaths and speech. This indicates that coughs carry the strongest COVID-19 signature, followed by breath and speech. Our results also show that applying transfer learning and extracting bottleneck features using the larger datasets without COVID-19 labels led not only to improved performance, but also to a marked reduction in the standard deviation of the classifier AUCs measured over the outer folds during nested cross-validation, indicating better generalisation. We conclude that deep transfer learning and bottleneck feature extraction can improve COVID-19 cough, breath and speech audio classification, yielding automatic COVID-19 detection with a better and more consistent overall performance.
format	Online Article Text
id	pubmed-8679499
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Elsevier Ltd.
record_format	MEDLINE/PubMed
spelling	pubmed-86794992021-12-17 COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas Comput Biol Med Article We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilities and can be deployed on inexpensive consumer hardware such as a smartphone. We use datasets that contain cough, sneeze, speech and other noises, but do not contain COVID-19 labels, to pre-train three deep neural networks: a CNN, an LSTM and a Resnet50. These pre-trained networks are subsequently either fine-tuned using smaller datasets of coughing with COVID-19 labels in the process of transfer learning, or are used as bottleneck feature extractors. Results show that a Resnet50 classifier trained by this transfer learning process delivers optimal or near-optimal performance across all datasets achieving areas under the receiver operating characteristic (ROC AUC) of 0.98, 0.94 and 0.92 respectively for all three sound classes: coughs, breaths and speech. This indicates that coughs carry the strongest COVID-19 signature, followed by breath and speech. Our results also show that applying transfer learning and extracting bottleneck features using the larger datasets without COVID-19 labels led not only to improved performance, but also to a marked reduction in the standard deviation of the classifier AUCs measured over the outer folds during nested cross-validation, indicating better generalisation. We conclude that deep transfer learning and bottleneck feature extraction can improve COVID-19 cough, breath and speech audio classification, yielding automatic COVID-19 detection with a better and more consistent overall performance. Elsevier Ltd. 2022-02 2021-12-17 /pmc/articles/PMC8679499/ /pubmed/34954610 http://dx.doi.org/10.1016/j.compbiomed.2021.105153 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle	Article Pahar, Madhurananda Klopper, Marisa Warren, Robin Niesler, Thomas COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title	COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_full	COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_fullStr	COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_full_unstemmed	COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_short	COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
title_sort	covid-19 detection in cough, breath and speech using deep transfer learning and bottleneck features
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8679499/ https://www.ncbi.nlm.nih.gov/pubmed/34954610 http://dx.doi.org/10.1016/j.compbiomed.2021.105153
work_keys_str_mv	AT paharmadhurananda covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures AT kloppermarisa covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures AT warrenrobin covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures AT nieslerthomas covid19detectionincoughbreathandspeechusingdeeptransferlearningandbottleneckfeatures

COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features

Ejemplares similares