Cargando…

Deep learning based cough detection camera using enhanced features

Coughing is a typical symptom of COVID-19. To detect and localize coughing sounds remotely, a convolutional neural network (CNN) based deep learning model was developed in this work and integrated with a sound camera for the visualization of the cough sounds. The cough detection model is a binary cl...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Gyeong-Tae, Nam, Hyeonuk, Kim, Seong-Hu, Choi, Sang-Min, Kim, Youngkey, Park, Yong-Hwa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Authors. Published by Elsevier Ltd. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9181707/
https://www.ncbi.nlm.nih.gov/pubmed/35712056
http://dx.doi.org/10.1016/j.eswa.2022.117811
_version_ 1784723848220901376
author Lee, Gyeong-Tae
Nam, Hyeonuk
Kim, Seong-Hu
Choi, Sang-Min
Kim, Youngkey
Park, Yong-Hwa
author_facet Lee, Gyeong-Tae
Nam, Hyeonuk
Kim, Seong-Hu
Choi, Sang-Min
Kim, Youngkey
Park, Yong-Hwa
author_sort Lee, Gyeong-Tae
collection PubMed
description Coughing is a typical symptom of COVID-19. To detect and localize coughing sounds remotely, a convolutional neural network (CNN) based deep learning model was developed in this work and integrated with a sound camera for the visualization of the cough sounds. The cough detection model is a binary classifier of which the input is a two second acoustic feature and the output is one of two inferences (Cough or Others). Data augmentation was performed on the collected audio files to alleviate class imbalance and reflect various background noises in practical environments. For effective featuring of the cough sound, conventional features such as spectrograms, mel-scaled spectrograms, and mel-frequency cepstral coefficients (MFCC) were reinforced by utilizing their velocity (V) and acceleration (A) maps in this work. VGGNet, GoogLeNet, and ResNet were simplified to binary classifiers, and were named V-net, G-net, and R-net, respectively. To find the best combination of features and networks, training was performed for a total of 39 cases and the performance was confirmed using the test F1 score. Finally, a test F1 score of 91.9% (test accuracy of 97.2%) was achieved from G-net with the MFCC-V-A feature (named Spectroflow), an acoustic feature effective for use in cough detection. The trained cough detection model was integrated with a sound camera (i.e., one that visualizes sound sources using a beamforming microphone array). In a pilot test, the cough detection camera detected coughing sounds with an F1 score of 90.0% (accuracy of 96.0%), and the cough location in the camera image was tracked in real time.
format Online
Article
Text
id pubmed-9181707
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Authors. Published by Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-91817072022-06-10 Deep learning based cough detection camera using enhanced features Lee, Gyeong-Tae Nam, Hyeonuk Kim, Seong-Hu Choi, Sang-Min Kim, Youngkey Park, Yong-Hwa Expert Syst Appl Article Coughing is a typical symptom of COVID-19. To detect and localize coughing sounds remotely, a convolutional neural network (CNN) based deep learning model was developed in this work and integrated with a sound camera for the visualization of the cough sounds. The cough detection model is a binary classifier of which the input is a two second acoustic feature and the output is one of two inferences (Cough or Others). Data augmentation was performed on the collected audio files to alleviate class imbalance and reflect various background noises in practical environments. For effective featuring of the cough sound, conventional features such as spectrograms, mel-scaled spectrograms, and mel-frequency cepstral coefficients (MFCC) were reinforced by utilizing their velocity (V) and acceleration (A) maps in this work. VGGNet, GoogLeNet, and ResNet were simplified to binary classifiers, and were named V-net, G-net, and R-net, respectively. To find the best combination of features and networks, training was performed for a total of 39 cases and the performance was confirmed using the test F1 score. Finally, a test F1 score of 91.9% (test accuracy of 97.2%) was achieved from G-net with the MFCC-V-A feature (named Spectroflow), an acoustic feature effective for use in cough detection. The trained cough detection model was integrated with a sound camera (i.e., one that visualizes sound sources using a beamforming microphone array). In a pilot test, the cough detection camera detected coughing sounds with an F1 score of 90.0% (accuracy of 96.0%), and the cough location in the camera image was tracked in real time. The Authors. Published by Elsevier Ltd. 2022-11-15 2022-06-09 /pmc/articles/PMC9181707/ /pubmed/35712056 http://dx.doi.org/10.1016/j.eswa.2022.117811 Text en © 2022 The Authors. Published by Elsevier Ltd. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Lee, Gyeong-Tae
Nam, Hyeonuk
Kim, Seong-Hu
Choi, Sang-Min
Kim, Youngkey
Park, Yong-Hwa
Deep learning based cough detection camera using enhanced features
title Deep learning based cough detection camera using enhanced features
title_full Deep learning based cough detection camera using enhanced features
title_fullStr Deep learning based cough detection camera using enhanced features
title_full_unstemmed Deep learning based cough detection camera using enhanced features
title_short Deep learning based cough detection camera using enhanced features
title_sort deep learning based cough detection camera using enhanced features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9181707/
https://www.ncbi.nlm.nih.gov/pubmed/35712056
http://dx.doi.org/10.1016/j.eswa.2022.117811
work_keys_str_mv AT leegyeongtae deeplearningbasedcoughdetectioncamerausingenhancedfeatures
AT namhyeonuk deeplearningbasedcoughdetectioncamerausingenhancedfeatures
AT kimseonghu deeplearningbasedcoughdetectioncamerausingenhancedfeatures
AT choisangmin deeplearningbasedcoughdetectioncamerausingenhancedfeatures
AT kimyoungkey deeplearningbasedcoughdetectioncamerausingenhancedfeatures
AT parkyonghwa deeplearningbasedcoughdetectioncamerausingenhancedfeatures