Cargando…
Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression
Automatic cough detection in the patients’ realistic audio recordings is of great significance to diagnose and monitor respiratory diseases, such as COVID-19. Many detection methods have been developed so far, but they are still unable to meet the practical requirements. In this paper, we present a...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9760237/ https://www.ncbi.nlm.nih.gov/pubmed/36569172 http://dx.doi.org/10.1016/j.bspc.2021.103304 |
_version_ | 1784852417305640960 |
---|---|
author | You, Mingyu Wang, Weihao Li, You Liu, Jiaming Xu, Xianghuai Qiu, Zhongmin |
author_facet | You, Mingyu Wang, Weihao Li, You Liu, Jiaming Xu, Xianghuai Qiu, Zhongmin |
author_sort | You, Mingyu |
collection | PubMed |
description | Automatic cough detection in the patients’ realistic audio recordings is of great significance to diagnose and monitor respiratory diseases, such as COVID-19. Many detection methods have been developed so far, but they are still unable to meet the practical requirements. In this paper, we present a deep convolutional bidirectional long short-term memory (C-BiLSTM) model with boundary regression for cough detection, where cough and non-cough parts need to be classified and located. We added convolutional layers before the LSTM to enhance the cough features and preserve the temporal information of the audio data. Considering the importance of the cough event integrity for subsequent analysis, the novel model includes an embedded boundary regression on the last feature map for both higher detection accuracy and more accurate boundaries. We delicately designed, collected and labelled a realistic audio dataset containing recordings of patients with respiratory diseases, named the Corp Dataset. 168 h of recordings with 9969 coughs from 42 different patients are included. The dataset is published online on the MARI Lab website (https://mari.tongji.edu.cn/info/1012/1030.htm). The results show that the system achieves a sensitivity of 84.13%, a specificity of 99.82% and an intersection-over-union (IoU) of 0.89, which is significantly superior to other related models. With the proposed method, all the criteria on cough detection significantly increased. The open source Corp Dataset provides useful material and a benchmark for researchers investigating cough detection. We propose the state-of-the-art system with boundary regression, laying the foundation for identifying cough sounds in real-world audio data. |
format | Online Article Text |
id | pubmed-9760237 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97602372022-12-19 Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression You, Mingyu Wang, Weihao Li, You Liu, Jiaming Xu, Xianghuai Qiu, Zhongmin Biomed Signal Process Control Article Automatic cough detection in the patients’ realistic audio recordings is of great significance to diagnose and monitor respiratory diseases, such as COVID-19. Many detection methods have been developed so far, but they are still unable to meet the practical requirements. In this paper, we present a deep convolutional bidirectional long short-term memory (C-BiLSTM) model with boundary regression for cough detection, where cough and non-cough parts need to be classified and located. We added convolutional layers before the LSTM to enhance the cough features and preserve the temporal information of the audio data. Considering the importance of the cough event integrity for subsequent analysis, the novel model includes an embedded boundary regression on the last feature map for both higher detection accuracy and more accurate boundaries. We delicately designed, collected and labelled a realistic audio dataset containing recordings of patients with respiratory diseases, named the Corp Dataset. 168 h of recordings with 9969 coughs from 42 different patients are included. The dataset is published online on the MARI Lab website (https://mari.tongji.edu.cn/info/1012/1030.htm). The results show that the system achieves a sensitivity of 84.13%, a specificity of 99.82% and an intersection-over-union (IoU) of 0.89, which is significantly superior to other related models. With the proposed method, all the criteria on cough detection significantly increased. The open source Corp Dataset provides useful material and a benchmark for researchers investigating cough detection. We propose the state-of-the-art system with boundary regression, laying the foundation for identifying cough sounds in real-world audio data. Elsevier Ltd. 2022-02 2021-11-11 /pmc/articles/PMC9760237/ /pubmed/36569172 http://dx.doi.org/10.1016/j.bspc.2021.103304 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article You, Mingyu Wang, Weihao Li, You Liu, Jiaming Xu, Xianghuai Qiu, Zhongmin Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression |
title | Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression |
title_full | Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression |
title_fullStr | Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression |
title_full_unstemmed | Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression |
title_short | Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression |
title_sort | automatic cough detection from realistic audio recordings using c-bilstm with boundary regression |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9760237/ https://www.ncbi.nlm.nih.gov/pubmed/36569172 http://dx.doi.org/10.1016/j.bspc.2021.103304 |
work_keys_str_mv | AT youmingyu automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression AT wangweihao automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression AT liyou automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression AT liujiaming automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression AT xuxianghuai automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression AT qiuzhongmin automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression |