Cargando…

Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression

Automatic cough detection in the patients’ realistic audio recordings is of great significance to diagnose and monitor respiratory diseases, such as COVID-19. Many detection methods have been developed so far, but they are still unable to meet the practical requirements. In this paper, we present a...

Descripción completa

Detalles Bibliográficos
Autores principales: You, Mingyu, Wang, Weihao, Li, You, Liu, Jiaming, Xu, Xianghuai, Qiu, Zhongmin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9760237/
https://www.ncbi.nlm.nih.gov/pubmed/36569172
http://dx.doi.org/10.1016/j.bspc.2021.103304
_version_ 1784852417305640960
author You, Mingyu
Wang, Weihao
Li, You
Liu, Jiaming
Xu, Xianghuai
Qiu, Zhongmin
author_facet You, Mingyu
Wang, Weihao
Li, You
Liu, Jiaming
Xu, Xianghuai
Qiu, Zhongmin
author_sort You, Mingyu
collection PubMed
description Automatic cough detection in the patients’ realistic audio recordings is of great significance to diagnose and monitor respiratory diseases, such as COVID-19. Many detection methods have been developed so far, but they are still unable to meet the practical requirements. In this paper, we present a deep convolutional bidirectional long short-term memory (C-BiLSTM) model with boundary regression for cough detection, where cough and non-cough parts need to be classified and located. We added convolutional layers before the LSTM to enhance the cough features and preserve the temporal information of the audio data. Considering the importance of the cough event integrity for subsequent analysis, the novel model includes an embedded boundary regression on the last feature map for both higher detection accuracy and more accurate boundaries. We delicately designed, collected and labelled a realistic audio dataset containing recordings of patients with respiratory diseases, named the Corp Dataset. 168 h of recordings with 9969 coughs from 42 different patients are included. The dataset is published online on the MARI Lab website (https://mari.tongji.edu.cn/info/1012/1030.htm). The results show that the system achieves a sensitivity of 84.13%, a specificity of 99.82% and an intersection-over-union (IoU) of 0.89, which is significantly superior to other related models. With the proposed method, all the criteria on cough detection significantly increased. The open source Corp Dataset provides useful material and a benchmark for researchers investigating cough detection. We propose the state-of-the-art system with boundary regression, laying the foundation for identifying cough sounds in real-world audio data.
format Online
Article
Text
id pubmed-9760237
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-97602372022-12-19 Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression You, Mingyu Wang, Weihao Li, You Liu, Jiaming Xu, Xianghuai Qiu, Zhongmin Biomed Signal Process Control Article Automatic cough detection in the patients’ realistic audio recordings is of great significance to diagnose and monitor respiratory diseases, such as COVID-19. Many detection methods have been developed so far, but they are still unable to meet the practical requirements. In this paper, we present a deep convolutional bidirectional long short-term memory (C-BiLSTM) model with boundary regression for cough detection, where cough and non-cough parts need to be classified and located. We added convolutional layers before the LSTM to enhance the cough features and preserve the temporal information of the audio data. Considering the importance of the cough event integrity for subsequent analysis, the novel model includes an embedded boundary regression on the last feature map for both higher detection accuracy and more accurate boundaries. We delicately designed, collected and labelled a realistic audio dataset containing recordings of patients with respiratory diseases, named the Corp Dataset. 168 h of recordings with 9969 coughs from 42 different patients are included. The dataset is published online on the MARI Lab website (https://mari.tongji.edu.cn/info/1012/1030.htm). The results show that the system achieves a sensitivity of 84.13%, a specificity of 99.82% and an intersection-over-union (IoU) of 0.89, which is significantly superior to other related models. With the proposed method, all the criteria on cough detection significantly increased. The open source Corp Dataset provides useful material and a benchmark for researchers investigating cough detection. We propose the state-of-the-art system with boundary regression, laying the foundation for identifying cough sounds in real-world audio data. Elsevier Ltd. 2022-02 2021-11-11 /pmc/articles/PMC9760237/ /pubmed/36569172 http://dx.doi.org/10.1016/j.bspc.2021.103304 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
You, Mingyu
Wang, Weihao
Li, You
Liu, Jiaming
Xu, Xianghuai
Qiu, Zhongmin
Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression
title Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression
title_full Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression
title_fullStr Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression
title_full_unstemmed Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression
title_short Automatic cough detection from realistic audio recordings using C-BiLSTM with boundary regression
title_sort automatic cough detection from realistic audio recordings using c-bilstm with boundary regression
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9760237/
https://www.ncbi.nlm.nih.gov/pubmed/36569172
http://dx.doi.org/10.1016/j.bspc.2021.103304
work_keys_str_mv AT youmingyu automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression
AT wangweihao automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression
AT liyou automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression
AT liujiaming automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression
AT xuxianghuai automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression
AT qiuzhongmin automaticcoughdetectionfromrealisticaudiorecordingsusingcbilstmwithboundaryregression