Cargando…

A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs

Medical audio classification for lung abnormality diagnosis is a challenging problem owing to comparatively unstructured audio signals present in the respiratory sound clips. To tackle such challenges, we propose an ensemble model by incorporating diverse deep neural networks with attention mechanis...

Descripción completa

Detalles Bibliográficos
Autores principales: Wall, Conor, Zhang, Li, Yu, Yonghong, Kumar, Akshi, Gao, Rong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9332569/
https://www.ncbi.nlm.nih.gov/pubmed/35898070
http://dx.doi.org/10.3390/s22155566
_version_ 1784758680230559744
author Wall, Conor
Zhang, Li
Yu, Yonghong
Kumar, Akshi
Gao, Rong
author_facet Wall, Conor
Zhang, Li
Yu, Yonghong
Kumar, Akshi
Gao, Rong
author_sort Wall, Conor
collection PubMed
description Medical audio classification for lung abnormality diagnosis is a challenging problem owing to comparatively unstructured audio signals present in the respiratory sound clips. To tackle such challenges, we propose an ensemble model by incorporating diverse deep neural networks with attention mechanisms for undertaking lung abnormality and COVID-19 diagnosis using respiratory, speech, and coughing audio inputs. Specifically, four base deep networks are proposed, which include attention-based Convolutional Recurrent Neural Network (A-CRNN), attention-based bidirectional Long Short-Term Memory (A-BiLSTM), attention-based bidirectional Gated Recurrent Unit (A-BiGRU), as well as Convolutional Neural Network (CNN). A Particle Swarm Optimization (PSO) algorithm is used to optimize the training parameters of each network. An ensemble mechanism is used to integrate the outputs of these base networks by averaging the probability predictions of each class. Evaluated using respiratory ICBHI, Coswara breathing, speech, and cough datasets, as well as a combination of ICBHI and Coswara breathing databases, our ensemble model and base networks achieve ICBHI scores ranging from 0.920 to 0.9766. Most importantly, the empirical results indicate that a positive COVID-19 diagnosis can be distinguished to a high degree from other more common respiratory diseases using audio recordings, based on the combined ICBHI and Coswara breathing datasets.
format Online
Article
Text
id pubmed-9332569
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93325692022-07-29 A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs Wall, Conor Zhang, Li Yu, Yonghong Kumar, Akshi Gao, Rong Sensors (Basel) Article Medical audio classification for lung abnormality diagnosis is a challenging problem owing to comparatively unstructured audio signals present in the respiratory sound clips. To tackle such challenges, we propose an ensemble model by incorporating diverse deep neural networks with attention mechanisms for undertaking lung abnormality and COVID-19 diagnosis using respiratory, speech, and coughing audio inputs. Specifically, four base deep networks are proposed, which include attention-based Convolutional Recurrent Neural Network (A-CRNN), attention-based bidirectional Long Short-Term Memory (A-BiLSTM), attention-based bidirectional Gated Recurrent Unit (A-BiGRU), as well as Convolutional Neural Network (CNN). A Particle Swarm Optimization (PSO) algorithm is used to optimize the training parameters of each network. An ensemble mechanism is used to integrate the outputs of these base networks by averaging the probability predictions of each class. Evaluated using respiratory ICBHI, Coswara breathing, speech, and cough datasets, as well as a combination of ICBHI and Coswara breathing databases, our ensemble model and base networks achieve ICBHI scores ranging from 0.920 to 0.9766. Most importantly, the empirical results indicate that a positive COVID-19 diagnosis can be distinguished to a high degree from other more common respiratory diseases using audio recordings, based on the combined ICBHI and Coswara breathing datasets. MDPI 2022-07-26 /pmc/articles/PMC9332569/ /pubmed/35898070 http://dx.doi.org/10.3390/s22155566 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wall, Conor
Zhang, Li
Yu, Yonghong
Kumar, Akshi
Gao, Rong
A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs
title A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs
title_full A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs
title_fullStr A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs
title_full_unstemmed A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs
title_short A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs
title_sort deep ensemble neural network with attention mechanisms for lung abnormality classification using audio inputs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9332569/
https://www.ncbi.nlm.nih.gov/pubmed/35898070
http://dx.doi.org/10.3390/s22155566
work_keys_str_mv AT wallconor adeepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT zhangli adeepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT yuyonghong adeepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT kumarakshi adeepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT gaorong adeepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT wallconor deepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT zhangli deepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT yuyonghong deepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT kumarakshi deepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs
AT gaorong deepensembleneuralnetworkwithattentionmechanismsforlungabnormalityclassificationusingaudioinputs