Cargando…
Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features
Lung sounds remain vital in clinical diagnosis as they reveal associations with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to better leverage artificial intelligence for faster and more accurate lung auscultation. This resea...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8074359/ https://www.ncbi.nlm.nih.gov/pubmed/33924146 http://dx.doi.org/10.3390/diagnostics11040732 |
_version_ | 1783684337206558720 |
---|---|
author | Jung, Shing-Yun Liao, Chia-Hung Wu, Yu-Sheng Yuan, Shyan-Ming Sun, Chuen-Tsai |
author_facet | Jung, Shing-Yun Liao, Chia-Hung Wu, Yu-Sheng Yuan, Shyan-Ming Sun, Chuen-Tsai |
author_sort | Jung, Shing-Yun |
collection | PubMed |
description | Lung sounds remain vital in clinical diagnosis as they reveal associations with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to better leverage artificial intelligence for faster and more accurate lung auscultation. This research aims to propose a feature engineering process that extracts the dedicated features for the depthwise separable convolution neural network (DS-CNN) to classify lung sounds accurately and efficiently. We extracted a total of three features for the shrunk DS-CNN model: the short-time Fourier-transformed (STFT) feature, the Mel-frequency cepstrum coefficient (MFCC) feature, and the fused features of these two. We observed that while DS-CNN models trained on either the STFT or the MFCC feature achieved an accuracy of 82.27% and 73.02%, respectively, fusing both features led to a higher accuracy of 85.74%. In addition, our method achieved 16 times higher inference speed on an edge device and only 0.45% less accuracy than RespireNet. This finding indicates that the fusion of the STFT and MFCC features and DS-CNN would be a model design for lightweight edge devices to achieve accurate AI-aided detection of lung diseases. |
format | Online Article Text |
id | pubmed-8074359 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-80743592021-04-27 Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features Jung, Shing-Yun Liao, Chia-Hung Wu, Yu-Sheng Yuan, Shyan-Ming Sun, Chuen-Tsai Diagnostics (Basel) Article Lung sounds remain vital in clinical diagnosis as they reveal associations with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to better leverage artificial intelligence for faster and more accurate lung auscultation. This research aims to propose a feature engineering process that extracts the dedicated features for the depthwise separable convolution neural network (DS-CNN) to classify lung sounds accurately and efficiently. We extracted a total of three features for the shrunk DS-CNN model: the short-time Fourier-transformed (STFT) feature, the Mel-frequency cepstrum coefficient (MFCC) feature, and the fused features of these two. We observed that while DS-CNN models trained on either the STFT or the MFCC feature achieved an accuracy of 82.27% and 73.02%, respectively, fusing both features led to a higher accuracy of 85.74%. In addition, our method achieved 16 times higher inference speed on an edge device and only 0.45% less accuracy than RespireNet. This finding indicates that the fusion of the STFT and MFCC features and DS-CNN would be a model design for lightweight edge devices to achieve accurate AI-aided detection of lung diseases. MDPI 2021-04-20 /pmc/articles/PMC8074359/ /pubmed/33924146 http://dx.doi.org/10.3390/diagnostics11040732 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jung, Shing-Yun Liao, Chia-Hung Wu, Yu-Sheng Yuan, Shyan-Ming Sun, Chuen-Tsai Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features |
title | Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features |
title_full | Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features |
title_fullStr | Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features |
title_full_unstemmed | Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features |
title_short | Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features |
title_sort | efficiently classifying lung sounds through depthwise separable cnn models with fused stft and mfcc features |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8074359/ https://www.ncbi.nlm.nih.gov/pubmed/33924146 http://dx.doi.org/10.3390/diagnostics11040732 |
work_keys_str_mv | AT jungshingyun efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures AT liaochiahung efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures AT wuyusheng efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures AT yuanshyanming efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures AT sunchuentsai efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures |