Cargando…

Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features

Lung sounds remain vital in clinical diagnosis as they reveal associations with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to better leverage artificial intelligence for faster and more accurate lung auscultation. This resea...

Descripción completa

Detalles Bibliográficos
Autores principales: Jung, Shing-Yun, Liao, Chia-Hung, Wu, Yu-Sheng, Yuan, Shyan-Ming, Sun, Chuen-Tsai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8074359/
https://www.ncbi.nlm.nih.gov/pubmed/33924146
http://dx.doi.org/10.3390/diagnostics11040732
_version_ 1783684337206558720
author Jung, Shing-Yun
Liao, Chia-Hung
Wu, Yu-Sheng
Yuan, Shyan-Ming
Sun, Chuen-Tsai
author_facet Jung, Shing-Yun
Liao, Chia-Hung
Wu, Yu-Sheng
Yuan, Shyan-Ming
Sun, Chuen-Tsai
author_sort Jung, Shing-Yun
collection PubMed
description Lung sounds remain vital in clinical diagnosis as they reveal associations with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to better leverage artificial intelligence for faster and more accurate lung auscultation. This research aims to propose a feature engineering process that extracts the dedicated features for the depthwise separable convolution neural network (DS-CNN) to classify lung sounds accurately and efficiently. We extracted a total of three features for the shrunk DS-CNN model: the short-time Fourier-transformed (STFT) feature, the Mel-frequency cepstrum coefficient (MFCC) feature, and the fused features of these two. We observed that while DS-CNN models trained on either the STFT or the MFCC feature achieved an accuracy of 82.27% and 73.02%, respectively, fusing both features led to a higher accuracy of 85.74%. In addition, our method achieved 16 times higher inference speed on an edge device and only 0.45% less accuracy than RespireNet. This finding indicates that the fusion of the STFT and MFCC features and DS-CNN would be a model design for lightweight edge devices to achieve accurate AI-aided detection of lung diseases.
format Online
Article
Text
id pubmed-8074359
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80743592021-04-27 Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features Jung, Shing-Yun Liao, Chia-Hung Wu, Yu-Sheng Yuan, Shyan-Ming Sun, Chuen-Tsai Diagnostics (Basel) Article Lung sounds remain vital in clinical diagnosis as they reveal associations with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to better leverage artificial intelligence for faster and more accurate lung auscultation. This research aims to propose a feature engineering process that extracts the dedicated features for the depthwise separable convolution neural network (DS-CNN) to classify lung sounds accurately and efficiently. We extracted a total of three features for the shrunk DS-CNN model: the short-time Fourier-transformed (STFT) feature, the Mel-frequency cepstrum coefficient (MFCC) feature, and the fused features of these two. We observed that while DS-CNN models trained on either the STFT or the MFCC feature achieved an accuracy of 82.27% and 73.02%, respectively, fusing both features led to a higher accuracy of 85.74%. In addition, our method achieved 16 times higher inference speed on an edge device and only 0.45% less accuracy than RespireNet. This finding indicates that the fusion of the STFT and MFCC features and DS-CNN would be a model design for lightweight edge devices to achieve accurate AI-aided detection of lung diseases. MDPI 2021-04-20 /pmc/articles/PMC8074359/ /pubmed/33924146 http://dx.doi.org/10.3390/diagnostics11040732 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jung, Shing-Yun
Liao, Chia-Hung
Wu, Yu-Sheng
Yuan, Shyan-Ming
Sun, Chuen-Tsai
Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features
title Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features
title_full Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features
title_fullStr Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features
title_full_unstemmed Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features
title_short Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features
title_sort efficiently classifying lung sounds through depthwise separable cnn models with fused stft and mfcc features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8074359/
https://www.ncbi.nlm.nih.gov/pubmed/33924146
http://dx.doi.org/10.3390/diagnostics11040732
work_keys_str_mv AT jungshingyun efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures
AT liaochiahung efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures
AT wuyusheng efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures
AT yuanshyanming efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures
AT sunchuentsai efficientlyclassifyinglungsoundsthroughdepthwiseseparablecnnmodelswithfusedstftandmfccfeatures