Cargando…

Speech as a Biomarker for COVID-19 Detection Using Machine Learning

The use of speech as a biomedical signal for diagnosing COVID-19 is investigated using statistical analysis of speech spectral features and classification algorithms based on machine learning. It is established that spectral features of speech, obtained by computing the short-time Fourier Transform...

Descripción completa

Detalles Bibliográficos
Autores principales:	Usman, Mohammed, Gunjan, Vinit Kumar, Wajid, Mohd, Zubair, Mohammed, Siddiquee, Kazy Noor-e-alam
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014833/ https://www.ncbi.nlm.nih.gov/pubmed/35444694 http://dx.doi.org/10.1155/2022/6093613

_version_	1784688262425608192
author	Usman, Mohammed Gunjan, Vinit Kumar Wajid, Mohd Zubair, Mohammed Siddiquee, Kazy Noor-e-alam
author_facet	Usman, Mohammed Gunjan, Vinit Kumar Wajid, Mohd Zubair, Mohammed Siddiquee, Kazy Noor-e-alam
author_sort	Usman, Mohammed
collection	PubMed
description	The use of speech as a biomedical signal for diagnosing COVID-19 is investigated using statistical analysis of speech spectral features and classification algorithms based on machine learning. It is established that spectral features of speech, obtained by computing the short-time Fourier Transform (STFT), get altered in a statistical sense as a result of physiological changes. These spectral features are then used as input features to machine learning-based classification algorithms to classify them as coming from a COVID-19 positive individual or not. Speech samples from healthy as well as “asymptomatic” COVID-19 positive individuals have been used in this study. It is shown that the RMS error of statistical distribution fitting is higher in the case of speech samples of COVID-19 positive speech samples as compared to the speech samples of healthy individuals. Five state-of-the-art machine learning classification algorithms have also been analyzed, and the performance evaluation metrics of these algorithms are also presented. The tuning of machine learning model parameters is done so as to minimize the misclassification of COVID-19 positive individuals as being COVID-19 negative since the cost associated with this misclassification is higher than the opposite misclassification. The best performance in terms of the “recall” metric is observed for the Decision Forest algorithm which gives a recall value of 0.7892.
format	Online Article Text
id	pubmed-9014833
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-90148332022-04-19 Speech as a Biomarker for COVID-19 Detection Using Machine Learning Usman, Mohammed Gunjan, Vinit Kumar Wajid, Mohd Zubair, Mohammed Siddiquee, Kazy Noor-e-alam Comput Intell Neurosci Research Article The use of speech as a biomedical signal for diagnosing COVID-19 is investigated using statistical analysis of speech spectral features and classification algorithms based on machine learning. It is established that spectral features of speech, obtained by computing the short-time Fourier Transform (STFT), get altered in a statistical sense as a result of physiological changes. These spectral features are then used as input features to machine learning-based classification algorithms to classify them as coming from a COVID-19 positive individual or not. Speech samples from healthy as well as “asymptomatic” COVID-19 positive individuals have been used in this study. It is shown that the RMS error of statistical distribution fitting is higher in the case of speech samples of COVID-19 positive speech samples as compared to the speech samples of healthy individuals. Five state-of-the-art machine learning classification algorithms have also been analyzed, and the performance evaluation metrics of these algorithms are also presented. The tuning of machine learning model parameters is done so as to minimize the misclassification of COVID-19 positive individuals as being COVID-19 negative since the cost associated with this misclassification is higher than the opposite misclassification. The best performance in terms of the “recall” metric is observed for the Decision Forest algorithm which gives a recall value of 0.7892. Hindawi 2022-04-18 /pmc/articles/PMC9014833/ /pubmed/35444694 http://dx.doi.org/10.1155/2022/6093613 Text en Copyright © 2022 Mohammed Usman et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Usman, Mohammed Gunjan, Vinit Kumar Wajid, Mohd Zubair, Mohammed Siddiquee, Kazy Noor-e-alam Speech as a Biomarker for COVID-19 Detection Using Machine Learning
title	Speech as a Biomarker for COVID-19 Detection Using Machine Learning
title_full	Speech as a Biomarker for COVID-19 Detection Using Machine Learning
title_fullStr	Speech as a Biomarker for COVID-19 Detection Using Machine Learning
title_full_unstemmed	Speech as a Biomarker for COVID-19 Detection Using Machine Learning
title_short	Speech as a Biomarker for COVID-19 Detection Using Machine Learning
title_sort	speech as a biomarker for covid-19 detection using machine learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014833/ https://www.ncbi.nlm.nih.gov/pubmed/35444694 http://dx.doi.org/10.1155/2022/6093613
work_keys_str_mv	AT usmanmohammed speechasabiomarkerforcovid19detectionusingmachinelearning AT gunjanvinitkumar speechasabiomarkerforcovid19detectionusingmachinelearning AT wajidmohd speechasabiomarkerforcovid19detectionusingmachinelearning AT zubairmohammed speechasabiomarkerforcovid19detectionusingmachinelearning AT siddiqueekazynoorealam speechasabiomarkerforcovid19detectionusingmachinelearning

Speech as a Biomarker for COVID-19 Detection Using Machine Learning

Ejemplares similares