Cargando…
Speech phoneme and spectral smearing based non-invasive COVID-19 detection
COVID-19 is a deadly viral infection that mainly affects the nasopharyngeal and oropharyngeal cavities before the lung in the human body. Early detection followed by immediate treatment can potentially reduce lung invasion and decrease fatality. Recently, several COVID-19 detections methods have bee...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9847386/ https://www.ncbi.nlm.nih.gov/pubmed/36686850 http://dx.doi.org/10.3389/frai.2022.1035805 |
_version_ | 1784871447018078208 |
---|---|
author | Mishra, Soumya Dash, Tusar Kanti Panda, Ganapati |
author_facet | Mishra, Soumya Dash, Tusar Kanti Panda, Ganapati |
author_sort | Mishra, Soumya |
collection | PubMed |
description | COVID-19 is a deadly viral infection that mainly affects the nasopharyngeal and oropharyngeal cavities before the lung in the human body. Early detection followed by immediate treatment can potentially reduce lung invasion and decrease fatality. Recently, several COVID-19 detections methods have been proposed using cough and breath sounds. However, very little study has been done on the use of phoneme analysis and the smearing of the audio signal in COVID-19 detection. In this paper, this problem has been addressed and the classification of speech samples has been carried out in COVID-19-positive and healthy audio samples. Additionally, the grouping of the phonemes based on reference classification accuracies have been proposed for effectiveness and faster detection of the disease at a primary stage. The Mel and Gammatone Cepstral coefficients and their derivatives are used as the features for five standard machine learning-based classifiers. It is observed that the generalized additive model provides the highest accuracy of 97.22% for the phoneme grouping “/t//r//n//g//l/.” This smearing-based phoneme classification technique can also be used in the future to classify other speech-related disease detections. |
format | Online Article Text |
id | pubmed-9847386 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-98473862023-01-19 Speech phoneme and spectral smearing based non-invasive COVID-19 detection Mishra, Soumya Dash, Tusar Kanti Panda, Ganapati Front Artif Intell Artificial Intelligence COVID-19 is a deadly viral infection that mainly affects the nasopharyngeal and oropharyngeal cavities before the lung in the human body. Early detection followed by immediate treatment can potentially reduce lung invasion and decrease fatality. Recently, several COVID-19 detections methods have been proposed using cough and breath sounds. However, very little study has been done on the use of phoneme analysis and the smearing of the audio signal in COVID-19 detection. In this paper, this problem has been addressed and the classification of speech samples has been carried out in COVID-19-positive and healthy audio samples. Additionally, the grouping of the phonemes based on reference classification accuracies have been proposed for effectiveness and faster detection of the disease at a primary stage. The Mel and Gammatone Cepstral coefficients and their derivatives are used as the features for five standard machine learning-based classifiers. It is observed that the generalized additive model provides the highest accuracy of 97.22% for the phoneme grouping “/t//r//n//g//l/.” This smearing-based phoneme classification technique can also be used in the future to classify other speech-related disease detections. Frontiers Media S.A. 2023-01-04 /pmc/articles/PMC9847386/ /pubmed/36686850 http://dx.doi.org/10.3389/frai.2022.1035805 Text en Copyright © 2023 Mishra, Dash and Panda. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Mishra, Soumya Dash, Tusar Kanti Panda, Ganapati Speech phoneme and spectral smearing based non-invasive COVID-19 detection |
title | Speech phoneme and spectral smearing based non-invasive COVID-19 detection |
title_full | Speech phoneme and spectral smearing based non-invasive COVID-19 detection |
title_fullStr | Speech phoneme and spectral smearing based non-invasive COVID-19 detection |
title_full_unstemmed | Speech phoneme and spectral smearing based non-invasive COVID-19 detection |
title_short | Speech phoneme and spectral smearing based non-invasive COVID-19 detection |
title_sort | speech phoneme and spectral smearing based non-invasive covid-19 detection |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9847386/ https://www.ncbi.nlm.nih.gov/pubmed/36686850 http://dx.doi.org/10.3389/frai.2022.1035805 |
work_keys_str_mv | AT mishrasoumya speechphonemeandspectralsmearingbasednoninvasivecovid19detection AT dashtusarkanti speechphonemeandspectralsmearingbasednoninvasivecovid19detection AT pandaganapati speechphonemeandspectralsmearingbasednoninvasivecovid19detection |