Cargando…
Significance of voiced and unvoiced speech segments for the detection of common cold
This work investigates the significance of the voiced and unvoiced region for detecting common cold from the speech signal. In literature, the entire speech signal is processed to detect the common cold and other diseases. This study uses a short-time energy-based approach to segment the voiced and...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer London
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664442/ https://www.ncbi.nlm.nih.gov/pubmed/36408330 http://dx.doi.org/10.1007/s11760-022-02389-8 |
_version_ | 1784831101659774976 |
---|---|
author | Warule, Pankaj Mishra, Siba Prasad Deb, Suman |
author_facet | Warule, Pankaj Mishra, Siba Prasad Deb, Suman |
author_sort | Warule, Pankaj |
collection | PubMed |
description | This work investigates the significance of the voiced and unvoiced region for detecting common cold from the speech signal. In literature, the entire speech signal is processed to detect the common cold and other diseases. This study uses a short-time energy-based approach to segment the voiced and unvoiced region of the speech signal. Then, frame-wise mel frequency cepstral coefficients (MFCC) features are extracted from the voiced and unvoiced segments of each speech utterance, and statistics (mean, variance, skewness, and kurtosis) are calculated to get the feature vector for each speech utterance. The support vector machine (SVM) is utilized to analyze the performance of features extracted from the voiced and unvoiced region. Result shows that the feature extracted from voiced segments, unvoiced segments, and complete active speech (CAS) gives almost similar results using the MFCC features and SVM classifier. Therefore, rather than processing the CAS, we can process the unvoiced speech segments, which have fewer frames compared to CAS and voiced regions of speech. The processing of solely unvoiced segments can reduce the time and computation complexity of a speech signal-based common cold detection system. |
format | Online Article Text |
id | pubmed-9664442 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer London |
record_format | MEDLINE/PubMed |
spelling | pubmed-96644422022-11-14 Significance of voiced and unvoiced speech segments for the detection of common cold Warule, Pankaj Mishra, Siba Prasad Deb, Suman Signal Image Video Process Original Paper This work investigates the significance of the voiced and unvoiced region for detecting common cold from the speech signal. In literature, the entire speech signal is processed to detect the common cold and other diseases. This study uses a short-time energy-based approach to segment the voiced and unvoiced region of the speech signal. Then, frame-wise mel frequency cepstral coefficients (MFCC) features are extracted from the voiced and unvoiced segments of each speech utterance, and statistics (mean, variance, skewness, and kurtosis) are calculated to get the feature vector for each speech utterance. The support vector machine (SVM) is utilized to analyze the performance of features extracted from the voiced and unvoiced region. Result shows that the feature extracted from voiced segments, unvoiced segments, and complete active speech (CAS) gives almost similar results using the MFCC features and SVM classifier. Therefore, rather than processing the CAS, we can process the unvoiced speech segments, which have fewer frames compared to CAS and voiced regions of speech. The processing of solely unvoiced segments can reduce the time and computation complexity of a speech signal-based common cold detection system. Springer London 2022-11-15 2023 /pmc/articles/PMC9664442/ /pubmed/36408330 http://dx.doi.org/10.1007/s11760-022-02389-8 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Paper Warule, Pankaj Mishra, Siba Prasad Deb, Suman Significance of voiced and unvoiced speech segments for the detection of common cold |
title | Significance of voiced and unvoiced speech segments for the detection of common cold |
title_full | Significance of voiced and unvoiced speech segments for the detection of common cold |
title_fullStr | Significance of voiced and unvoiced speech segments for the detection of common cold |
title_full_unstemmed | Significance of voiced and unvoiced speech segments for the detection of common cold |
title_short | Significance of voiced and unvoiced speech segments for the detection of common cold |
title_sort | significance of voiced and unvoiced speech segments for the detection of common cold |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664442/ https://www.ncbi.nlm.nih.gov/pubmed/36408330 http://dx.doi.org/10.1007/s11760-022-02389-8 |
work_keys_str_mv | AT warulepankaj significanceofvoicedandunvoicedspeechsegmentsforthedetectionofcommoncold AT mishrasibaprasad significanceofvoicedandunvoicedspeechsegmentsforthedetectionofcommoncold AT debsuman significanceofvoicedandunvoicedspeechsegmentsforthedetectionofcommoncold |