Cargando…

Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease

OBJECTIVES: Parkinson’s disease (PD) is the second most common neurodegenerative disorder; it affects more than 10 million people worldwide. Detecting PD usually requires a professional assessment by an expert, and investigation of the voice as a biomarker of the disease could be effective in speedi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tougui, Ilias, Jilbab, Abdelilah, El Mhamdi, Jamal
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Korean Society of Medical Informatics 2020
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7674819/ https://www.ncbi.nlm.nih.gov/pubmed/33190461 http://dx.doi.org/10.4258/hir.2020.26.4.274

_version_	1783611588378361856
author	Tougui, Ilias Jilbab, Abdelilah El Mhamdi, Jamal
author_facet	Tougui, Ilias Jilbab, Abdelilah El Mhamdi, Jamal
author_sort	Tougui, Ilias
collection	PubMed
description	OBJECTIVES: Parkinson’s disease (PD) is the second most common neurodegenerative disorder; it affects more than 10 million people worldwide. Detecting PD usually requires a professional assessment by an expert, and investigation of the voice as a biomarker of the disease could be effective in speeding up the diagnostic process. METHODS: We present our methodology in which we distinguish PD patients from healthy controls (HC) using a large sample of 18,210 smartphone recordings. Those recordings were processed by an audio processing technique to create a final dataset of 80,594 instances and 138 features from the time, frequency, and cepstral domains. This dataset was preprocessed and normalized to create baseline machine-learning models using four classifiers, namely, linear support vector machine, K-nearest neighbor, random forest, and extreme gradient boosting (XGBoost). We divided our dataset into training and held-out test sets. Then we used stratified 5-fold cross-validation and four performance measures: accuracy, sensitivity, specificity, and F1-score to assess the performance of the models. We applied two feature selection methods, analysis of variance (ANOVA) and least absolute shrinkage and selection operator (LASSO), to reduce the dimensionality of the dataset by selecting the best subset of features that maximizes the performance of the classifiers. RESULTS: LASSO outperformed ANOVA with almost the same number of features. With 33 features, XGBoost achieved a maximum accuracy of 95.31% on training data, and 95.78% by predicting unseen data. CONCLUSIONS: Developing a smartphone-based system that implements machine-learning techniques is an effective way to diagnose PD using the voice as a biomarker.
format	Online Article Text
id	pubmed-7674819
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Korean Society of Medical Informatics
record_format	MEDLINE/PubMed
spelling	pubmed-76748192020-11-19 Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease Tougui, Ilias Jilbab, Abdelilah El Mhamdi, Jamal Healthc Inform Res Original Article OBJECTIVES: Parkinson’s disease (PD) is the second most common neurodegenerative disorder; it affects more than 10 million people worldwide. Detecting PD usually requires a professional assessment by an expert, and investigation of the voice as a biomarker of the disease could be effective in speeding up the diagnostic process. METHODS: We present our methodology in which we distinguish PD patients from healthy controls (HC) using a large sample of 18,210 smartphone recordings. Those recordings were processed by an audio processing technique to create a final dataset of 80,594 instances and 138 features from the time, frequency, and cepstral domains. This dataset was preprocessed and normalized to create baseline machine-learning models using four classifiers, namely, linear support vector machine, K-nearest neighbor, random forest, and extreme gradient boosting (XGBoost). We divided our dataset into training and held-out test sets. Then we used stratified 5-fold cross-validation and four performance measures: accuracy, sensitivity, specificity, and F1-score to assess the performance of the models. We applied two feature selection methods, analysis of variance (ANOVA) and least absolute shrinkage and selection operator (LASSO), to reduce the dimensionality of the dataset by selecting the best subset of features that maximizes the performance of the classifiers. RESULTS: LASSO outperformed ANOVA with almost the same number of features. With 33 features, XGBoost achieved a maximum accuracy of 95.31% on training data, and 95.78% by predicting unseen data. CONCLUSIONS: Developing a smartphone-based system that implements machine-learning techniques is an effective way to diagnose PD using the voice as a biomarker. Korean Society of Medical Informatics 2020-10 2020-10-31 /pmc/articles/PMC7674819/ /pubmed/33190461 http://dx.doi.org/10.4258/hir.2020.26.4.274 Text en © 2020 The Korean Society of Medical Informatics This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Article Tougui, Ilias Jilbab, Abdelilah El Mhamdi, Jamal Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease
title	Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease
title_full	Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease
title_fullStr	Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease
title_full_unstemmed	Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease
title_short	Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease
title_sort	analysis of smartphone recordings in time, frequency, and cepstral domains to classify parkinson’s disease
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7674819/ https://www.ncbi.nlm.nih.gov/pubmed/33190461 http://dx.doi.org/10.4258/hir.2020.26.4.274
work_keys_str_mv	AT touguiilias analysisofsmartphonerecordingsintimefrequencyandcepstraldomainstoclassifyparkinsonsdisease AT jilbababdelilah analysisofsmartphonerecordingsintimefrequencyandcepstraldomainstoclassifyparkinsonsdisease AT elmhamdijamal analysisofsmartphonerecordingsintimefrequencyandcepstraldomainstoclassifyparkinsonsdisease

Analysis of Smartphone Recordings in Time, Frequency, and Cepstral Domains to Classify Parkinson’s Disease

Ejemplares similares