Cargando…

Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker

OBJECTIVES: This study presents PD Predict, a machine learning system for Parkinson disease classification using voice as a biomarker. METHODS: We first created an original set of recordings from the mPower study, and then extracted several audio features, such as mel-frequency cepstral coefficient...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tougui, Ilias, Jilbab, Abdelilah, El Mhamdi, Jamal
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Korean Society of Medical Informatics 2022
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9388925/ https://www.ncbi.nlm.nih.gov/pubmed/35982595 http://dx.doi.org/10.4258/hir.2022.28.3.210

_version_	1784770321546477568
author	Tougui, Ilias Jilbab, Abdelilah El Mhamdi, Jamal
author_facet	Tougui, Ilias Jilbab, Abdelilah El Mhamdi, Jamal
author_sort	Tougui, Ilias
collection	PubMed
description	OBJECTIVES: This study presents PD Predict, a machine learning system for Parkinson disease classification using voice as a biomarker. METHODS: We first created an original set of recordings from the mPower study, and then extracted several audio features, such as mel-frequency cepstral coefficient (MFCC) components and other classical speech features, using a windowing procedure. The generated dataset was then divided into training and holdout sets. The training set was used to train two machine learning pipelines, and their performance was estimated using a nested subject-wise cross-validation approach. The holdout set was used to assess the generalizability of the pipelines for unseen data. The final pipelines were implemented in PD Predict and accessed through a prediction endpoint developed using the Django REST Framework. PD Predict is a two-component system: a desktop application that records audio recordings, extracts audio features, and makes predictions; and a server-side web application that implements the machine learning pipelines and processes incoming requests with the extracted audio features to make predictions. Our system is deployed and accessible via the following link: https://pdpredict.herokuapp.com/. RESULTS: Both machine learning pipelines showed moderate performance, between 65% and 75% using the nested subject-wise cross-validation approach. Furthermore, they generalized well to unseen data and they did not overfit the training set. CONCLUSIONS: The architecture of PD Predict is clear, and the performance of the implemented machine learning pipelines is promising and confirms the usability of smartphone microphones for capturing digital biomarkers of disease.
format	Online Article Text
id	pubmed-9388925
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Korean Society of Medical Informatics
record_format	MEDLINE/PubMed
spelling	pubmed-93889252022-08-23 Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker Tougui, Ilias Jilbab, Abdelilah El Mhamdi, Jamal Healthc Inform Res Original Article OBJECTIVES: This study presents PD Predict, a machine learning system for Parkinson disease classification using voice as a biomarker. METHODS: We first created an original set of recordings from the mPower study, and then extracted several audio features, such as mel-frequency cepstral coefficient (MFCC) components and other classical speech features, using a windowing procedure. The generated dataset was then divided into training and holdout sets. The training set was used to train two machine learning pipelines, and their performance was estimated using a nested subject-wise cross-validation approach. The holdout set was used to assess the generalizability of the pipelines for unseen data. The final pipelines were implemented in PD Predict and accessed through a prediction endpoint developed using the Django REST Framework. PD Predict is a two-component system: a desktop application that records audio recordings, extracts audio features, and makes predictions; and a server-side web application that implements the machine learning pipelines and processes incoming requests with the extracted audio features to make predictions. Our system is deployed and accessible via the following link: https://pdpredict.herokuapp.com/. RESULTS: Both machine learning pipelines showed moderate performance, between 65% and 75% using the nested subject-wise cross-validation approach. Furthermore, they generalized well to unseen data and they did not overfit the training set. CONCLUSIONS: The architecture of PD Predict is clear, and the performance of the implemented machine learning pipelines is promising and confirms the usability of smartphone microphones for capturing digital biomarkers of disease. Korean Society of Medical Informatics 2022-07 2022-07-31 /pmc/articles/PMC9388925/ /pubmed/35982595 http://dx.doi.org/10.4258/hir.2022.28.3.210 Text en © 2022 The Korean Society of Medical Informatics https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Article Tougui, Ilias Jilbab, Abdelilah El Mhamdi, Jamal Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker
title	Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker
title_full	Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker
title_fullStr	Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker
title_full_unstemmed	Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker
title_short	Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker
title_sort	machine learning smart system for parkinson disease classification using the voice as a biomarker
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9388925/ https://www.ncbi.nlm.nih.gov/pubmed/35982595 http://dx.doi.org/10.4258/hir.2022.28.3.210
work_keys_str_mv	AT touguiilias machinelearningsmartsystemforparkinsondiseaseclassificationusingthevoiceasabiomarker AT jilbababdelilah machinelearningsmartsystemforparkinsondiseaseclassificationusingthevoiceasabiomarker AT elmhamdijamal machinelearningsmartsystemforparkinsondiseaseclassificationusingthevoiceasabiomarker

Machine Learning Smart System for Parkinson Disease Classification Using the Voice as a Biomarker

Ejemplares similares