Cargando…

Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients

Parkinson’s disease (PD) is a chronic disease. No treatment stops its progression, and it presents symptoms in multiple areas. One way to understand the PD population is to investigate the clustering of patients by demographic and clinical similarities. Previous PD cluster studies included scores fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Hendricks, Renee, Khasawneh, Mohammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8534040/
https://www.ncbi.nlm.nih.gov/pubmed/34679355
http://dx.doi.org/10.3390/brainsci11101290
_version_ 1784587459268444160
author Hendricks, Renee
Khasawneh, Mohammad
author_facet Hendricks, Renee
Khasawneh, Mohammad
author_sort Hendricks, Renee
collection PubMed
description Parkinson’s disease (PD) is a chronic disease. No treatment stops its progression, and it presents symptoms in multiple areas. One way to understand the PD population is to investigate the clustering of patients by demographic and clinical similarities. Previous PD cluster studies included scores from clinical surveys, which provide a numerical but ordinal, non-linear value. In addition, these studies did not include categorical variables, as the clustering method utilized was not applicable to categorical variables. It was discovered that the numerical values of patient age and disease duration were similar among past cluster results, pointing to the need to exclude these values. This paper proposes a novel and automatic discovery method to cluster PD patients by incorporating categorical variables. No estimate of the number of clusters is required as input, whereas the previous cluster methods require a guess from the end user in order for the method to be initiated. Using a patient dataset from the Parkinson’s Progression Markers Initiative (PPMI) website to demonstrate the new clustering technique, our results showed that this method provided an accurate separation of the patients. In addition, this method provides an explainable process and an easy way to interpret clusters and describe patient subtypes.
format Online
Article
Text
id pubmed-8534040
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85340402021-10-23 Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients Hendricks, Renee Khasawneh, Mohammad Brain Sci Article Parkinson’s disease (PD) is a chronic disease. No treatment stops its progression, and it presents symptoms in multiple areas. One way to understand the PD population is to investigate the clustering of patients by demographic and clinical similarities. Previous PD cluster studies included scores from clinical surveys, which provide a numerical but ordinal, non-linear value. In addition, these studies did not include categorical variables, as the clustering method utilized was not applicable to categorical variables. It was discovered that the numerical values of patient age and disease duration were similar among past cluster results, pointing to the need to exclude these values. This paper proposes a novel and automatic discovery method to cluster PD patients by incorporating categorical variables. No estimate of the number of clusters is required as input, whereas the previous cluster methods require a guess from the end user in order for the method to be initiated. Using a patient dataset from the Parkinson’s Progression Markers Initiative (PPMI) website to demonstrate the new clustering technique, our results showed that this method provided an accurate separation of the patients. In addition, this method provides an explainable process and an easy way to interpret clusters and describe patient subtypes. MDPI 2021-09-29 /pmc/articles/PMC8534040/ /pubmed/34679355 http://dx.doi.org/10.3390/brainsci11101290 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hendricks, Renee
Khasawneh, Mohammad
Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients
title Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients
title_full Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients
title_fullStr Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients
title_full_unstemmed Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients
title_short Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients
title_sort cluster analysis of categorical variables of parkinson’s disease patients
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8534040/
https://www.ncbi.nlm.nih.gov/pubmed/34679355
http://dx.doi.org/10.3390/brainsci11101290
work_keys_str_mv AT hendricksrenee clusteranalysisofcategoricalvariablesofparkinsonsdiseasepatients
AT khasawnehmohammad clusteranalysisofcategoricalvariablesofparkinsonsdiseasepatients