Cargando…
Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients
Parkinson’s disease (PD) is a chronic disease. No treatment stops its progression, and it presents symptoms in multiple areas. One way to understand the PD population is to investigate the clustering of patients by demographic and clinical similarities. Previous PD cluster studies included scores fr...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8534040/ https://www.ncbi.nlm.nih.gov/pubmed/34679355 http://dx.doi.org/10.3390/brainsci11101290 |
_version_ | 1784587459268444160 |
---|---|
author | Hendricks, Renee Khasawneh, Mohammad |
author_facet | Hendricks, Renee Khasawneh, Mohammad |
author_sort | Hendricks, Renee |
collection | PubMed |
description | Parkinson’s disease (PD) is a chronic disease. No treatment stops its progression, and it presents symptoms in multiple areas. One way to understand the PD population is to investigate the clustering of patients by demographic and clinical similarities. Previous PD cluster studies included scores from clinical surveys, which provide a numerical but ordinal, non-linear value. In addition, these studies did not include categorical variables, as the clustering method utilized was not applicable to categorical variables. It was discovered that the numerical values of patient age and disease duration were similar among past cluster results, pointing to the need to exclude these values. This paper proposes a novel and automatic discovery method to cluster PD patients by incorporating categorical variables. No estimate of the number of clusters is required as input, whereas the previous cluster methods require a guess from the end user in order for the method to be initiated. Using a patient dataset from the Parkinson’s Progression Markers Initiative (PPMI) website to demonstrate the new clustering technique, our results showed that this method provided an accurate separation of the patients. In addition, this method provides an explainable process and an easy way to interpret clusters and describe patient subtypes. |
format | Online Article Text |
id | pubmed-8534040 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-85340402021-10-23 Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients Hendricks, Renee Khasawneh, Mohammad Brain Sci Article Parkinson’s disease (PD) is a chronic disease. No treatment stops its progression, and it presents symptoms in multiple areas. One way to understand the PD population is to investigate the clustering of patients by demographic and clinical similarities. Previous PD cluster studies included scores from clinical surveys, which provide a numerical but ordinal, non-linear value. In addition, these studies did not include categorical variables, as the clustering method utilized was not applicable to categorical variables. It was discovered that the numerical values of patient age and disease duration were similar among past cluster results, pointing to the need to exclude these values. This paper proposes a novel and automatic discovery method to cluster PD patients by incorporating categorical variables. No estimate of the number of clusters is required as input, whereas the previous cluster methods require a guess from the end user in order for the method to be initiated. Using a patient dataset from the Parkinson’s Progression Markers Initiative (PPMI) website to demonstrate the new clustering technique, our results showed that this method provided an accurate separation of the patients. In addition, this method provides an explainable process and an easy way to interpret clusters and describe patient subtypes. MDPI 2021-09-29 /pmc/articles/PMC8534040/ /pubmed/34679355 http://dx.doi.org/10.3390/brainsci11101290 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Hendricks, Renee Khasawneh, Mohammad Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients |
title | Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients |
title_full | Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients |
title_fullStr | Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients |
title_full_unstemmed | Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients |
title_short | Cluster Analysis of Categorical Variables of Parkinson’s Disease Patients |
title_sort | cluster analysis of categorical variables of parkinson’s disease patients |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8534040/ https://www.ncbi.nlm.nih.gov/pubmed/34679355 http://dx.doi.org/10.3390/brainsci11101290 |
work_keys_str_mv | AT hendricksrenee clusteranalysisofcategoricalvariablesofparkinsonsdiseasepatients AT khasawnehmohammad clusteranalysisofcategoricalvariablesofparkinsonsdiseasepatients |