Cargando…
Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery
Predicting recovery after trauma is important to provide patients a perspective on their estimated future health, to engage in shared decision making and target interventions to relevant patient groups. In the present study, several unsupervised techniques are employed to cluster patients based on l...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9550811/ https://www.ncbi.nlm.nih.gov/pubmed/36216874 http://dx.doi.org/10.1038/s41598-022-21390-2 |
_version_ | 1784805965927809024 |
---|---|
author | Stoitsas, Kostas Bahulikar, Saurabh de Munter, Leonie de Jongh, Mariska A. C. Jansen, Maria A. C. Jung, Merel M. van Wingerden, Marijn Van Deun, Katrijn |
author_facet | Stoitsas, Kostas Bahulikar, Saurabh de Munter, Leonie de Jongh, Mariska A. C. Jansen, Maria A. C. Jung, Merel M. van Wingerden, Marijn Van Deun, Katrijn |
author_sort | Stoitsas, Kostas |
collection | PubMed |
description | Predicting recovery after trauma is important to provide patients a perspective on their estimated future health, to engage in shared decision making and target interventions to relevant patient groups. In the present study, several unsupervised techniques are employed to cluster patients based on longitudinal recovery profiles. Subsequently, these data-driven clusters were assessed on clinical validity by experts and used as targets in supervised machine learning models. We present a formalised analysis of the obtained clusters that incorporates evaluation of (i) statistical and machine learning metrics, (ii) clusters clinical validity with descriptive statistics and medical expertise. Clusters quality assessment revealed that clusters obtained through a Bayesian method (High Dimensional Supervised Classification and Clustering) and a Deep Gaussian Mixture model, in combination with oversampling and a Random Forest for supervised learning of the cluster assignments provided among the most clinically sensible partitioning of patients. Other methods that obtained higher classification accuracy suffered from cluster solutions with large majority classes or clinically less sensible classes. Models that used just physical or a mix of physical and psychological outcomes proved to be among the most sensible, suggesting that clustering on psychological outcomes alone yields recovery profiles that do not conform to known risk factors. |
format | Online Article Text |
id | pubmed-9550811 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-95508112022-10-12 Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery Stoitsas, Kostas Bahulikar, Saurabh de Munter, Leonie de Jongh, Mariska A. C. Jansen, Maria A. C. Jung, Merel M. van Wingerden, Marijn Van Deun, Katrijn Sci Rep Article Predicting recovery after trauma is important to provide patients a perspective on their estimated future health, to engage in shared decision making and target interventions to relevant patient groups. In the present study, several unsupervised techniques are employed to cluster patients based on longitudinal recovery profiles. Subsequently, these data-driven clusters were assessed on clinical validity by experts and used as targets in supervised machine learning models. We present a formalised analysis of the obtained clusters that incorporates evaluation of (i) statistical and machine learning metrics, (ii) clusters clinical validity with descriptive statistics and medical expertise. Clusters quality assessment revealed that clusters obtained through a Bayesian method (High Dimensional Supervised Classification and Clustering) and a Deep Gaussian Mixture model, in combination with oversampling and a Random Forest for supervised learning of the cluster assignments provided among the most clinically sensible partitioning of patients. Other methods that obtained higher classification accuracy suffered from cluster solutions with large majority classes or clinically less sensible classes. Models that used just physical or a mix of physical and psychological outcomes proved to be among the most sensible, suggesting that clustering on psychological outcomes alone yields recovery profiles that do not conform to known risk factors. Nature Publishing Group UK 2022-10-10 /pmc/articles/PMC9550811/ /pubmed/36216874 http://dx.doi.org/10.1038/s41598-022-21390-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Stoitsas, Kostas Bahulikar, Saurabh de Munter, Leonie de Jongh, Mariska A. C. Jansen, Maria A. C. Jung, Merel M. van Wingerden, Marijn Van Deun, Katrijn Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery |
title | Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery |
title_full | Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery |
title_fullStr | Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery |
title_full_unstemmed | Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery |
title_short | Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery |
title_sort | clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9550811/ https://www.ncbi.nlm.nih.gov/pubmed/36216874 http://dx.doi.org/10.1038/s41598-022-21390-2 |
work_keys_str_mv | AT stoitsaskostas clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery AT bahulikarsaurabh clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery AT demunterleonie clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery AT dejonghmariskaac clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery AT jansenmariaac clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery AT jungmerelm clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery AT vanwingerdenmarijn clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery AT vandeunkatrijn clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery |