Cargando…

Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery

Predicting recovery after trauma is important to provide patients a perspective on their estimated future health, to engage in shared decision making and target interventions to relevant patient groups. In the present study, several unsupervised techniques are employed to cluster patients based on l...

Descripción completa

Detalles Bibliográficos
Autores principales: Stoitsas, Kostas, Bahulikar, Saurabh, de Munter, Leonie, de Jongh, Mariska A. C., Jansen, Maria A. C., Jung, Merel M., van Wingerden, Marijn, Van Deun, Katrijn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9550811/
https://www.ncbi.nlm.nih.gov/pubmed/36216874
http://dx.doi.org/10.1038/s41598-022-21390-2
_version_ 1784805965927809024
author Stoitsas, Kostas
Bahulikar, Saurabh
de Munter, Leonie
de Jongh, Mariska A. C.
Jansen, Maria A. C.
Jung, Merel M.
van Wingerden, Marijn
Van Deun, Katrijn
author_facet Stoitsas, Kostas
Bahulikar, Saurabh
de Munter, Leonie
de Jongh, Mariska A. C.
Jansen, Maria A. C.
Jung, Merel M.
van Wingerden, Marijn
Van Deun, Katrijn
author_sort Stoitsas, Kostas
collection PubMed
description Predicting recovery after trauma is important to provide patients a perspective on their estimated future health, to engage in shared decision making and target interventions to relevant patient groups. In the present study, several unsupervised techniques are employed to cluster patients based on longitudinal recovery profiles. Subsequently, these data-driven clusters were assessed on clinical validity by experts and used as targets in supervised machine learning models. We present a formalised analysis of the obtained clusters that incorporates evaluation of (i) statistical and machine learning metrics, (ii) clusters clinical validity with descriptive statistics and medical expertise. Clusters quality assessment revealed that clusters obtained through a Bayesian method (High Dimensional Supervised Classification and Clustering) and a Deep Gaussian Mixture model, in combination with oversampling and a Random Forest for supervised learning of the cluster assignments provided among the most clinically sensible partitioning of patients. Other methods that obtained higher classification accuracy suffered from cluster solutions with large majority classes or clinically less sensible classes. Models that used just physical or a mix of physical and psychological outcomes proved to be among the most sensible, suggesting that clustering on psychological outcomes alone yields recovery profiles that do not conform to known risk factors.
format Online
Article
Text
id pubmed-9550811
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95508112022-10-12 Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery Stoitsas, Kostas Bahulikar, Saurabh de Munter, Leonie de Jongh, Mariska A. C. Jansen, Maria A. C. Jung, Merel M. van Wingerden, Marijn Van Deun, Katrijn Sci Rep Article Predicting recovery after trauma is important to provide patients a perspective on their estimated future health, to engage in shared decision making and target interventions to relevant patient groups. In the present study, several unsupervised techniques are employed to cluster patients based on longitudinal recovery profiles. Subsequently, these data-driven clusters were assessed on clinical validity by experts and used as targets in supervised machine learning models. We present a formalised analysis of the obtained clusters that incorporates evaluation of (i) statistical and machine learning metrics, (ii) clusters clinical validity with descriptive statistics and medical expertise. Clusters quality assessment revealed that clusters obtained through a Bayesian method (High Dimensional Supervised Classification and Clustering) and a Deep Gaussian Mixture model, in combination with oversampling and a Random Forest for supervised learning of the cluster assignments provided among the most clinically sensible partitioning of patients. Other methods that obtained higher classification accuracy suffered from cluster solutions with large majority classes or clinically less sensible classes. Models that used just physical or a mix of physical and psychological outcomes proved to be among the most sensible, suggesting that clustering on psychological outcomes alone yields recovery profiles that do not conform to known risk factors. Nature Publishing Group UK 2022-10-10 /pmc/articles/PMC9550811/ /pubmed/36216874 http://dx.doi.org/10.1038/s41598-022-21390-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Stoitsas, Kostas
Bahulikar, Saurabh
de Munter, Leonie
de Jongh, Mariska A. C.
Jansen, Maria A. C.
Jung, Merel M.
van Wingerden, Marijn
Van Deun, Katrijn
Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery
title Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery
title_full Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery
title_fullStr Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery
title_full_unstemmed Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery
title_short Clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery
title_sort clustering of trauma patients based on longitudinal data and the application of machine learning to predict recovery
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9550811/
https://www.ncbi.nlm.nih.gov/pubmed/36216874
http://dx.doi.org/10.1038/s41598-022-21390-2
work_keys_str_mv AT stoitsaskostas clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery
AT bahulikarsaurabh clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery
AT demunterleonie clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery
AT dejonghmariskaac clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery
AT jansenmariaac clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery
AT jungmerelm clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery
AT vanwingerdenmarijn clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery
AT vandeunkatrijn clusteringoftraumapatientsbasedonlongitudinaldataandtheapplicationofmachinelearningtopredictrecovery