Cargando…

Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study

AIMS/HYPOTHESIS: Research using data-driven cluster analysis has proposed five novel subgroups of diabetes based on six measured variables in individuals with newly diagnosed diabetes. Our aim was (1) to validate the existence of differing clusters within type 2 diabetes, and (2) to compare the clus...

Descripción completa

Detalles Bibliográficos
Autores principales: Lugner, Moa, Gudbjörnsdottir, Soffia, Sattar, Naveed, Svensson, Ann-Marie, Miftaraj, Mervete, Eeg-Olofsson, Katarina, Eliasson, Björn, Franzén, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8382658/
https://www.ncbi.nlm.nih.gov/pubmed/34059937
http://dx.doi.org/10.1007/s00125-021-05485-5
_version_ 1783741579849105408
author Lugner, Moa
Gudbjörnsdottir, Soffia
Sattar, Naveed
Svensson, Ann-Marie
Miftaraj, Mervete
Eeg-Olofsson, Katarina
Eliasson, Björn
Franzén, Stefan
author_facet Lugner, Moa
Gudbjörnsdottir, Soffia
Sattar, Naveed
Svensson, Ann-Marie
Miftaraj, Mervete
Eeg-Olofsson, Katarina
Eliasson, Björn
Franzén, Stefan
author_sort Lugner, Moa
collection PubMed
description AIMS/HYPOTHESIS: Research using data-driven cluster analysis has proposed five novel subgroups of diabetes based on six measured variables in individuals with newly diagnosed diabetes. Our aim was (1) to validate the existence of differing clusters within type 2 diabetes, and (2) to compare the cluster method with an alternative strategy based on traditional methods to predict diabetes outcomes. METHODS: We used data from the Swedish National Diabetes Register and included 114,231 individuals with newly diagnosed type 2 diabetes. k-means clustering was used to identify clusters based on nine continuous variables (age at diagnosis, HbA(1c), BMI, systolic and diastolic BP, LDL- and HDL-cholesterol, triacylglycerol and eGFR). The elbow method was used to determine the optimal number of clusters and Cox regression models were used to evaluate mortality risk and risk of CVD events. The prediction models were compared using concordance statistics. RESULTS: The elbow plot, with values of k ranging from 1 to 10, showed a smooth curve without any clear cut-off points, making the optimal value of k unclear. The appearance of the plot was very similar to the elbow plot made from a simulated dataset consisting only of one cluster. In prediction models for mortality, concordance was 0.63 (95% CI 0.63, 0.64) for two clusters, 0.66 (95% CI 0.65, 0.66) for four clusters, 0.77 (95% CI 0.76, 0.77) for the ordinary Cox model and 0.78 (95% CI 0.77, 0.78) for the Cox model with smoothing splines. In prediction models for CVD events, the concordance was 0.64 (95% CI 0.63, 0.65) for two clusters, 0.66 (95% CI 0.65, 0.67) for four clusters, 0.77 (95% CI 0.77, 0.78) for the ordinary Cox model and 0.78 (95% CI 0.77, 0.78) for the Cox model with splines for all variables. CONCLUSIONS/INTERPRETATION: This nationwide observational study found no evidence supporting the existence of a specific number of distinct clusters within type 2 diabetes. The results from this study suggest that a prediction model approach using simple clinical features to predict risk of diabetes complications would be more useful than a cluster sub-stratification. GRAPHICAL ABSTRACT: [Image: see text]
format Online
Article
Text
id pubmed-8382658
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-83826582021-09-09 Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study Lugner, Moa Gudbjörnsdottir, Soffia Sattar, Naveed Svensson, Ann-Marie Miftaraj, Mervete Eeg-Olofsson, Katarina Eliasson, Björn Franzén, Stefan Diabetologia Article AIMS/HYPOTHESIS: Research using data-driven cluster analysis has proposed five novel subgroups of diabetes based on six measured variables in individuals with newly diagnosed diabetes. Our aim was (1) to validate the existence of differing clusters within type 2 diabetes, and (2) to compare the cluster method with an alternative strategy based on traditional methods to predict diabetes outcomes. METHODS: We used data from the Swedish National Diabetes Register and included 114,231 individuals with newly diagnosed type 2 diabetes. k-means clustering was used to identify clusters based on nine continuous variables (age at diagnosis, HbA(1c), BMI, systolic and diastolic BP, LDL- and HDL-cholesterol, triacylglycerol and eGFR). The elbow method was used to determine the optimal number of clusters and Cox regression models were used to evaluate mortality risk and risk of CVD events. The prediction models were compared using concordance statistics. RESULTS: The elbow plot, with values of k ranging from 1 to 10, showed a smooth curve without any clear cut-off points, making the optimal value of k unclear. The appearance of the plot was very similar to the elbow plot made from a simulated dataset consisting only of one cluster. In prediction models for mortality, concordance was 0.63 (95% CI 0.63, 0.64) for two clusters, 0.66 (95% CI 0.65, 0.66) for four clusters, 0.77 (95% CI 0.76, 0.77) for the ordinary Cox model and 0.78 (95% CI 0.77, 0.78) for the Cox model with smoothing splines. In prediction models for CVD events, the concordance was 0.64 (95% CI 0.63, 0.65) for two clusters, 0.66 (95% CI 0.65, 0.67) for four clusters, 0.77 (95% CI 0.77, 0.78) for the ordinary Cox model and 0.78 (95% CI 0.77, 0.78) for the Cox model with splines for all variables. CONCLUSIONS/INTERPRETATION: This nationwide observational study found no evidence supporting the existence of a specific number of distinct clusters within type 2 diabetes. The results from this study suggest that a prediction model approach using simple clinical features to predict risk of diabetes complications would be more useful than a cluster sub-stratification. GRAPHICAL ABSTRACT: [Image: see text] Springer Berlin Heidelberg 2021-05-31 2021 /pmc/articles/PMC8382658/ /pubmed/34059937 http://dx.doi.org/10.1007/s00125-021-05485-5 Text en © The Author(s) 2021, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Lugner, Moa
Gudbjörnsdottir, Soffia
Sattar, Naveed
Svensson, Ann-Marie
Miftaraj, Mervete
Eeg-Olofsson, Katarina
Eliasson, Björn
Franzén, Stefan
Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study
title Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study
title_full Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study
title_fullStr Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study
title_full_unstemmed Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study
title_short Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study
title_sort comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8382658/
https://www.ncbi.nlm.nih.gov/pubmed/34059937
http://dx.doi.org/10.1007/s00125-021-05485-5
work_keys_str_mv AT lugnermoa comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy
AT gudbjornsdottirsoffia comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy
AT sattarnaveed comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy
AT svenssonannmarie comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy
AT miftarajmervete comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy
AT eegolofssonkatarina comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy
AT eliassonbjorn comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy
AT franzenstefan comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy