Cargando…
Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study
AIMS/HYPOTHESIS: Research using data-driven cluster analysis has proposed five novel subgroups of diabetes based on six measured variables in individuals with newly diagnosed diabetes. Our aim was (1) to validate the existence of differing clusters within type 2 diabetes, and (2) to compare the clus...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8382658/ https://www.ncbi.nlm.nih.gov/pubmed/34059937 http://dx.doi.org/10.1007/s00125-021-05485-5 |
_version_ | 1783741579849105408 |
---|---|
author | Lugner, Moa Gudbjörnsdottir, Soffia Sattar, Naveed Svensson, Ann-Marie Miftaraj, Mervete Eeg-Olofsson, Katarina Eliasson, Björn Franzén, Stefan |
author_facet | Lugner, Moa Gudbjörnsdottir, Soffia Sattar, Naveed Svensson, Ann-Marie Miftaraj, Mervete Eeg-Olofsson, Katarina Eliasson, Björn Franzén, Stefan |
author_sort | Lugner, Moa |
collection | PubMed |
description | AIMS/HYPOTHESIS: Research using data-driven cluster analysis has proposed five novel subgroups of diabetes based on six measured variables in individuals with newly diagnosed diabetes. Our aim was (1) to validate the existence of differing clusters within type 2 diabetes, and (2) to compare the cluster method with an alternative strategy based on traditional methods to predict diabetes outcomes. METHODS: We used data from the Swedish National Diabetes Register and included 114,231 individuals with newly diagnosed type 2 diabetes. k-means clustering was used to identify clusters based on nine continuous variables (age at diagnosis, HbA(1c), BMI, systolic and diastolic BP, LDL- and HDL-cholesterol, triacylglycerol and eGFR). The elbow method was used to determine the optimal number of clusters and Cox regression models were used to evaluate mortality risk and risk of CVD events. The prediction models were compared using concordance statistics. RESULTS: The elbow plot, with values of k ranging from 1 to 10, showed a smooth curve without any clear cut-off points, making the optimal value of k unclear. The appearance of the plot was very similar to the elbow plot made from a simulated dataset consisting only of one cluster. In prediction models for mortality, concordance was 0.63 (95% CI 0.63, 0.64) for two clusters, 0.66 (95% CI 0.65, 0.66) for four clusters, 0.77 (95% CI 0.76, 0.77) for the ordinary Cox model and 0.78 (95% CI 0.77, 0.78) for the Cox model with smoothing splines. In prediction models for CVD events, the concordance was 0.64 (95% CI 0.63, 0.65) for two clusters, 0.66 (95% CI 0.65, 0.67) for four clusters, 0.77 (95% CI 0.77, 0.78) for the ordinary Cox model and 0.78 (95% CI 0.77, 0.78) for the Cox model with splines for all variables. CONCLUSIONS/INTERPRETATION: This nationwide observational study found no evidence supporting the existence of a specific number of distinct clusters within type 2 diabetes. The results from this study suggest that a prediction model approach using simple clinical features to predict risk of diabetes complications would be more useful than a cluster sub-stratification. GRAPHICAL ABSTRACT: [Image: see text] |
format | Online Article Text |
id | pubmed-8382658 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-83826582021-09-09 Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study Lugner, Moa Gudbjörnsdottir, Soffia Sattar, Naveed Svensson, Ann-Marie Miftaraj, Mervete Eeg-Olofsson, Katarina Eliasson, Björn Franzén, Stefan Diabetologia Article AIMS/HYPOTHESIS: Research using data-driven cluster analysis has proposed five novel subgroups of diabetes based on six measured variables in individuals with newly diagnosed diabetes. Our aim was (1) to validate the existence of differing clusters within type 2 diabetes, and (2) to compare the cluster method with an alternative strategy based on traditional methods to predict diabetes outcomes. METHODS: We used data from the Swedish National Diabetes Register and included 114,231 individuals with newly diagnosed type 2 diabetes. k-means clustering was used to identify clusters based on nine continuous variables (age at diagnosis, HbA(1c), BMI, systolic and diastolic BP, LDL- and HDL-cholesterol, triacylglycerol and eGFR). The elbow method was used to determine the optimal number of clusters and Cox regression models were used to evaluate mortality risk and risk of CVD events. The prediction models were compared using concordance statistics. RESULTS: The elbow plot, with values of k ranging from 1 to 10, showed a smooth curve without any clear cut-off points, making the optimal value of k unclear. The appearance of the plot was very similar to the elbow plot made from a simulated dataset consisting only of one cluster. In prediction models for mortality, concordance was 0.63 (95% CI 0.63, 0.64) for two clusters, 0.66 (95% CI 0.65, 0.66) for four clusters, 0.77 (95% CI 0.76, 0.77) for the ordinary Cox model and 0.78 (95% CI 0.77, 0.78) for the Cox model with smoothing splines. In prediction models for CVD events, the concordance was 0.64 (95% CI 0.63, 0.65) for two clusters, 0.66 (95% CI 0.65, 0.67) for four clusters, 0.77 (95% CI 0.77, 0.78) for the ordinary Cox model and 0.78 (95% CI 0.77, 0.78) for the Cox model with splines for all variables. CONCLUSIONS/INTERPRETATION: This nationwide observational study found no evidence supporting the existence of a specific number of distinct clusters within type 2 diabetes. The results from this study suggest that a prediction model approach using simple clinical features to predict risk of diabetes complications would be more useful than a cluster sub-stratification. GRAPHICAL ABSTRACT: [Image: see text] Springer Berlin Heidelberg 2021-05-31 2021 /pmc/articles/PMC8382658/ /pubmed/34059937 http://dx.doi.org/10.1007/s00125-021-05485-5 Text en © The Author(s) 2021, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Lugner, Moa Gudbjörnsdottir, Soffia Sattar, Naveed Svensson, Ann-Marie Miftaraj, Mervete Eeg-Olofsson, Katarina Eliasson, Björn Franzén, Stefan Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study |
title | Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study |
title_full | Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study |
title_fullStr | Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study |
title_full_unstemmed | Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study |
title_short | Comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study |
title_sort | comparison between data-driven clusters and models based on clinical features to predict outcomes in type 2 diabetes: nationwide observational study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8382658/ https://www.ncbi.nlm.nih.gov/pubmed/34059937 http://dx.doi.org/10.1007/s00125-021-05485-5 |
work_keys_str_mv | AT lugnermoa comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy AT gudbjornsdottirsoffia comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy AT sattarnaveed comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy AT svenssonannmarie comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy AT miftarajmervete comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy AT eegolofssonkatarina comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy AT eliassonbjorn comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy AT franzenstefan comparisonbetweendatadrivenclustersandmodelsbasedonclinicalfeaturestopredictoutcomesintype2diabetesnationwideobservationalstudy |