Cargando…

Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults

Purpose: We aimed to establish and validate a risk assessment system that combines demographic and clinical variables to predict the 3-year risk of incident diabetes in Chinese adults. Methods: A 3-year cohort study was performed on 15,928 Chinese adults without diabetes at baseline. All participant...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wu, Yang, Hu, Haofei, Cai, Jinlin, Chen, Runtian, Zuo, Xin, Cheng, Heng, Yan, Dewen
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Public Health
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275929/ https://www.ncbi.nlm.nih.gov/pubmed/34268283 http://dx.doi.org/10.3389/fpubh.2021.626331

_version_	1783721810582306816
author	Wu, Yang Hu, Haofei Cai, Jinlin Chen, Runtian Zuo, Xin Cheng, Heng Yan, Dewen
author_facet	Wu, Yang Hu, Haofei Cai, Jinlin Chen, Runtian Zuo, Xin Cheng, Heng Yan, Dewen
author_sort	Wu, Yang
collection	PubMed
description	Purpose: We aimed to establish and validate a risk assessment system that combines demographic and clinical variables to predict the 3-year risk of incident diabetes in Chinese adults. Methods: A 3-year cohort study was performed on 15,928 Chinese adults without diabetes at baseline. All participants were randomly divided into a training set (n = 7,940) and a validation set (n = 7,988). XGBoost method is an effective machine learning technique used to select the most important variables from candidate variables. And we further established a stepwise model based on the predictors chosen by the XGBoost model. The area under the receiver operating characteristic curve (AUC), decision curve and calibration analysis were used to assess discrimination, clinical use and calibration of the model, respectively. The external validation was performed on a cohort of 11,113 Japanese participants. Result: In the training and validation sets, 148 and 145 incident diabetes cases occurred. XGBoost methods selected the 10 most important variables from 15 candidate variables. Fasting plasma glucose (FPG), body mass index (BMI) and age were the top 3 important variables. And we further established a stepwise model and a prediction nomogram. The AUCs of the stepwise model were 0.933 and 0.910 in the training and validation sets, respectively. The Hosmer-Lemeshow test showed a perfect fit between the predicted diabetes risk and the observed diabetes risk (p = 0.068 for the training set, p = 0.165 for the validation set). Decision curve analysis presented the clinical use of the stepwise model and there was a wide range of alternative threshold probability spectrum. And there were almost no the interactions between these predictors (most P-values for interaction >0.05). Furthermore, the AUC for the external validation set was 0.830, and the Hosmer-Lemeshow test for the external validation set showed no statistically significant difference between the predicted diabetes risk and observed diabetes risk (P = 0.824). Conclusion: We established and validated a risk assessment system for characterizing the 3-year risk of incident diabetes.
format	Online Article Text
id	pubmed-8275929
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-82759292021-07-14 Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults Wu, Yang Hu, Haofei Cai, Jinlin Chen, Runtian Zuo, Xin Cheng, Heng Yan, Dewen Front Public Health Public Health Purpose: We aimed to establish and validate a risk assessment system that combines demographic and clinical variables to predict the 3-year risk of incident diabetes in Chinese adults. Methods: A 3-year cohort study was performed on 15,928 Chinese adults without diabetes at baseline. All participants were randomly divided into a training set (n = 7,940) and a validation set (n = 7,988). XGBoost method is an effective machine learning technique used to select the most important variables from candidate variables. And we further established a stepwise model based on the predictors chosen by the XGBoost model. The area under the receiver operating characteristic curve (AUC), decision curve and calibration analysis were used to assess discrimination, clinical use and calibration of the model, respectively. The external validation was performed on a cohort of 11,113 Japanese participants. Result: In the training and validation sets, 148 and 145 incident diabetes cases occurred. XGBoost methods selected the 10 most important variables from 15 candidate variables. Fasting plasma glucose (FPG), body mass index (BMI) and age were the top 3 important variables. And we further established a stepwise model and a prediction nomogram. The AUCs of the stepwise model were 0.933 and 0.910 in the training and validation sets, respectively. The Hosmer-Lemeshow test showed a perfect fit between the predicted diabetes risk and the observed diabetes risk (p = 0.068 for the training set, p = 0.165 for the validation set). Decision curve analysis presented the clinical use of the stepwise model and there was a wide range of alternative threshold probability spectrum. And there were almost no the interactions between these predictors (most P-values for interaction >0.05). Furthermore, the AUC for the external validation set was 0.830, and the Hosmer-Lemeshow test for the external validation set showed no statistically significant difference between the predicted diabetes risk and observed diabetes risk (P = 0.824). Conclusion: We established and validated a risk assessment system for characterizing the 3-year risk of incident diabetes. Frontiers Media S.A. 2021-06-29 /pmc/articles/PMC8275929/ /pubmed/34268283 http://dx.doi.org/10.3389/fpubh.2021.626331 Text en Copyright © 2021 Wu, Hu, Cai, Chen, Zuo, Cheng and Yan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Public Health Wu, Yang Hu, Haofei Cai, Jinlin Chen, Runtian Zuo, Xin Cheng, Heng Yan, Dewen Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults
title	Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults
title_full	Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults
title_fullStr	Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults
title_full_unstemmed	Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults
title_short	Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults
title_sort	machine learning for predicting the 3-year risk of incident diabetes in chinese adults
topic	Public Health
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275929/ https://www.ncbi.nlm.nih.gov/pubmed/34268283 http://dx.doi.org/10.3389/fpubh.2021.626331
work_keys_str_mv	AT wuyang machinelearningforpredictingthe3yearriskofincidentdiabetesinchineseadults AT huhaofei machinelearningforpredictingthe3yearriskofincidentdiabetesinchineseadults AT caijinlin machinelearningforpredictingthe3yearriskofincidentdiabetesinchineseadults AT chenruntian machinelearningforpredictingthe3yearriskofincidentdiabetesinchineseadults AT zuoxin machinelearningforpredictingthe3yearriskofincidentdiabetesinchineseadults AT chengheng machinelearningforpredictingthe3yearriskofincidentdiabetesinchineseadults AT yandewen machinelearningforpredictingthe3yearriskofincidentdiabetesinchineseadults

Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults

Ejemplares similares