Cargando…

Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study

BACKGROUND: Previous conventional models for the prediction of diabetes could be updated by incorporating the increasing amount of health data available and new risk prediction methodology. OBJECTIVE: We aimed to develop a substantially improved diabetes risk prediction model using sophisticated mac...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Lei, Shang, Xianwen, Sreedharan, Subhashaan, Yan, Xixi, Liu, Jianbin, Keel, Stuart, Wu, Jinrong, Peng, Wei, He, Mingguang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7420582/
https://www.ncbi.nlm.nih.gov/pubmed/32720912
http://dx.doi.org/10.2196/16850
_version_ 1783569964590956544
author Zhang, Lei
Shang, Xianwen
Sreedharan, Subhashaan
Yan, Xixi
Liu, Jianbin
Keel, Stuart
Wu, Jinrong
Peng, Wei
He, Mingguang
author_facet Zhang, Lei
Shang, Xianwen
Sreedharan, Subhashaan
Yan, Xixi
Liu, Jianbin
Keel, Stuart
Wu, Jinrong
Peng, Wei
He, Mingguang
author_sort Zhang, Lei
collection PubMed
description BACKGROUND: Previous conventional models for the prediction of diabetes could be updated by incorporating the increasing amount of health data available and new risk prediction methodology. OBJECTIVE: We aimed to develop a substantially improved diabetes risk prediction model using sophisticated machine-learning algorithms based on a large retrospective population cohort of over 230,000 people who were enrolled in the study during 2006-2017. METHODS: We collected demographic, medical, behavioral, and incidence data for type 2 diabetes mellitus (T2DM) in over 236,684 diabetes-free participants recruited from the 45 and Up Study. We predicted and compared the risk of diabetes onset in these participants at 3, 5, 7, and 10 years based on three machine-learning approaches and the conventional regression model. RESULTS: Overall, 6.05% (14,313/236,684) of the participants developed T2DM during an average 8.8-year follow-up period. The 10-year diabetes incidence in men was 8.30% (8.08%-8.49%), which was significantly higher (odds ratio 1.37, 95% CI 1.32-1.41) than that in women at 6.20% (6.00%-6.40%). The incidence of T2DM was doubled in individuals with obesity (men: 17.78% [17.05%-18.43%]; women: 14.59% [13.99%-15.17%]) compared with that of nonobese individuals. The gradient boosting machine model showed the best performance among the four models (area under the curve of 79% in 3-year prediction and 75% in 10-year prediction). All machine-learning models predicted BMI as the most significant factor contributing to diabetes onset, which explained 12%-50% of the variance in the prediction of diabetes. The model predicted that if BMI in obese and overweight participants could be hypothetically reduced to a healthy range, the 10-year probability of diabetes onset would be significantly reduced from 8.3% to 2.8% (P<.001). CONCLUSIONS: A one-time self-reported survey can accurately predict the risk of diabetes using a machine-learning approach. Achieving a healthy BMI can significantly reduce the risk of developing T2DM.
format Online
Article
Text
id pubmed-7420582
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-74205822020-08-20 Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study Zhang, Lei Shang, Xianwen Sreedharan, Subhashaan Yan, Xixi Liu, Jianbin Keel, Stuart Wu, Jinrong Peng, Wei He, Mingguang JMIR Med Inform Original Paper BACKGROUND: Previous conventional models for the prediction of diabetes could be updated by incorporating the increasing amount of health data available and new risk prediction methodology. OBJECTIVE: We aimed to develop a substantially improved diabetes risk prediction model using sophisticated machine-learning algorithms based on a large retrospective population cohort of over 230,000 people who were enrolled in the study during 2006-2017. METHODS: We collected demographic, medical, behavioral, and incidence data for type 2 diabetes mellitus (T2DM) in over 236,684 diabetes-free participants recruited from the 45 and Up Study. We predicted and compared the risk of diabetes onset in these participants at 3, 5, 7, and 10 years based on three machine-learning approaches and the conventional regression model. RESULTS: Overall, 6.05% (14,313/236,684) of the participants developed T2DM during an average 8.8-year follow-up period. The 10-year diabetes incidence in men was 8.30% (8.08%-8.49%), which was significantly higher (odds ratio 1.37, 95% CI 1.32-1.41) than that in women at 6.20% (6.00%-6.40%). The incidence of T2DM was doubled in individuals with obesity (men: 17.78% [17.05%-18.43%]; women: 14.59% [13.99%-15.17%]) compared with that of nonobese individuals. The gradient boosting machine model showed the best performance among the four models (area under the curve of 79% in 3-year prediction and 75% in 10-year prediction). All machine-learning models predicted BMI as the most significant factor contributing to diabetes onset, which explained 12%-50% of the variance in the prediction of diabetes. The model predicted that if BMI in obese and overweight participants could be hypothetically reduced to a healthy range, the 10-year probability of diabetes onset would be significantly reduced from 8.3% to 2.8% (P<.001). CONCLUSIONS: A one-time self-reported survey can accurately predict the risk of diabetes using a machine-learning approach. Achieving a healthy BMI can significantly reduce the risk of developing T2DM. JMIR Publications 2020-07-28 /pmc/articles/PMC7420582/ /pubmed/32720912 http://dx.doi.org/10.2196/16850 Text en ©Lei Zhang, Xianwen Shang, Subhashaan Sreedharan, Xixi Yan, Jianbin Liu, Stuart Keel, Jinrong Wu, Wei Peng, Mingguang He. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 28.07.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Zhang, Lei
Shang, Xianwen
Sreedharan, Subhashaan
Yan, Xixi
Liu, Jianbin
Keel, Stuart
Wu, Jinrong
Peng, Wei
He, Mingguang
Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study
title Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study
title_full Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study
title_fullStr Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study
title_full_unstemmed Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study
title_short Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study
title_sort predicting the development of type 2 diabetes in a large australian cohort using machine-learning techniques: longitudinal survey study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7420582/
https://www.ncbi.nlm.nih.gov/pubmed/32720912
http://dx.doi.org/10.2196/16850
work_keys_str_mv AT zhanglei predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy
AT shangxianwen predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy
AT sreedharansubhashaan predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy
AT yanxixi predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy
AT liujianbin predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy
AT keelstuart predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy
AT wujinrong predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy
AT pengwei predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy
AT hemingguang predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy