Cargando…
Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study
BACKGROUND: Previous conventional models for the prediction of diabetes could be updated by incorporating the increasing amount of health data available and new risk prediction methodology. OBJECTIVE: We aimed to develop a substantially improved diabetes risk prediction model using sophisticated mac...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7420582/ https://www.ncbi.nlm.nih.gov/pubmed/32720912 http://dx.doi.org/10.2196/16850 |
_version_ | 1783569964590956544 |
---|---|
author | Zhang, Lei Shang, Xianwen Sreedharan, Subhashaan Yan, Xixi Liu, Jianbin Keel, Stuart Wu, Jinrong Peng, Wei He, Mingguang |
author_facet | Zhang, Lei Shang, Xianwen Sreedharan, Subhashaan Yan, Xixi Liu, Jianbin Keel, Stuart Wu, Jinrong Peng, Wei He, Mingguang |
author_sort | Zhang, Lei |
collection | PubMed |
description | BACKGROUND: Previous conventional models for the prediction of diabetes could be updated by incorporating the increasing amount of health data available and new risk prediction methodology. OBJECTIVE: We aimed to develop a substantially improved diabetes risk prediction model using sophisticated machine-learning algorithms based on a large retrospective population cohort of over 230,000 people who were enrolled in the study during 2006-2017. METHODS: We collected demographic, medical, behavioral, and incidence data for type 2 diabetes mellitus (T2DM) in over 236,684 diabetes-free participants recruited from the 45 and Up Study. We predicted and compared the risk of diabetes onset in these participants at 3, 5, 7, and 10 years based on three machine-learning approaches and the conventional regression model. RESULTS: Overall, 6.05% (14,313/236,684) of the participants developed T2DM during an average 8.8-year follow-up period. The 10-year diabetes incidence in men was 8.30% (8.08%-8.49%), which was significantly higher (odds ratio 1.37, 95% CI 1.32-1.41) than that in women at 6.20% (6.00%-6.40%). The incidence of T2DM was doubled in individuals with obesity (men: 17.78% [17.05%-18.43%]; women: 14.59% [13.99%-15.17%]) compared with that of nonobese individuals. The gradient boosting machine model showed the best performance among the four models (area under the curve of 79% in 3-year prediction and 75% in 10-year prediction). All machine-learning models predicted BMI as the most significant factor contributing to diabetes onset, which explained 12%-50% of the variance in the prediction of diabetes. The model predicted that if BMI in obese and overweight participants could be hypothetically reduced to a healthy range, the 10-year probability of diabetes onset would be significantly reduced from 8.3% to 2.8% (P<.001). CONCLUSIONS: A one-time self-reported survey can accurately predict the risk of diabetes using a machine-learning approach. Achieving a healthy BMI can significantly reduce the risk of developing T2DM. |
format | Online Article Text |
id | pubmed-7420582 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-74205822020-08-20 Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study Zhang, Lei Shang, Xianwen Sreedharan, Subhashaan Yan, Xixi Liu, Jianbin Keel, Stuart Wu, Jinrong Peng, Wei He, Mingguang JMIR Med Inform Original Paper BACKGROUND: Previous conventional models for the prediction of diabetes could be updated by incorporating the increasing amount of health data available and new risk prediction methodology. OBJECTIVE: We aimed to develop a substantially improved diabetes risk prediction model using sophisticated machine-learning algorithms based on a large retrospective population cohort of over 230,000 people who were enrolled in the study during 2006-2017. METHODS: We collected demographic, medical, behavioral, and incidence data for type 2 diabetes mellitus (T2DM) in over 236,684 diabetes-free participants recruited from the 45 and Up Study. We predicted and compared the risk of diabetes onset in these participants at 3, 5, 7, and 10 years based on three machine-learning approaches and the conventional regression model. RESULTS: Overall, 6.05% (14,313/236,684) of the participants developed T2DM during an average 8.8-year follow-up period. The 10-year diabetes incidence in men was 8.30% (8.08%-8.49%), which was significantly higher (odds ratio 1.37, 95% CI 1.32-1.41) than that in women at 6.20% (6.00%-6.40%). The incidence of T2DM was doubled in individuals with obesity (men: 17.78% [17.05%-18.43%]; women: 14.59% [13.99%-15.17%]) compared with that of nonobese individuals. The gradient boosting machine model showed the best performance among the four models (area under the curve of 79% in 3-year prediction and 75% in 10-year prediction). All machine-learning models predicted BMI as the most significant factor contributing to diabetes onset, which explained 12%-50% of the variance in the prediction of diabetes. The model predicted that if BMI in obese and overweight participants could be hypothetically reduced to a healthy range, the 10-year probability of diabetes onset would be significantly reduced from 8.3% to 2.8% (P<.001). CONCLUSIONS: A one-time self-reported survey can accurately predict the risk of diabetes using a machine-learning approach. Achieving a healthy BMI can significantly reduce the risk of developing T2DM. JMIR Publications 2020-07-28 /pmc/articles/PMC7420582/ /pubmed/32720912 http://dx.doi.org/10.2196/16850 Text en ©Lei Zhang, Xianwen Shang, Subhashaan Sreedharan, Xixi Yan, Jianbin Liu, Stuart Keel, Jinrong Wu, Wei Peng, Mingguang He. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 28.07.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Zhang, Lei Shang, Xianwen Sreedharan, Subhashaan Yan, Xixi Liu, Jianbin Keel, Stuart Wu, Jinrong Peng, Wei He, Mingguang Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study |
title | Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study |
title_full | Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study |
title_fullStr | Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study |
title_full_unstemmed | Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study |
title_short | Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study |
title_sort | predicting the development of type 2 diabetes in a large australian cohort using machine-learning techniques: longitudinal survey study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7420582/ https://www.ncbi.nlm.nih.gov/pubmed/32720912 http://dx.doi.org/10.2196/16850 |
work_keys_str_mv | AT zhanglei predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy AT shangxianwen predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy AT sreedharansubhashaan predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy AT yanxixi predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy AT liujianbin predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy AT keelstuart predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy AT wujinrong predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy AT pengwei predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy AT hemingguang predictingthedevelopmentoftype2diabetesinalargeaustraliancohortusingmachinelearningtechniqueslongitudinalsurveystudy |