Cargando…
Application of three statistical models for predicting the risk of diabetes
BACKGROUND: At present, the proportion of undiagnosed diabetes in Chinese adults is as high as 15.5%. People with diabetes who are not treated and controlled in time may have various complications, such as cardiovascular and cerebrovascular diseases and diabetic foot disorders, which not only seriou...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6878628/ https://www.ncbi.nlm.nih.gov/pubmed/31771577 http://dx.doi.org/10.1186/s12902-019-0456-2 |
_version_ | 1783473486851735552 |
---|---|
author | Liu, Siyu Gao, Yue Shen, Yuhang Zhang, Min Li, Jingjing Sun, Pinghui |
author_facet | Liu, Siyu Gao, Yue Shen, Yuhang Zhang, Min Li, Jingjing Sun, Pinghui |
author_sort | Liu, Siyu |
collection | PubMed |
description | BACKGROUND: At present, the proportion of undiagnosed diabetes in Chinese adults is as high as 15.5%. People with diabetes who are not treated and controlled in time may have various complications, such as cardiovascular and cerebrovascular diseases and diabetic foot disorders, which not only seriously affect the quality of life of people with diabetes but also impose a heavy burden on families and society. Therefore, prevention and control of type 2 diabetes is of great significance. METHODS: We constructed a logistic regression model, a neural network model and a decision tree model to analyse the risk factors for type 2 diabetes and then compared the prediction accuracy of the different models by calculating the area under the relative operating characteristic (ROC) curve and back-inputting the data into the model. RESULTS: The prevalence of type 2 diabetes in 4177 subjects who were not diagnosed with type 2 diabetes was 9.31%. The most influential factors associated with type 2 diabetes were triglyceride (TG) ≥ 1.17 mmol/L (odds ratio (OR) =2.233), age ≥ 70 years (OR = 1.734), hypertension (OR = 1.703), alcohol consumption (OR = 1.674), and total cholesterol≥5.2 mmol/L (TC) (OR = 1.463). The prediction accuracies of the three prediction models were 90.8, 91.2, and 90.7%, respectively, and the areas under curve (AUCs) were 0.711, 0.780, and 0.698, respectively. The differences in the AUCs after back propagation (BP) of the neural network model, logistic regression model and decision tree model were statistically significant (P < 0.05). CONCLUSION: BP neural networks have a higher predictive power for identifying the associated risk factors of type 2 diabetes than the other two models, but it is necessary to select a suitable model for specific situations. |
format | Online Article Text |
id | pubmed-6878628 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68786282019-11-29 Application of three statistical models for predicting the risk of diabetes Liu, Siyu Gao, Yue Shen, Yuhang Zhang, Min Li, Jingjing Sun, Pinghui BMC Endocr Disord Research Article BACKGROUND: At present, the proportion of undiagnosed diabetes in Chinese adults is as high as 15.5%. People with diabetes who are not treated and controlled in time may have various complications, such as cardiovascular and cerebrovascular diseases and diabetic foot disorders, which not only seriously affect the quality of life of people with diabetes but also impose a heavy burden on families and society. Therefore, prevention and control of type 2 diabetes is of great significance. METHODS: We constructed a logistic regression model, a neural network model and a decision tree model to analyse the risk factors for type 2 diabetes and then compared the prediction accuracy of the different models by calculating the area under the relative operating characteristic (ROC) curve and back-inputting the data into the model. RESULTS: The prevalence of type 2 diabetes in 4177 subjects who were not diagnosed with type 2 diabetes was 9.31%. The most influential factors associated with type 2 diabetes were triglyceride (TG) ≥ 1.17 mmol/L (odds ratio (OR) =2.233), age ≥ 70 years (OR = 1.734), hypertension (OR = 1.703), alcohol consumption (OR = 1.674), and total cholesterol≥5.2 mmol/L (TC) (OR = 1.463). The prediction accuracies of the three prediction models were 90.8, 91.2, and 90.7%, respectively, and the areas under curve (AUCs) were 0.711, 0.780, and 0.698, respectively. The differences in the AUCs after back propagation (BP) of the neural network model, logistic regression model and decision tree model were statistically significant (P < 0.05). CONCLUSION: BP neural networks have a higher predictive power for identifying the associated risk factors of type 2 diabetes than the other two models, but it is necessary to select a suitable model for specific situations. BioMed Central 2019-11-26 /pmc/articles/PMC6878628/ /pubmed/31771577 http://dx.doi.org/10.1186/s12902-019-0456-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Liu, Siyu Gao, Yue Shen, Yuhang Zhang, Min Li, Jingjing Sun, Pinghui Application of three statistical models for predicting the risk of diabetes |
title | Application of three statistical models for predicting the risk of diabetes |
title_full | Application of three statistical models for predicting the risk of diabetes |
title_fullStr | Application of three statistical models for predicting the risk of diabetes |
title_full_unstemmed | Application of three statistical models for predicting the risk of diabetes |
title_short | Application of three statistical models for predicting the risk of diabetes |
title_sort | application of three statistical models for predicting the risk of diabetes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6878628/ https://www.ncbi.nlm.nih.gov/pubmed/31771577 http://dx.doi.org/10.1186/s12902-019-0456-2 |
work_keys_str_mv | AT liusiyu applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes AT gaoyue applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes AT shenyuhang applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes AT zhangmin applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes AT lijingjing applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes AT sunpinghui applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes |