Cargando…

Application of three statistical models for predicting the risk of diabetes

BACKGROUND: At present, the proportion of undiagnosed diabetes in Chinese adults is as high as 15.5%. People with diabetes who are not treated and controlled in time may have various complications, such as cardiovascular and cerebrovascular diseases and diabetic foot disorders, which not only seriou...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Siyu, Gao, Yue, Shen, Yuhang, Zhang, Min, Li, Jingjing, Sun, Pinghui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6878628/
https://www.ncbi.nlm.nih.gov/pubmed/31771577
http://dx.doi.org/10.1186/s12902-019-0456-2
_version_ 1783473486851735552
author Liu, Siyu
Gao, Yue
Shen, Yuhang
Zhang, Min
Li, Jingjing
Sun, Pinghui
author_facet Liu, Siyu
Gao, Yue
Shen, Yuhang
Zhang, Min
Li, Jingjing
Sun, Pinghui
author_sort Liu, Siyu
collection PubMed
description BACKGROUND: At present, the proportion of undiagnosed diabetes in Chinese adults is as high as 15.5%. People with diabetes who are not treated and controlled in time may have various complications, such as cardiovascular and cerebrovascular diseases and diabetic foot disorders, which not only seriously affect the quality of life of people with diabetes but also impose a heavy burden on families and society. Therefore, prevention and control of type 2 diabetes is of great significance. METHODS: We constructed a logistic regression model, a neural network model and a decision tree model to analyse the risk factors for type 2 diabetes and then compared the prediction accuracy of the different models by calculating the area under the relative operating characteristic (ROC) curve and back-inputting the data into the model. RESULTS: The prevalence of type 2 diabetes in 4177 subjects who were not diagnosed with type 2 diabetes was 9.31%. The most influential factors associated with type 2 diabetes were triglyceride (TG) ≥ 1.17 mmol/L (odds ratio (OR) =2.233), age ≥ 70 years (OR = 1.734), hypertension (OR = 1.703), alcohol consumption (OR = 1.674), and total cholesterol≥5.2 mmol/L (TC) (OR = 1.463). The prediction accuracies of the three prediction models were 90.8, 91.2, and 90.7%, respectively, and the areas under curve (AUCs) were 0.711, 0.780, and 0.698, respectively. The differences in the AUCs after back propagation (BP) of the neural network model, logistic regression model and decision tree model were statistically significant (P < 0.05). CONCLUSION: BP neural networks have a higher predictive power for identifying the associated risk factors of type 2 diabetes than the other two models, but it is necessary to select a suitable model for specific situations.
format Online
Article
Text
id pubmed-6878628
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68786282019-11-29 Application of three statistical models for predicting the risk of diabetes Liu, Siyu Gao, Yue Shen, Yuhang Zhang, Min Li, Jingjing Sun, Pinghui BMC Endocr Disord Research Article BACKGROUND: At present, the proportion of undiagnosed diabetes in Chinese adults is as high as 15.5%. People with diabetes who are not treated and controlled in time may have various complications, such as cardiovascular and cerebrovascular diseases and diabetic foot disorders, which not only seriously affect the quality of life of people with diabetes but also impose a heavy burden on families and society. Therefore, prevention and control of type 2 diabetes is of great significance. METHODS: We constructed a logistic regression model, a neural network model and a decision tree model to analyse the risk factors for type 2 diabetes and then compared the prediction accuracy of the different models by calculating the area under the relative operating characteristic (ROC) curve and back-inputting the data into the model. RESULTS: The prevalence of type 2 diabetes in 4177 subjects who were not diagnosed with type 2 diabetes was 9.31%. The most influential factors associated with type 2 diabetes were triglyceride (TG) ≥ 1.17 mmol/L (odds ratio (OR) =2.233), age ≥ 70 years (OR = 1.734), hypertension (OR = 1.703), alcohol consumption (OR = 1.674), and total cholesterol≥5.2 mmol/L (TC) (OR = 1.463). The prediction accuracies of the three prediction models were 90.8, 91.2, and 90.7%, respectively, and the areas under curve (AUCs) were 0.711, 0.780, and 0.698, respectively. The differences in the AUCs after back propagation (BP) of the neural network model, logistic regression model and decision tree model were statistically significant (P < 0.05). CONCLUSION: BP neural networks have a higher predictive power for identifying the associated risk factors of type 2 diabetes than the other two models, but it is necessary to select a suitable model for specific situations. BioMed Central 2019-11-26 /pmc/articles/PMC6878628/ /pubmed/31771577 http://dx.doi.org/10.1186/s12902-019-0456-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Liu, Siyu
Gao, Yue
Shen, Yuhang
Zhang, Min
Li, Jingjing
Sun, Pinghui
Application of three statistical models for predicting the risk of diabetes
title Application of three statistical models for predicting the risk of diabetes
title_full Application of three statistical models for predicting the risk of diabetes
title_fullStr Application of three statistical models for predicting the risk of diabetes
title_full_unstemmed Application of three statistical models for predicting the risk of diabetes
title_short Application of three statistical models for predicting the risk of diabetes
title_sort application of three statistical models for predicting the risk of diabetes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6878628/
https://www.ncbi.nlm.nih.gov/pubmed/31771577
http://dx.doi.org/10.1186/s12902-019-0456-2
work_keys_str_mv AT liusiyu applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes
AT gaoyue applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes
AT shenyuhang applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes
AT zhangmin applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes
AT lijingjing applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes
AT sunpinghui applicationofthreestatisticalmodelsforpredictingtheriskofdiabetes