Cargando…

Machine learning for predicting diabetes risk in western China adults

OBJECTIVE: Diabetes mellitus is a global epidemic disease. Long-time exposure of patients to hyperglycemia can lead to various type of chronic tissue damage. Early diagnosis of and screening for diabetes are crucial to population health. METHODS: We collected the national physical examination data i...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Lin, Cheng, Yinlin, Ji, Weidong, Liu, Mimi, Hu, Zhensheng, Yang, Yining, Wang, Yushan, Zhou, Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10373320/
https://www.ncbi.nlm.nih.gov/pubmed/37501094
http://dx.doi.org/10.1186/s13098-023-01112-y
_version_ 1785078543173025792
author Li, Lin
Cheng, Yinlin
Ji, Weidong
Liu, Mimi
Hu, Zhensheng
Yang, Yining
Wang, Yushan
Zhou, Yi
author_facet Li, Lin
Cheng, Yinlin
Ji, Weidong
Liu, Mimi
Hu, Zhensheng
Yang, Yining
Wang, Yushan
Zhou, Yi
author_sort Li, Lin
collection PubMed
description OBJECTIVE: Diabetes mellitus is a global epidemic disease. Long-time exposure of patients to hyperglycemia can lead to various type of chronic tissue damage. Early diagnosis of and screening for diabetes are crucial to population health. METHODS: We collected the national physical examination data in Xinjiang, China, in 2020 (a total of more than 4 million people). Three types of physical examination indices were analyzed: questionnaire, routine physical examination and laboratory values. Integrated learning, deep learning and logistic regression methods were used to establish a risk model for type-2 diabetes mellitus. In addition, to improve the convenience and flexibility of the model, a diabetes risk score card was established based on logistic regression to assess the risk of the population. RESULTS: An XGBoost-based risk prediction model outperformed the other five risk assessment algorithms. The AUC of the model was 0.9122. Based on the feature importance ranking map, we found that hypertension, fasting blood glucose, age, coronary heart disease, ethnicity, parental diabetes mellitus, triglycerides, waist circumference, total cholesterol, and body mass index were the most important features of the risk prediction model for type-2 diabetes. CONCLUSIONS: This study established a diabetes risk assessment model based on multiple ethnicities, a large sample and many indices, and classified the diabetes risk of the population, thus providing a new forecast tool for the screening of patients and providing information on diabetes prevention for healthy populations. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13098-023-01112-y.
format Online
Article
Text
id pubmed-10373320
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103733202023-07-28 Machine learning for predicting diabetes risk in western China adults Li, Lin Cheng, Yinlin Ji, Weidong Liu, Mimi Hu, Zhensheng Yang, Yining Wang, Yushan Zhou, Yi Diabetol Metab Syndr Research OBJECTIVE: Diabetes mellitus is a global epidemic disease. Long-time exposure of patients to hyperglycemia can lead to various type of chronic tissue damage. Early diagnosis of and screening for diabetes are crucial to population health. METHODS: We collected the national physical examination data in Xinjiang, China, in 2020 (a total of more than 4 million people). Three types of physical examination indices were analyzed: questionnaire, routine physical examination and laboratory values. Integrated learning, deep learning and logistic regression methods were used to establish a risk model for type-2 diabetes mellitus. In addition, to improve the convenience and flexibility of the model, a diabetes risk score card was established based on logistic regression to assess the risk of the population. RESULTS: An XGBoost-based risk prediction model outperformed the other five risk assessment algorithms. The AUC of the model was 0.9122. Based on the feature importance ranking map, we found that hypertension, fasting blood glucose, age, coronary heart disease, ethnicity, parental diabetes mellitus, triglycerides, waist circumference, total cholesterol, and body mass index were the most important features of the risk prediction model for type-2 diabetes. CONCLUSIONS: This study established a diabetes risk assessment model based on multiple ethnicities, a large sample and many indices, and classified the diabetes risk of the population, thus providing a new forecast tool for the screening of patients and providing information on diabetes prevention for healthy populations. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13098-023-01112-y. BioMed Central 2023-07-27 /pmc/articles/PMC10373320/ /pubmed/37501094 http://dx.doi.org/10.1186/s13098-023-01112-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Li, Lin
Cheng, Yinlin
Ji, Weidong
Liu, Mimi
Hu, Zhensheng
Yang, Yining
Wang, Yushan
Zhou, Yi
Machine learning for predicting diabetes risk in western China adults
title Machine learning for predicting diabetes risk in western China adults
title_full Machine learning for predicting diabetes risk in western China adults
title_fullStr Machine learning for predicting diabetes risk in western China adults
title_full_unstemmed Machine learning for predicting diabetes risk in western China adults
title_short Machine learning for predicting diabetes risk in western China adults
title_sort machine learning for predicting diabetes risk in western china adults
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10373320/
https://www.ncbi.nlm.nih.gov/pubmed/37501094
http://dx.doi.org/10.1186/s13098-023-01112-y
work_keys_str_mv AT lilin machinelearningforpredictingdiabetesriskinwesternchinaadults
AT chengyinlin machinelearningforpredictingdiabetesriskinwesternchinaadults
AT jiweidong machinelearningforpredictingdiabetesriskinwesternchinaadults
AT liumimi machinelearningforpredictingdiabetesriskinwesternchinaadults
AT huzhensheng machinelearningforpredictingdiabetesriskinwesternchinaadults
AT yangyining machinelearningforpredictingdiabetesriskinwesternchinaadults
AT wangyushan machinelearningforpredictingdiabetesriskinwesternchinaadults
AT zhouyi machinelearningforpredictingdiabetesriskinwesternchinaadults