Cargando…
Machine learning for predicting diabetes risk in western China adults
OBJECTIVE: Diabetes mellitus is a global epidemic disease. Long-time exposure of patients to hyperglycemia can lead to various type of chronic tissue damage. Early diagnosis of and screening for diabetes are crucial to population health. METHODS: We collected the national physical examination data i...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10373320/ https://www.ncbi.nlm.nih.gov/pubmed/37501094 http://dx.doi.org/10.1186/s13098-023-01112-y |
_version_ | 1785078543173025792 |
---|---|
author | Li, Lin Cheng, Yinlin Ji, Weidong Liu, Mimi Hu, Zhensheng Yang, Yining Wang, Yushan Zhou, Yi |
author_facet | Li, Lin Cheng, Yinlin Ji, Weidong Liu, Mimi Hu, Zhensheng Yang, Yining Wang, Yushan Zhou, Yi |
author_sort | Li, Lin |
collection | PubMed |
description | OBJECTIVE: Diabetes mellitus is a global epidemic disease. Long-time exposure of patients to hyperglycemia can lead to various type of chronic tissue damage. Early diagnosis of and screening for diabetes are crucial to population health. METHODS: We collected the national physical examination data in Xinjiang, China, in 2020 (a total of more than 4 million people). Three types of physical examination indices were analyzed: questionnaire, routine physical examination and laboratory values. Integrated learning, deep learning and logistic regression methods were used to establish a risk model for type-2 diabetes mellitus. In addition, to improve the convenience and flexibility of the model, a diabetes risk score card was established based on logistic regression to assess the risk of the population. RESULTS: An XGBoost-based risk prediction model outperformed the other five risk assessment algorithms. The AUC of the model was 0.9122. Based on the feature importance ranking map, we found that hypertension, fasting blood glucose, age, coronary heart disease, ethnicity, parental diabetes mellitus, triglycerides, waist circumference, total cholesterol, and body mass index were the most important features of the risk prediction model for type-2 diabetes. CONCLUSIONS: This study established a diabetes risk assessment model based on multiple ethnicities, a large sample and many indices, and classified the diabetes risk of the population, thus providing a new forecast tool for the screening of patients and providing information on diabetes prevention for healthy populations. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13098-023-01112-y. |
format | Online Article Text |
id | pubmed-10373320 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-103733202023-07-28 Machine learning for predicting diabetes risk in western China adults Li, Lin Cheng, Yinlin Ji, Weidong Liu, Mimi Hu, Zhensheng Yang, Yining Wang, Yushan Zhou, Yi Diabetol Metab Syndr Research OBJECTIVE: Diabetes mellitus is a global epidemic disease. Long-time exposure of patients to hyperglycemia can lead to various type of chronic tissue damage. Early diagnosis of and screening for diabetes are crucial to population health. METHODS: We collected the national physical examination data in Xinjiang, China, in 2020 (a total of more than 4 million people). Three types of physical examination indices were analyzed: questionnaire, routine physical examination and laboratory values. Integrated learning, deep learning and logistic regression methods were used to establish a risk model for type-2 diabetes mellitus. In addition, to improve the convenience and flexibility of the model, a diabetes risk score card was established based on logistic regression to assess the risk of the population. RESULTS: An XGBoost-based risk prediction model outperformed the other five risk assessment algorithms. The AUC of the model was 0.9122. Based on the feature importance ranking map, we found that hypertension, fasting blood glucose, age, coronary heart disease, ethnicity, parental diabetes mellitus, triglycerides, waist circumference, total cholesterol, and body mass index were the most important features of the risk prediction model for type-2 diabetes. CONCLUSIONS: This study established a diabetes risk assessment model based on multiple ethnicities, a large sample and many indices, and classified the diabetes risk of the population, thus providing a new forecast tool for the screening of patients and providing information on diabetes prevention for healthy populations. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13098-023-01112-y. BioMed Central 2023-07-27 /pmc/articles/PMC10373320/ /pubmed/37501094 http://dx.doi.org/10.1186/s13098-023-01112-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Li, Lin Cheng, Yinlin Ji, Weidong Liu, Mimi Hu, Zhensheng Yang, Yining Wang, Yushan Zhou, Yi Machine learning for predicting diabetes risk in western China adults |
title | Machine learning for predicting diabetes risk in western China adults |
title_full | Machine learning for predicting diabetes risk in western China adults |
title_fullStr | Machine learning for predicting diabetes risk in western China adults |
title_full_unstemmed | Machine learning for predicting diabetes risk in western China adults |
title_short | Machine learning for predicting diabetes risk in western China adults |
title_sort | machine learning for predicting diabetes risk in western china adults |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10373320/ https://www.ncbi.nlm.nih.gov/pubmed/37501094 http://dx.doi.org/10.1186/s13098-023-01112-y |
work_keys_str_mv | AT lilin machinelearningforpredictingdiabetesriskinwesternchinaadults AT chengyinlin machinelearningforpredictingdiabetesriskinwesternchinaadults AT jiweidong machinelearningforpredictingdiabetesriskinwesternchinaadults AT liumimi machinelearningforpredictingdiabetesriskinwesternchinaadults AT huzhensheng machinelearningforpredictingdiabetesriskinwesternchinaadults AT yangyining machinelearningforpredictingdiabetesriskinwesternchinaadults AT wangyushan machinelearningforpredictingdiabetesriskinwesternchinaadults AT zhouyi machinelearningforpredictingdiabetesriskinwesternchinaadults |