
Detection of diabetic patients in people with normal fasting glucose using machine learning

BACKGROUND: Diabetes mellitus (DM) is a chronic metabolic disease that could produce severe complications threatening life. Its early detection is thus quite important for the timely prevention and treatment. Normally, fasting blood glucose (FBG) by physical examination is used for large-scale scree...


Bibliographic Details
Main Authors: Lv, Kun, Cui, Chunmei, Fan, Rui, Zha, Xiaojuan, Wang, Pengyu, Zhang, Jun, Zhang, Lina, Ke, Jing, Zhao, Dong, Cui, Qinghua, Yang, Liming
Format: Online Article Text
Language: English
Published: BioMed Central 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483877/
https://www.ncbi.nlm.nih.gov/pubmed/37674168
http://dx.doi.org/10.1186/s12916-023-03045-9
_version_ 1785102480615407616
author Lv, Kun
Cui, Chunmei
Fan, Rui
Zha, Xiaojuan
Wang, Pengyu
Zhang, Jun
Zhang, Lina
Ke, Jing
Zhao, Dong
Cui, Qinghua
Yang, Liming
author_sort Lv, Kun
collection PubMed
description BACKGROUND: Diabetes mellitus (DM) is a chronic metabolic disease that can cause severe, life-threatening complications, so early detection is important for timely prevention and treatment. Fasting blood glucose (FBG) measured at physical examination is normally used for large-scale screening of DM; however, some people with normal fasting glucose (NFG) already have diabetes and are missed by the examination. This study investigated whether common physical examination indexes can be used to identify individuals with diabetes among populations with NFG. METHODS: Physical examination data from over 60,000 individuals with NFG in three Chinese cohorts were used. Diabetes was defined as HbA1c ≥ 48 mmol/mol (6.5%). Models were constructed using multiple machine learning methods, including logistic regression, random forest, deep neural network, and support vector machine, and the best-performing one was selected on the validation set. A framework based on the permutation feature importance algorithm was devised to discover personalized risk factors. RESULTS: The logistic regression model performed best, with an AUC, sensitivity, and specificity of 0.899, 85.0%, and 81.1% on the validation set and 0.872, 77.9%, and 81.0% on the test set, respectively. After feature selection, the final classifier, which requires only 13 features and is named DRING (diabetes risk of individuals with normal fasting glucose), performed reliably on two newly recruited independent datasets, with AUCs of 0.964 and 0.899, balanced accuracies of 84.2% and 81.1%, sensitivities of 100% and 76.2%, and specificities of 68.3% and 86.0%, respectively. The feature importance ranking showed that BMI, age, sex, absolute lymphocyte count, and mean corpuscular volume are important factors for the risk stratification of diabetes. In a case study, the framework for identifying personalized risk factors revealed FBG, age, and BMI as significant hazard factors that contribute to an increased incidence of diabetes. The DRING webserver is available for ease of application (http://www.cuilab.cn/dring). CONCLUSIONS: DRING performed well at identifying individuals with diabetes among populations with NFG, which could aid early diagnosis and intervention for those who are most likely to be missed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12916-023-03045-9.
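The balanced accuracy figures in the abstract can be cross-checked from the reported sensitivity and specificity, and the HbA1c threshold can be converted between its two units. The sketch below is illustrative only and is not taken from the paper: it assumes the usual definition of balanced accuracy as the mean of sensitivity and specificity, and the standard IFCC-to-NGSP conversion for HbA1c (NGSP % = 0.09148 × IFCC mmol/mol + 2.152). The function and variable names are hypothetical.

```python
def balanced_accuracy(sensitivity: float, specificity: float) -> float:
    """Mean of sensitivity and specificity (both given in percent)."""
    return (sensitivity + specificity) / 2.0


def ifcc_to_ngsp(hba1c_mmol_mol: float) -> float:
    """Convert HbA1c from IFCC units (mmol/mol) to NGSP units (%).

    Assumption: the standard linear conversion, not a formula from the paper.
    """
    return 0.09148 * hba1c_mmol_mol + 2.152


# (sensitivity %, specificity %) reported for the two independent datasets
reported = [(100.0, 68.3), (76.2, 86.0)]
balanced = [balanced_accuracy(se, sp) for se, sp in reported]

# Diagnostic threshold used in the abstract: HbA1c >= 48 mmol/mol
threshold_percent = ifcc_to_ngsp(48)

if __name__ == "__main__":
    print(balanced)
    print(threshold_percent)
```

Under these assumptions the two values come out at about 84.15 and 81.1, consistent with the 84.2% and 81.1% reported in the abstract, and 48 mmol/mol converts to roughly 6.54%, which matches the 6.5% given alongside it.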
format Online
Article
Text
id pubmed-10483877
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104838772023-09-08 Detection of diabetic patients in people with normal fasting glucose using machine learning Lv, Kun Cui, Chunmei Fan, Rui Zha, Xiaojuan Wang, Pengyu Zhang, Jun Zhang, Lina Ke, Jing Zhao, Dong Cui, Qinghua Yang, Liming BMC Med Research Article BACKGROUND: Diabetes mellitus (DM) is a chronic metabolic disease that could produce severe complications threatening life. Its early detection is thus quite important for the timely prevention and treatment. Normally, fasting blood glucose (FBG) by physical examination is used for large-scale screening of DM; however, some people with normal fasting glucose (NFG) actually have suffered from diabetes but are missed by the examination. This study aimed to investigate whether common physical examination indexes for diabetes can be used to identify the diabetes individuals from the populations with NFG. METHODS: The physical examination data from over 60,000 individuals with NFG in three Chinese cohorts were used. The diabetes patients were defined by HbA1c ≥ 48 mmol/mol (6.5%). We constructed the models using multiple machine learning methods, including logistic regression, random forest, deep neural network, and support vector machine, and selected the optimal one on the validation set. A framework using permutation feature importance algorithm was devised to discover the personalized risk factors. RESULTS: The prediction model constructed by logistic regression achieved the best performance with an AUC, sensitivity, and specificity of 0.899, 85.0%, and 81.1% on the validation set and 0.872, 77.9%, and 81.0% on the test set, respectively. 
Following feature selection, the final classifier only requiring 13 features, named as DRING (diabetes risk of individuals with normal fasting glucose), exhibited reliable performance on two newly recruited independent datasets, with the AUC of 0.964 and 0.899, the balanced accuracy of 84.2% and 81.1%, the sensitivity of 100% and 76.2%, and the specificity of 68.3% and 86.0%, respectively. The feature importance ranking analysis revealed that BMI, age, sex, absolute lymphocyte count, and mean corpuscular volume are important factors for the risk stratification of diabetes. With a case, the framework for identifying personalized risk factors revealed FBG, age, and BMI as significant hazard factors that contribute to an increased incidence of diabetes. DRING webserver is available for ease of application (http://www.cuilab.cn/dring). CONCLUSIONS: DRING was demonstrated to perform well on identifying the diabetes individuals among populations with NFG, which could aid in early diagnosis and interventions for those individuals who are most likely missed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12916-023-03045-9. BioMed Central 2023-09-07 /pmc/articles/PMC10483877/ /pubmed/37674168 http://dx.doi.org/10.1186/s12916-023-03045-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. 
If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title Detection of diabetic patients in people with normal fasting glucose using machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483877/
https://www.ncbi.nlm.nih.gov/pubmed/37674168
http://dx.doi.org/10.1186/s12916-023-03045-9