Cargando…
Detection of diabetic patients in people with normal fasting glucose using machine learning
BACKGROUND: Diabetes mellitus (DM) is a chronic metabolic disease that could produce severe complications threatening life. Its early detection is thus quite important for the timely prevention and treatment. Normally, fasting blood glucose (FBG) by physical examination is used for large-scale scree...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483877/ https://www.ncbi.nlm.nih.gov/pubmed/37674168 http://dx.doi.org/10.1186/s12916-023-03045-9 |
_version_ | 1785102480615407616 |
---|---|
author | Lv, Kun Cui, Chunmei Fan, Rui Zha, Xiaojuan Wang, Pengyu Zhang, Jun Zhang, Lina Ke, Jing Zhao, Dong Cui, Qinghua Yang, Liming |
author_facet | Lv, Kun Cui, Chunmei Fan, Rui Zha, Xiaojuan Wang, Pengyu Zhang, Jun Zhang, Lina Ke, Jing Zhao, Dong Cui, Qinghua Yang, Liming |
author_sort | Lv, Kun |
collection | PubMed |
description | BACKGROUND: Diabetes mellitus (DM) is a chronic metabolic disease that could produce severe complications threatening life. Its early detection is thus quite important for the timely prevention and treatment. Normally, fasting blood glucose (FBG) by physical examination is used for large-scale screening of DM; however, some people with normal fasting glucose (NFG) actually have suffered from diabetes but are missed by the examination. This study aimed to investigate whether common physical examination indexes for diabetes can be used to identify the diabetes individuals from the populations with NFG. METHODS: The physical examination data from over 60,000 individuals with NFG in three Chinese cohorts were used. The diabetes patients were defined by HbA1c ≥ 48 mmol/mol (6.5%). We constructed the models using multiple machine learning methods, including logistic regression, random forest, deep neural network, and support vector machine, and selected the optimal one on the validation set. A framework using permutation feature importance algorithm was devised to discover the personalized risk factors. RESULTS: The prediction model constructed by logistic regression achieved the best performance with an AUC, sensitivity, and specificity of 0.899, 85.0%, and 81.1% on the validation set and 0.872, 77.9%, and 81.0% on the test set, respectively. Following feature selection, the final classifier only requiring 13 features, named as DRING (diabetes risk of individuals with normal fasting glucose), exhibited reliable performance on two newly recruited independent datasets, with the AUC of 0.964 and 0.899, the balanced accuracy of 84.2% and 81.1%, the sensitivity of 100% and 76.2%, and the specificity of 68.3% and 86.0%, respectively. The feature importance ranking analysis revealed that BMI, age, sex, absolute lymphocyte count, and mean corpuscular volume are important factors for the risk stratification of diabetes. With a case, the framework for identifying personalized risk factors revealed FBG, age, and BMI as significant hazard factors that contribute to an increased incidence of diabetes. DRING webserver is available for ease of application (http://www.cuilab.cn/dring). CONCLUSIONS: DRING was demonstrated to perform well on identifying the diabetes individuals among populations with NFG, which could aid in early diagnosis and interventions for those individuals who are most likely missed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12916-023-03045-9. |
format | Online Article Text |
id | pubmed-10483877 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-104838772023-09-08 Detection of diabetic patients in people with normal fasting glucose using machine learning Lv, Kun Cui, Chunmei Fan, Rui Zha, Xiaojuan Wang, Pengyu Zhang, Jun Zhang, Lina Ke, Jing Zhao, Dong Cui, Qinghua Yang, Liming BMC Med Research Article BACKGROUND: Diabetes mellitus (DM) is a chronic metabolic disease that could produce severe complications threatening life. Its early detection is thus quite important for the timely prevention and treatment. Normally, fasting blood glucose (FBG) by physical examination is used for large-scale screening of DM; however, some people with normal fasting glucose (NFG) actually have suffered from diabetes but are missed by the examination. This study aimed to investigate whether common physical examination indexes for diabetes can be used to identify the diabetes individuals from the populations with NFG. METHODS: The physical examination data from over 60,000 individuals with NFG in three Chinese cohorts were used. The diabetes patients were defined by HbA1c ≥ 48 mmol/mol (6.5%). We constructed the models using multiple machine learning methods, including logistic regression, random forest, deep neural network, and support vector machine, and selected the optimal one on the validation set. A framework using permutation feature importance algorithm was devised to discover the personalized risk factors. RESULTS: The prediction model constructed by logistic regression achieved the best performance with an AUC, sensitivity, and specificity of 0.899, 85.0%, and 81.1% on the validation set and 0.872, 77.9%, and 81.0% on the test set, respectively. Following feature selection, the final classifier only requiring 13 features, named as DRING (diabetes risk of individuals with normal fasting glucose), exhibited reliable performance on two newly recruited independent datasets, with the AUC of 0.964 and 0.899, the balanced accuracy of 84.2% and 81.1%, the sensitivity of 100% and 76.2%, and the specificity of 68.3% and 86.0%, respectively. The feature importance ranking analysis revealed that BMI, age, sex, absolute lymphocyte count, and mean corpuscular volume are important factors for the risk stratification of diabetes. With a case, the framework for identifying personalized risk factors revealed FBG, age, and BMI as significant hazard factors that contribute to an increased incidence of diabetes. DRING webserver is available for ease of application (http://www.cuilab.cn/dring). CONCLUSIONS: DRING was demonstrated to perform well on identifying the diabetes individuals among populations with NFG, which could aid in early diagnosis and interventions for those individuals who are most likely missed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12916-023-03045-9. BioMed Central 2023-09-07 /pmc/articles/PMC10483877/ /pubmed/37674168 http://dx.doi.org/10.1186/s12916-023-03045-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Lv, Kun Cui, Chunmei Fan, Rui Zha, Xiaojuan Wang, Pengyu Zhang, Jun Zhang, Lina Ke, Jing Zhao, Dong Cui, Qinghua Yang, Liming Detection of diabetic patients in people with normal fasting glucose using machine learning |
title | Detection of diabetic patients in people with normal fasting glucose using machine learning |
title_full | Detection of diabetic patients in people with normal fasting glucose using machine learning |
title_fullStr | Detection of diabetic patients in people with normal fasting glucose using machine learning |
title_full_unstemmed | Detection of diabetic patients in people with normal fasting glucose using machine learning |
title_short | Detection of diabetic patients in people with normal fasting glucose using machine learning |
title_sort | detection of diabetic patients in people with normal fasting glucose using machine learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483877/ https://www.ncbi.nlm.nih.gov/pubmed/37674168 http://dx.doi.org/10.1186/s12916-023-03045-9 |
work_keys_str_mv | AT lvkun detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT cuichunmei detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT fanrui detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT zhaxiaojuan detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT wangpengyu detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT zhangjun detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT zhanglina detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT kejing detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT zhaodong detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT cuiqinghua detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning AT yangliming detectionofdiabeticpatientsinpeoplewithnormalfastingglucoseusingmachinelearning |