Cargando…

Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods

We compared the prediction performance of machine learning-based undiagnosed diabetes prediction models with that of traditional statistics-based prediction models. We used the 2014–2020 Korean National Health and Nutrition Examination Survey (KNHANES) (N = 32,827). The KNHANES 2014–2018 data were u...

Descripción completa

Detalles Bibliográficos
Autores principales:	Choi, Seong Gyu, Oh, Minsuk, Park, Dong–Hyuk, Lee, Byeongchan, Lee, Yong-ho, Jee, Sun Ha, Jeon, Justin Y.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10421881/ https://www.ncbi.nlm.nih.gov/pubmed/37567907 http://dx.doi.org/10.1038/s41598-023-40170-0

_version_	1785089071331147776
author	Choi, Seong Gyu Oh, Minsuk Park, Dong–Hyuk Lee, Byeongchan Lee, Yong-ho Jee, Sun Ha Jeon, Justin Y.
author_facet	Choi, Seong Gyu Oh, Minsuk Park, Dong–Hyuk Lee, Byeongchan Lee, Yong-ho Jee, Sun Ha Jeon, Justin Y.
author_sort	Choi, Seong Gyu
collection	PubMed
description	We compared the prediction performance of machine learning-based undiagnosed diabetes prediction models with that of traditional statistics-based prediction models. We used the 2014–2020 Korean National Health and Nutrition Examination Survey (KNHANES) (N = 32,827). The KNHANES 2014–2018 data were used as training and internal validation sets and the 2019–2020 data as external validation sets. The receiver operating characteristic curve area under the curve (AUC) was used to compare the prediction performance of the machine learning-based and the traditional statistics-based prediction models. Using sex, age, resting heart rate, and waist circumference as features, the machine learning-based model showed a higher AUC (0.788 vs. 0.740) than that of the traditional statistical-based prediction model. Using sex, age, waist circumference, family history of diabetes, hypertension, alcohol consumption, and smoking status as features, the machine learning-based prediction model showed a higher AUC (0.802 vs. 0.759) than the traditional statistical-based prediction model. The machine learning-based prediction model using features for maximum prediction performance showed a higher AUC (0.819 vs. 0.765) than the traditional statistical-based prediction model. Machine learning-based prediction models using anthropometric and lifestyle measurements may outperform the traditional statistics-based prediction models in predicting undiagnosed diabetes.
format	Online Article Text
id	pubmed-10421881
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-104218812023-08-13 Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods Choi, Seong Gyu Oh, Minsuk Park, Dong–Hyuk Lee, Byeongchan Lee, Yong-ho Jee, Sun Ha Jeon, Justin Y. Sci Rep Article We compared the prediction performance of machine learning-based undiagnosed diabetes prediction models with that of traditional statistics-based prediction models. We used the 2014–2020 Korean National Health and Nutrition Examination Survey (KNHANES) (N = 32,827). The KNHANES 2014–2018 data were used as training and internal validation sets and the 2019–2020 data as external validation sets. The receiver operating characteristic curve area under the curve (AUC) was used to compare the prediction performance of the machine learning-based and the traditional statistics-based prediction models. Using sex, age, resting heart rate, and waist circumference as features, the machine learning-based model showed a higher AUC (0.788 vs. 0.740) than that of the traditional statistical-based prediction model. Using sex, age, waist circumference, family history of diabetes, hypertension, alcohol consumption, and smoking status as features, the machine learning-based prediction model showed a higher AUC (0.802 vs. 0.759) than the traditional statistical-based prediction model. The machine learning-based prediction model using features for maximum prediction performance showed a higher AUC (0.819 vs. 0.765) than the traditional statistical-based prediction model. Machine learning-based prediction models using anthropometric and lifestyle measurements may outperform the traditional statistics-based prediction models in predicting undiagnosed diabetes. Nature Publishing Group UK 2023-08-11 /pmc/articles/PMC10421881/ /pubmed/37567907 http://dx.doi.org/10.1038/s41598-023-40170-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Choi, Seong Gyu Oh, Minsuk Park, Dong–Hyuk Lee, Byeongchan Lee, Yong-ho Jee, Sun Ha Jeon, Justin Y. Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods
title	Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods
title_full	Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods
title_fullStr	Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods
title_full_unstemmed	Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods
title_short	Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods
title_sort	comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10421881/ https://www.ncbi.nlm.nih.gov/pubmed/37567907 http://dx.doi.org/10.1038/s41598-023-40170-0
work_keys_str_mv	AT choiseonggyu comparisonsofthepredictionmodelsforundiagnoseddiabetesbetweenmachinelearningversustraditionalstatisticalmethods AT ohminsuk comparisonsofthepredictionmodelsforundiagnoseddiabetesbetweenmachinelearningversustraditionalstatisticalmethods AT parkdonghyuk comparisonsofthepredictionmodelsforundiagnoseddiabetesbetweenmachinelearningversustraditionalstatisticalmethods AT leebyeongchan comparisonsofthepredictionmodelsforundiagnoseddiabetesbetweenmachinelearningversustraditionalstatisticalmethods AT leeyongho comparisonsofthepredictionmodelsforundiagnoseddiabetesbetweenmachinelearningversustraditionalstatisticalmethods AT jeesunha comparisonsofthepredictionmodelsforundiagnoseddiabetesbetweenmachinelearningversustraditionalstatisticalmethods AT jeonjustiny comparisonsofthepredictionmodelsforundiagnoseddiabetesbetweenmachinelearningversustraditionalstatisticalmethods

Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods

Ejemplares similares