Cargando…

Development of Various Diabetes Prediction Models Using Machine Learning Techniques

BACKGROUND: There are many models for predicting diabetes mellitus (DM), but their clinical implication remains vague. Therefore, we aimed to create various DM prediction models using easily accessible health screening test parameters. METHODS: Two sets of variables were used to develop eight DM pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Shin, Juyoung, Kim, Jaewon, Lee, Chanjung, Yoon, Joon Young, Kim, Seyeon, Song, Seungjae, Kim, Hun-Sung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Diabetes Association 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353566/
https://www.ncbi.nlm.nih.gov/pubmed/35272434
http://dx.doi.org/10.4093/dmj.2021.0115
_version_ 1784762892501909504
author Shin, Juyoung
Kim, Jaewon
Lee, Chanjung
Yoon, Joon Young
Kim, Seyeon
Song, Seungjae
Kim, Hun-Sung
author_facet Shin, Juyoung
Kim, Jaewon
Lee, Chanjung
Yoon, Joon Young
Kim, Seyeon
Song, Seungjae
Kim, Hun-Sung
author_sort Shin, Juyoung
collection PubMed
description BACKGROUND: There are many models for predicting diabetes mellitus (DM), but their clinical implication remains vague. Therefore, we aimed to create various DM prediction models using easily accessible health screening test parameters. METHODS: Two sets of variables were used to develop eight DM prediction models. One set comprised 62 easily accessible examination results of commonly used variables from a tertiary university hospital. The second set comprised 27 of the 62 variables included in the national routine health checkups. Gradient boosting and random forest algorithms were used to develop the models. Internal validation was performed using the stratified 10-fold cross-validation method. RESULTS: The area under the receiver operating characteristic curve (ROC-AUC) for the 62-variable DM model making 12-month predictions for subjects without diabetes was the largest (0.928) among those of the eight DM prediction models. The ROC-AUC dropped by more than 0.04 when training with the simplified 27-variable set but still showed fairly good performance with ROC-AUCs between 0.842 and 0.880. The accuracy was up to 11.5% higher (from 0.807 to 0.714) when fasting glucose was included. CONCLUSION: We created easily applicable diabetes prediction models that deliver good performance using parameters commonly assessed during tertiary university hospital and national routine health checkups. We plan to perform prospective external validation, hoping that the developed DM prediction models will be widely used in clinical practice.
format Online
Article
Text
id pubmed-9353566
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Korean Diabetes Association
record_format MEDLINE/PubMed
spelling pubmed-93535662022-08-11 Development of Various Diabetes Prediction Models Using Machine Learning Techniques Shin, Juyoung Kim, Jaewon Lee, Chanjung Yoon, Joon Young Kim, Seyeon Song, Seungjae Kim, Hun-Sung Diabetes Metab J Original Article BACKGROUND: There are many models for predicting diabetes mellitus (DM), but their clinical implication remains vague. Therefore, we aimed to create various DM prediction models using easily accessible health screening test parameters. METHODS: Two sets of variables were used to develop eight DM prediction models. One set comprised 62 easily accessible examination results of commonly used variables from a tertiary university hospital. The second set comprised 27 of the 62 variables included in the national routine health checkups. Gradient boosting and random forest algorithms were used to develop the models. Internal validation was performed using the stratified 10-fold cross-validation method. RESULTS: The area under the receiver operating characteristic curve (ROC-AUC) for the 62-variable DM model making 12-month predictions for subjects without diabetes was the largest (0.928) among those of the eight DM prediction models. The ROC-AUC dropped by more than 0.04 when training with the simplified 27-variable set but still showed fairly good performance with ROC-AUCs between 0.842 and 0.880. The accuracy was up to 11.5% higher (from 0.807 to 0.714) when fasting glucose was included. CONCLUSION: We created easily applicable diabetes prediction models that deliver good performance using parameters commonly assessed during tertiary university hospital and national routine health checkups. We plan to perform prospective external validation, hoping that the developed DM prediction models will be widely used in clinical practice. Korean Diabetes Association 2022-07 2022-03-11 /pmc/articles/PMC9353566/ /pubmed/35272434 http://dx.doi.org/10.4093/dmj.2021.0115 Text en Copyright © 2022 Korean Diabetes Association https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Shin, Juyoung
Kim, Jaewon
Lee, Chanjung
Yoon, Joon Young
Kim, Seyeon
Song, Seungjae
Kim, Hun-Sung
Development of Various Diabetes Prediction Models Using Machine Learning Techniques
title Development of Various Diabetes Prediction Models Using Machine Learning Techniques
title_full Development of Various Diabetes Prediction Models Using Machine Learning Techniques
title_fullStr Development of Various Diabetes Prediction Models Using Machine Learning Techniques
title_full_unstemmed Development of Various Diabetes Prediction Models Using Machine Learning Techniques
title_short Development of Various Diabetes Prediction Models Using Machine Learning Techniques
title_sort development of various diabetes prediction models using machine learning techniques
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353566/
https://www.ncbi.nlm.nih.gov/pubmed/35272434
http://dx.doi.org/10.4093/dmj.2021.0115
work_keys_str_mv AT shinjuyoung developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques
AT kimjaewon developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques
AT leechanjung developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques
AT yoonjoonyoung developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques
AT kimseyeon developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques
AT songseungjae developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques
AT kimhunsung developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques