Cargando…
Development of Various Diabetes Prediction Models Using Machine Learning Techniques
BACKGROUND: There are many models for predicting diabetes mellitus (DM), but their clinical implication remains vague. Therefore, we aimed to create various DM prediction models using easily accessible health screening test parameters. METHODS: Two sets of variables were used to develop eight DM pre...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korean Diabetes Association
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353566/ https://www.ncbi.nlm.nih.gov/pubmed/35272434 http://dx.doi.org/10.4093/dmj.2021.0115 |
_version_ | 1784762892501909504 |
---|---|
author | Shin, Juyoung Kim, Jaewon Lee, Chanjung Yoon, Joon Young Kim, Seyeon Song, Seungjae Kim, Hun-Sung |
author_facet | Shin, Juyoung Kim, Jaewon Lee, Chanjung Yoon, Joon Young Kim, Seyeon Song, Seungjae Kim, Hun-Sung |
author_sort | Shin, Juyoung |
collection | PubMed |
description | BACKGROUND: There are many models for predicting diabetes mellitus (DM), but their clinical implication remains vague. Therefore, we aimed to create various DM prediction models using easily accessible health screening test parameters. METHODS: Two sets of variables were used to develop eight DM prediction models. One set comprised 62 easily accessible examination results of commonly used variables from a tertiary university hospital. The second set comprised 27 of the 62 variables included in the national routine health checkups. Gradient boosting and random forest algorithms were used to develop the models. Internal validation was performed using the stratified 10-fold cross-validation method. RESULTS: The area under the receiver operating characteristic curve (ROC-AUC) for the 62-variable DM model making 12-month predictions for subjects without diabetes was the largest (0.928) among those of the eight DM prediction models. The ROC-AUC dropped by more than 0.04 when training with the simplified 27-variable set but still showed fairly good performance with ROC-AUCs between 0.842 and 0.880. The accuracy was up to 11.5% higher (from 0.807 to 0.714) when fasting glucose was included. CONCLUSION: We created easily applicable diabetes prediction models that deliver good performance using parameters commonly assessed during tertiary university hospital and national routine health checkups. We plan to perform prospective external validation, hoping that the developed DM prediction models will be widely used in clinical practice. |
format | Online Article Text |
id | pubmed-9353566 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Korean Diabetes Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-93535662022-08-11 Development of Various Diabetes Prediction Models Using Machine Learning Techniques Shin, Juyoung Kim, Jaewon Lee, Chanjung Yoon, Joon Young Kim, Seyeon Song, Seungjae Kim, Hun-Sung Diabetes Metab J Original Article BACKGROUND: There are many models for predicting diabetes mellitus (DM), but their clinical implication remains vague. Therefore, we aimed to create various DM prediction models using easily accessible health screening test parameters. METHODS: Two sets of variables were used to develop eight DM prediction models. One set comprised 62 easily accessible examination results of commonly used variables from a tertiary university hospital. The second set comprised 27 of the 62 variables included in the national routine health checkups. Gradient boosting and random forest algorithms were used to develop the models. Internal validation was performed using the stratified 10-fold cross-validation method. RESULTS: The area under the receiver operating characteristic curve (ROC-AUC) for the 62-variable DM model making 12-month predictions for subjects without diabetes was the largest (0.928) among those of the eight DM prediction models. The ROC-AUC dropped by more than 0.04 when training with the simplified 27-variable set but still showed fairly good performance with ROC-AUCs between 0.842 and 0.880. The accuracy was up to 11.5% higher (from 0.807 to 0.714) when fasting glucose was included. CONCLUSION: We created easily applicable diabetes prediction models that deliver good performance using parameters commonly assessed during tertiary university hospital and national routine health checkups. We plan to perform prospective external validation, hoping that the developed DM prediction models will be widely used in clinical practice. Korean Diabetes Association 2022-07 2022-03-11 /pmc/articles/PMC9353566/ /pubmed/35272434 http://dx.doi.org/10.4093/dmj.2021.0115 Text en Copyright © 2022 Korean Diabetes Association https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Shin, Juyoung Kim, Jaewon Lee, Chanjung Yoon, Joon Young Kim, Seyeon Song, Seungjae Kim, Hun-Sung Development of Various Diabetes Prediction Models Using Machine Learning Techniques |
title | Development of Various Diabetes Prediction Models Using Machine Learning Techniques |
title_full | Development of Various Diabetes Prediction Models Using Machine Learning Techniques |
title_fullStr | Development of Various Diabetes Prediction Models Using Machine Learning Techniques |
title_full_unstemmed | Development of Various Diabetes Prediction Models Using Machine Learning Techniques |
title_short | Development of Various Diabetes Prediction Models Using Machine Learning Techniques |
title_sort | development of various diabetes prediction models using machine learning techniques |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353566/ https://www.ncbi.nlm.nih.gov/pubmed/35272434 http://dx.doi.org/10.4093/dmj.2021.0115 |
work_keys_str_mv | AT shinjuyoung developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques AT kimjaewon developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques AT leechanjung developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques AT yoonjoonyoung developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques AT kimseyeon developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques AT songseungjae developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques AT kimhunsung developmentofvariousdiabetespredictionmodelsusingmachinelearningtechniques |