Cargando…

Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms

This study developed a machine learning algorithm to predict gestational diabetes mellitus (GDM) using retrospective data from 34,387 pregnancies in multi-centers of South Korea. Variables were collected at baseline, E0 (until 10 weeks’ gestation), E1 (11–13 weeks’ gestation) and M1 (14–24 weeks’ ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Byung Soo, Lee, Seon Ui, Hong, Subeen, Choi, Sae Kyung, Shin, Jae Eun, Wie, Jeong Ha, Jo, Yun Sung, Kim, Yeon Hee, Kil, Kicheol, Chung, Yoo Hyun, Jung, Kyunghoon, Hong, Hanul, Park, In Yang, Ko, Hyun Sun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10432552/
https://www.ncbi.nlm.nih.gov/pubmed/37587201
http://dx.doi.org/10.1038/s41598-023-39680-8
_version_ 1785091442166726656
author Kang, Byung Soo
Lee, Seon Ui
Hong, Subeen
Choi, Sae Kyung
Shin, Jae Eun
Wie, Jeong Ha
Jo, Yun Sung
Kim, Yeon Hee
Kil, Kicheol
Chung, Yoo Hyun
Jung, Kyunghoon
Hong, Hanul
Park, In Yang
Ko, Hyun Sun
author_facet Kang, Byung Soo
Lee, Seon Ui
Hong, Subeen
Choi, Sae Kyung
Shin, Jae Eun
Wie, Jeong Ha
Jo, Yun Sung
Kim, Yeon Hee
Kil, Kicheol
Chung, Yoo Hyun
Jung, Kyunghoon
Hong, Hanul
Park, In Yang
Ko, Hyun Sun
author_sort Kang, Byung Soo
collection PubMed
description This study developed a machine learning algorithm to predict gestational diabetes mellitus (GDM) using retrospective data from 34,387 pregnancies in multi-centers of South Korea. Variables were collected at baseline, E0 (until 10 weeks’ gestation), E1 (11–13 weeks’ gestation) and M1 (14–24 weeks’ gestation). The data set was randomly divided into training and test sets (7:3 ratio) to compare the performances of light gradient boosting machine (LGBM) and extreme gradient boosting (XGBoost) algorithms, with a full set of variables (original). A prediction model with the whole cohort achieved area under the receiver operating characteristics curve (AUC) and area under the precision-recall curve (AUPR) values of 0.711 and 0.246 at baseline, 0.720 and 0.256 at E0, 0.721 and 0.262 at E1, and 0.804 and 0.442 at M1, respectively. Then comparison of three models with different variable sets were performed: [a] variables from clinical guidelines; [b] selected variables from Shapley additive explanations (SHAP) values; and [c] Boruta algorithms. Based on model [c] with the least variables and similar or better performance than the other models, simple questionnaires were developed. The combined use of maternal factors and laboratory data could effectively predict individual risk of GDM using a machine learning model.
format Online
Article
Text
id pubmed-10432552
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104325522023-08-18 Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms Kang, Byung Soo Lee, Seon Ui Hong, Subeen Choi, Sae Kyung Shin, Jae Eun Wie, Jeong Ha Jo, Yun Sung Kim, Yeon Hee Kil, Kicheol Chung, Yoo Hyun Jung, Kyunghoon Hong, Hanul Park, In Yang Ko, Hyun Sun Sci Rep Article This study developed a machine learning algorithm to predict gestational diabetes mellitus (GDM) using retrospective data from 34,387 pregnancies in multi-centers of South Korea. Variables were collected at baseline, E0 (until 10 weeks’ gestation), E1 (11–13 weeks’ gestation) and M1 (14–24 weeks’ gestation). The data set was randomly divided into training and test sets (7:3 ratio) to compare the performances of light gradient boosting machine (LGBM) and extreme gradient boosting (XGBoost) algorithms, with a full set of variables (original). A prediction model with the whole cohort achieved area under the receiver operating characteristics curve (AUC) and area under the precision-recall curve (AUPR) values of 0.711 and 0.246 at baseline, 0.720 and 0.256 at E0, 0.721 and 0.262 at E1, and 0.804 and 0.442 at M1, respectively. Then comparison of three models with different variable sets were performed: [a] variables from clinical guidelines; [b] selected variables from Shapley additive explanations (SHAP) values; and [c] Boruta algorithms. Based on model [c] with the least variables and similar or better performance than the other models, simple questionnaires were developed. The combined use of maternal factors and laboratory data could effectively predict individual risk of GDM using a machine learning model. Nature Publishing Group UK 2023-08-16 /pmc/articles/PMC10432552/ /pubmed/37587201 http://dx.doi.org/10.1038/s41598-023-39680-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Kang, Byung Soo
Lee, Seon Ui
Hong, Subeen
Choi, Sae Kyung
Shin, Jae Eun
Wie, Jeong Ha
Jo, Yun Sung
Kim, Yeon Hee
Kil, Kicheol
Chung, Yoo Hyun
Jung, Kyunghoon
Hong, Hanul
Park, In Yang
Ko, Hyun Sun
Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms
title Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms
title_full Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms
title_fullStr Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms
title_full_unstemmed Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms
title_short Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms
title_sort prediction of gestational diabetes mellitus in asian women using machine learning algorithms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10432552/
https://www.ncbi.nlm.nih.gov/pubmed/37587201
http://dx.doi.org/10.1038/s41598-023-39680-8
work_keys_str_mv AT kangbyungsoo predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT leeseonui predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT hongsubeen predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT choisaekyung predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT shinjaeeun predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT wiejeongha predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT joyunsung predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT kimyeonhee predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT kilkicheol predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT chungyoohyun predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT jungkyunghoon predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT honghanul predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT parkinyang predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms
AT kohyunsun predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms