Cargando…
Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms
This study developed a machine learning algorithm to predict gestational diabetes mellitus (GDM) using retrospective data from 34,387 pregnancies in multi-centers of South Korea. Variables were collected at baseline, E0 (until 10 weeks’ gestation), E1 (11–13 weeks’ gestation) and M1 (14–24 weeks’ ge...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10432552/ https://www.ncbi.nlm.nih.gov/pubmed/37587201 http://dx.doi.org/10.1038/s41598-023-39680-8 |
_version_ | 1785091442166726656 |
---|---|
author | Kang, Byung Soo Lee, Seon Ui Hong, Subeen Choi, Sae Kyung Shin, Jae Eun Wie, Jeong Ha Jo, Yun Sung Kim, Yeon Hee Kil, Kicheol Chung, Yoo Hyun Jung, Kyunghoon Hong, Hanul Park, In Yang Ko, Hyun Sun |
author_facet | Kang, Byung Soo Lee, Seon Ui Hong, Subeen Choi, Sae Kyung Shin, Jae Eun Wie, Jeong Ha Jo, Yun Sung Kim, Yeon Hee Kil, Kicheol Chung, Yoo Hyun Jung, Kyunghoon Hong, Hanul Park, In Yang Ko, Hyun Sun |
author_sort | Kang, Byung Soo |
collection | PubMed |
description | This study developed a machine learning algorithm to predict gestational diabetes mellitus (GDM) using retrospective data from 34,387 pregnancies in multi-centers of South Korea. Variables were collected at baseline, E0 (until 10 weeks’ gestation), E1 (11–13 weeks’ gestation) and M1 (14–24 weeks’ gestation). The data set was randomly divided into training and test sets (7:3 ratio) to compare the performances of light gradient boosting machine (LGBM) and extreme gradient boosting (XGBoost) algorithms, with a full set of variables (original). A prediction model with the whole cohort achieved area under the receiver operating characteristics curve (AUC) and area under the precision-recall curve (AUPR) values of 0.711 and 0.246 at baseline, 0.720 and 0.256 at E0, 0.721 and 0.262 at E1, and 0.804 and 0.442 at M1, respectively. Then comparison of three models with different variable sets were performed: [a] variables from clinical guidelines; [b] selected variables from Shapley additive explanations (SHAP) values; and [c] Boruta algorithms. Based on model [c] with the least variables and similar or better performance than the other models, simple questionnaires were developed. The combined use of maternal factors and laboratory data could effectively predict individual risk of GDM using a machine learning model. |
format | Online Article Text |
id | pubmed-10432552 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-104325522023-08-18 Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms Kang, Byung Soo Lee, Seon Ui Hong, Subeen Choi, Sae Kyung Shin, Jae Eun Wie, Jeong Ha Jo, Yun Sung Kim, Yeon Hee Kil, Kicheol Chung, Yoo Hyun Jung, Kyunghoon Hong, Hanul Park, In Yang Ko, Hyun Sun Sci Rep Article This study developed a machine learning algorithm to predict gestational diabetes mellitus (GDM) using retrospective data from 34,387 pregnancies in multi-centers of South Korea. Variables were collected at baseline, E0 (until 10 weeks’ gestation), E1 (11–13 weeks’ gestation) and M1 (14–24 weeks’ gestation). The data set was randomly divided into training and test sets (7:3 ratio) to compare the performances of light gradient boosting machine (LGBM) and extreme gradient boosting (XGBoost) algorithms, with a full set of variables (original). A prediction model with the whole cohort achieved area under the receiver operating characteristics curve (AUC) and area under the precision-recall curve (AUPR) values of 0.711 and 0.246 at baseline, 0.720 and 0.256 at E0, 0.721 and 0.262 at E1, and 0.804 and 0.442 at M1, respectively. Then comparison of three models with different variable sets were performed: [a] variables from clinical guidelines; [b] selected variables from Shapley additive explanations (SHAP) values; and [c] Boruta algorithms. Based on model [c] with the least variables and similar or better performance than the other models, simple questionnaires were developed. The combined use of maternal factors and laboratory data could effectively predict individual risk of GDM using a machine learning model. Nature Publishing Group UK 2023-08-16 /pmc/articles/PMC10432552/ /pubmed/37587201 http://dx.doi.org/10.1038/s41598-023-39680-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Kang, Byung Soo Lee, Seon Ui Hong, Subeen Choi, Sae Kyung Shin, Jae Eun Wie, Jeong Ha Jo, Yun Sung Kim, Yeon Hee Kil, Kicheol Chung, Yoo Hyun Jung, Kyunghoon Hong, Hanul Park, In Yang Ko, Hyun Sun Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms |
title | Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms |
title_full | Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms |
title_fullStr | Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms |
title_full_unstemmed | Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms |
title_short | Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms |
title_sort | prediction of gestational diabetes mellitus in asian women using machine learning algorithms |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10432552/ https://www.ncbi.nlm.nih.gov/pubmed/37587201 http://dx.doi.org/10.1038/s41598-023-39680-8 |
work_keys_str_mv | AT kangbyungsoo predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT leeseonui predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT hongsubeen predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT choisaekyung predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT shinjaeeun predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT wiejeongha predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT joyunsung predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT kimyeonhee predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT kilkicheol predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT chungyoohyun predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT jungkyunghoon predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT honghanul predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT parkinyang predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms AT kohyunsun predictionofgestationaldiabetesmellitusinasianwomenusingmachinelearningalgorithms |