Cargando…
Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy
Gestational diabetes mellitus (GDM) is conventionally confirmed with oral glucose tolerance test (OGTT) in 24 to 28 weeks of gestation, but it is still uncertain whether it can be predicted with secondary use of electronic health records (EHRs) in early pregnancy. To this purpose, the cost-sensitive...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5703904/ https://www.ncbi.nlm.nih.gov/pubmed/29180800 http://dx.doi.org/10.1038/s41598-017-16665-y |
_version_ | 1783281770028859392 |
---|---|
author | Qiu, Hang Yu, Hai-Yan Wang, Li-Ya Yao, Qiang Wu, Si-Nan Yin, Can Fu, Bo Zhu, Xiao-Juan Zhang, Yan-Long Xing, Yong Deng, Jun Yang, Hao Lei, Shun-Dong |
author_facet | Qiu, Hang Yu, Hai-Yan Wang, Li-Ya Yao, Qiang Wu, Si-Nan Yin, Can Fu, Bo Zhu, Xiao-Juan Zhang, Yan-Long Xing, Yong Deng, Jun Yang, Hao Lei, Shun-Dong |
author_sort | Qiu, Hang |
collection | PubMed |
description | Gestational diabetes mellitus (GDM) is conventionally confirmed with oral glucose tolerance test (OGTT) in 24 to 28 weeks of gestation, but it is still uncertain whether it can be predicted with secondary use of electronic health records (EHRs) in early pregnancy. To this purpose, the cost-sensitive hybrid model (CSHM) and five conventional machine learning methods are used to construct the predictive models, capturing the future risks of GDM in the temporally aggregated EHRs. The experimental data sources from a nested case-control study cohort, containing 33,935 gestational women in West China Second Hospital. After data cleaning, 4,378 cases and 50 attributes are stored and collected for the data set. Through selecting the most feasible method, the cost parameter of CSHM is adapted to deal with imbalance of the dataset. In the experiment, 3940 samples are used for training and the rest 438 samples for testing. Although the accuracy of positive samples is barely acceptable (62.16%), the results suggest that the vast majority (98.4%) of those predicted positive instances are real positives. To our knowledge, this is the first study to apply machine learning models with EHRs to predict GDM, which will facilitate personalized medicine in maternal health management in the future. |
format | Online Article Text |
id | pubmed-5703904 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-57039042017-11-30 Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy Qiu, Hang Yu, Hai-Yan Wang, Li-Ya Yao, Qiang Wu, Si-Nan Yin, Can Fu, Bo Zhu, Xiao-Juan Zhang, Yan-Long Xing, Yong Deng, Jun Yang, Hao Lei, Shun-Dong Sci Rep Article Gestational diabetes mellitus (GDM) is conventionally confirmed with oral glucose tolerance test (OGTT) in 24 to 28 weeks of gestation, but it is still uncertain whether it can be predicted with secondary use of electronic health records (EHRs) in early pregnancy. To this purpose, the cost-sensitive hybrid model (CSHM) and five conventional machine learning methods are used to construct the predictive models, capturing the future risks of GDM in the temporally aggregated EHRs. The experimental data sources from a nested case-control study cohort, containing 33,935 gestational women in West China Second Hospital. After data cleaning, 4,378 cases and 50 attributes are stored and collected for the data set. Through selecting the most feasible method, the cost parameter of CSHM is adapted to deal with imbalance of the dataset. In the experiment, 3940 samples are used for training and the rest 438 samples for testing. Although the accuracy of positive samples is barely acceptable (62.16%), the results suggest that the vast majority (98.4%) of those predicted positive instances are real positives. To our knowledge, this is the first study to apply machine learning models with EHRs to predict GDM, which will facilitate personalized medicine in maternal health management in the future. Nature Publishing Group UK 2017-11-27 /pmc/articles/PMC5703904/ /pubmed/29180800 http://dx.doi.org/10.1038/s41598-017-16665-y Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Qiu, Hang Yu, Hai-Yan Wang, Li-Ya Yao, Qiang Wu, Si-Nan Yin, Can Fu, Bo Zhu, Xiao-Juan Zhang, Yan-Long Xing, Yong Deng, Jun Yang, Hao Lei, Shun-Dong Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy |
title | Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy |
title_full | Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy |
title_fullStr | Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy |
title_full_unstemmed | Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy |
title_short | Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy |
title_sort | electronic health record driven prediction for gestational diabetes mellitus in early pregnancy |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5703904/ https://www.ncbi.nlm.nih.gov/pubmed/29180800 http://dx.doi.org/10.1038/s41598-017-16665-y |
work_keys_str_mv | AT qiuhang electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT yuhaiyan electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT wangliya electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT yaoqiang electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT wusinan electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT yincan electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT fubo electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT zhuxiaojuan electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT zhangyanlong electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT xingyong electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT dengjun electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT yanghao electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy AT leishundong electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy |