Cargando…

Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy

Gestational diabetes mellitus (GDM) is conventionally confirmed with oral glucose tolerance test (OGTT) in 24 to 28 weeks of gestation, but it is still uncertain whether it can be predicted with secondary use of electronic health records (EHRs) in early pregnancy. To this purpose, the cost-sensitive...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiu, Hang, Yu, Hai-Yan, Wang, Li-Ya, Yao, Qiang, Wu, Si-Nan, Yin, Can, Fu, Bo, Zhu, Xiao-Juan, Zhang, Yan-Long, Xing, Yong, Deng, Jun, Yang, Hao, Lei, Shun-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5703904/
https://www.ncbi.nlm.nih.gov/pubmed/29180800
http://dx.doi.org/10.1038/s41598-017-16665-y
_version_ 1783281770028859392
author Qiu, Hang
Yu, Hai-Yan
Wang, Li-Ya
Yao, Qiang
Wu, Si-Nan
Yin, Can
Fu, Bo
Zhu, Xiao-Juan
Zhang, Yan-Long
Xing, Yong
Deng, Jun
Yang, Hao
Lei, Shun-Dong
author_facet Qiu, Hang
Yu, Hai-Yan
Wang, Li-Ya
Yao, Qiang
Wu, Si-Nan
Yin, Can
Fu, Bo
Zhu, Xiao-Juan
Zhang, Yan-Long
Xing, Yong
Deng, Jun
Yang, Hao
Lei, Shun-Dong
author_sort Qiu, Hang
collection PubMed
description Gestational diabetes mellitus (GDM) is conventionally confirmed with oral glucose tolerance test (OGTT) in 24 to 28 weeks of gestation, but it is still uncertain whether it can be predicted with secondary use of electronic health records (EHRs) in early pregnancy. To this purpose, the cost-sensitive hybrid model (CSHM) and five conventional machine learning methods are used to construct the predictive models, capturing the future risks of GDM in the temporally aggregated EHRs. The experimental data sources from a nested case-control study cohort, containing 33,935 gestational women in West China Second Hospital. After data cleaning, 4,378 cases and 50 attributes are stored and collected for the data set. Through selecting the most feasible method, the cost parameter of CSHM is adapted to deal with imbalance of the dataset. In the experiment, 3940 samples are used for training and the rest 438 samples for testing. Although the accuracy of positive samples is barely acceptable (62.16%), the results suggest that the vast majority (98.4%) of those predicted positive instances are real positives. To our knowledge, this is the first study to apply machine learning models with EHRs to predict GDM, which will facilitate personalized medicine in maternal health management in the future.
format Online
Article
Text
id pubmed-5703904
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-57039042017-11-30 Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy Qiu, Hang Yu, Hai-Yan Wang, Li-Ya Yao, Qiang Wu, Si-Nan Yin, Can Fu, Bo Zhu, Xiao-Juan Zhang, Yan-Long Xing, Yong Deng, Jun Yang, Hao Lei, Shun-Dong Sci Rep Article Gestational diabetes mellitus (GDM) is conventionally confirmed with oral glucose tolerance test (OGTT) in 24 to 28 weeks of gestation, but it is still uncertain whether it can be predicted with secondary use of electronic health records (EHRs) in early pregnancy. To this purpose, the cost-sensitive hybrid model (CSHM) and five conventional machine learning methods are used to construct the predictive models, capturing the future risks of GDM in the temporally aggregated EHRs. The experimental data sources from a nested case-control study cohort, containing 33,935 gestational women in West China Second Hospital. After data cleaning, 4,378 cases and 50 attributes are stored and collected for the data set. Through selecting the most feasible method, the cost parameter of CSHM is adapted to deal with imbalance of the dataset. In the experiment, 3940 samples are used for training and the rest 438 samples for testing. Although the accuracy of positive samples is barely acceptable (62.16%), the results suggest that the vast majority (98.4%) of those predicted positive instances are real positives. To our knowledge, this is the first study to apply machine learning models with EHRs to predict GDM, which will facilitate personalized medicine in maternal health management in the future. Nature Publishing Group UK 2017-11-27 /pmc/articles/PMC5703904/ /pubmed/29180800 http://dx.doi.org/10.1038/s41598-017-16665-y Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Qiu, Hang
Yu, Hai-Yan
Wang, Li-Ya
Yao, Qiang
Wu, Si-Nan
Yin, Can
Fu, Bo
Zhu, Xiao-Juan
Zhang, Yan-Long
Xing, Yong
Deng, Jun
Yang, Hao
Lei, Shun-Dong
Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy
title Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy
title_full Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy
title_fullStr Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy
title_full_unstemmed Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy
title_short Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy
title_sort electronic health record driven prediction for gestational diabetes mellitus in early pregnancy
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5703904/
https://www.ncbi.nlm.nih.gov/pubmed/29180800
http://dx.doi.org/10.1038/s41598-017-16665-y
work_keys_str_mv AT qiuhang electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT yuhaiyan electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT wangliya electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT yaoqiang electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT wusinan electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT yincan electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT fubo electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT zhuxiaojuan electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT zhangyanlong electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT xingyong electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT dengjun electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT yanghao electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy
AT leishundong electronichealthrecorddrivenpredictionforgestationaldiabetesmellitusinearlypregnancy