Cargando…
Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases
Objective: Congenital heart diseases (CHDs) are associated with an extremely heavy global disease burden as the most common category of birth defects. Genetic and environmental factors have been identified as risk factors of CHDs previously. However, high volume clinical indicators have never been c...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8777022/ https://www.ncbi.nlm.nih.gov/pubmed/35071361 http://dx.doi.org/10.3389/fcvm.2021.797002 |
_version_ | 1784636967526334464 |
---|---|
author | Qu, Yanji Deng, Xinlei Lin, Shao Han, Fengzhen Chang, Howard H. Ou, Yanqiu Nie, Zhiqiang Mai, Jinzhuang Wang, Ximeng Gao, Xiangmin Wu, Yong Chen, Jimei Zhuang, Jian Ryan, Ian Liu, Xiaoqing |
author_facet | Qu, Yanji Deng, Xinlei Lin, Shao Han, Fengzhen Chang, Howard H. Ou, Yanqiu Nie, Zhiqiang Mai, Jinzhuang Wang, Ximeng Gao, Xiangmin Wu, Yong Chen, Jimei Zhuang, Jian Ryan, Ian Liu, Xiaoqing |
author_sort | Qu, Yanji |
collection | PubMed |
description | Objective: Congenital heart diseases (CHDs) are associated with an extremely heavy global disease burden as the most common category of birth defects. Genetic and environmental factors have been identified as risk factors of CHDs previously. However, high volume clinical indicators have never been considered when predicting CHDs. This study aimed to predict the occurrence of CHDs by considering thousands of variables from self-reported questionnaires and routinely collected clinical laboratory data using machine learning algorithms. Methods: We conducted a birth cohort study at one of the largest cardiac centers in China from 2011 to 2017. All fetuses were screened for CHDs using ultrasound and cases were confirmed by at least two pediatric cardiologists using echocardiogram. A total of 1,127 potential predictors were included to predict CHDs. We used the Explainable Boosting Machine (EBM) for prediction and evaluated the model performance using area under the Receive Operating Characteristics (ROC) curves (AUC). The top predictors were selected according to their contributions and predictive values. Thresholds were calculated for the most significant predictors. Results: Overall, 5,390 mother-child pairs were recruited. Our prediction model achieved an AUC of 76% (69-83%) from out-of-sample predictions. Among the top 35 predictors of CHDs we identified, 34 were from clinical laboratory tests and only one was from the questionnaire (abortion history). Total accuracy, sensitivity, and specificity were 0.65, 0.74, and 0.65, respectively. Maternal serum uric acid (UA), glucose, and coagulation levels were the most consistent and significant predictors of CHDs. According to the thresholds of the predictors identified in our study, which did not reach the current clinical diagnosis criteria, elevated UA (>4.38 mg/dl), shortened activated partial thromboplastin time (<33.33 s), and elevated glucose levels were the most important predictors and were associated with ranges of 1.17-1.54 relative risks of CHDs. We have developed an online predictive tool for CHDs based on our findings that may help screening and prevention of CHDs. Conclusions: Maternal UA, glucose, and coagulation levels were the most consistent and significant predictors of CHDs. Thresholds below the current clinical definition of “abnormal” for these predictors could be used to help develop CHD screening and prevention strategies. |
format | Online Article Text |
id | pubmed-8777022 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-87770222022-01-22 Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases Qu, Yanji Deng, Xinlei Lin, Shao Han, Fengzhen Chang, Howard H. Ou, Yanqiu Nie, Zhiqiang Mai, Jinzhuang Wang, Ximeng Gao, Xiangmin Wu, Yong Chen, Jimei Zhuang, Jian Ryan, Ian Liu, Xiaoqing Front Cardiovasc Med Cardiovascular Medicine Objective: Congenital heart diseases (CHDs) are associated with an extremely heavy global disease burden as the most common category of birth defects. Genetic and environmental factors have been identified as risk factors of CHDs previously. However, high volume clinical indicators have never been considered when predicting CHDs. This study aimed to predict the occurrence of CHDs by considering thousands of variables from self-reported questionnaires and routinely collected clinical laboratory data using machine learning algorithms. Methods: We conducted a birth cohort study at one of the largest cardiac centers in China from 2011 to 2017. All fetuses were screened for CHDs using ultrasound and cases were confirmed by at least two pediatric cardiologists using echocardiogram. A total of 1,127 potential predictors were included to predict CHDs. We used the Explainable Boosting Machine (EBM) for prediction and evaluated the model performance using area under the Receive Operating Characteristics (ROC) curves (AUC). The top predictors were selected according to their contributions and predictive values. Thresholds were calculated for the most significant predictors. Results: Overall, 5,390 mother-child pairs were recruited. Our prediction model achieved an AUC of 76% (69-83%) from out-of-sample predictions. Among the top 35 predictors of CHDs we identified, 34 were from clinical laboratory tests and only one was from the questionnaire (abortion history). Total accuracy, sensitivity, and specificity were 0.65, 0.74, and 0.65, respectively. Maternal serum uric acid (UA), glucose, and coagulation levels were the most consistent and significant predictors of CHDs. According to the thresholds of the predictors identified in our study, which did not reach the current clinical diagnosis criteria, elevated UA (>4.38 mg/dl), shortened activated partial thromboplastin time (<33.33 s), and elevated glucose levels were the most important predictors and were associated with ranges of 1.17-1.54 relative risks of CHDs. We have developed an online predictive tool for CHDs based on our findings that may help screening and prevention of CHDs. Conclusions: Maternal UA, glucose, and coagulation levels were the most consistent and significant predictors of CHDs. Thresholds below the current clinical definition of “abnormal” for these predictors could be used to help develop CHD screening and prevention strategies. Frontiers Media S.A. 2022-01-07 /pmc/articles/PMC8777022/ /pubmed/35071361 http://dx.doi.org/10.3389/fcvm.2021.797002 Text en Copyright © 2022 Qu, Deng, Lin, Han, Chang, Ou, Nie, Mai, Wang, Gao, Wu, Chen, Zhuang, Ryan and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cardiovascular Medicine Qu, Yanji Deng, Xinlei Lin, Shao Han, Fengzhen Chang, Howard H. Ou, Yanqiu Nie, Zhiqiang Mai, Jinzhuang Wang, Ximeng Gao, Xiangmin Wu, Yong Chen, Jimei Zhuang, Jian Ryan, Ian Liu, Xiaoqing Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases |
title | Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases |
title_full | Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases |
title_fullStr | Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases |
title_full_unstemmed | Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases |
title_short | Using Innovative Machine Learning Methods to Screen and Identify Predictors of Congenital Heart Diseases |
title_sort | using innovative machine learning methods to screen and identify predictors of congenital heart diseases |
topic | Cardiovascular Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8777022/ https://www.ncbi.nlm.nih.gov/pubmed/35071361 http://dx.doi.org/10.3389/fcvm.2021.797002 |
work_keys_str_mv | AT quyanji usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT dengxinlei usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT linshao usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT hanfengzhen usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT changhowardh usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT ouyanqiu usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT niezhiqiang usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT maijinzhuang usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT wangximeng usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT gaoxiangmin usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT wuyong usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT chenjimei usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT zhuangjian usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT ryanian usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases AT liuxiaoqing usinginnovativemachinelearningmethodstoscreenandidentifypredictorsofcongenitalheartdiseases |