Cargando…
Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms
BACKGROUND: The association between prenatal pesticide exposures and a higher incidence of small-for-gestational-age (SGA) births has been reported. No prediction model has been developed for SGA neonates in pregnant women exposed to pesticides prior to pregnancy. METHODS: A retrospective cohort stu...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9394741/ https://www.ncbi.nlm.nih.gov/pubmed/36003638 http://dx.doi.org/10.3389/fpubh.2022.940182 |
_version_ | 1784771544142053376 |
---|---|
author | Bai, Xi Zhou, Zhibo Su, Mingliang Li, Yansheng Yang, Liuqing Liu, Kejia Yang, Hongbo Zhu, Huijuan Chen, Shi Pan, Hui |
author_facet | Bai, Xi Zhou, Zhibo Su, Mingliang Li, Yansheng Yang, Liuqing Liu, Kejia Yang, Hongbo Zhu, Huijuan Chen, Shi Pan, Hui |
author_sort | Bai, Xi |
collection | PubMed |
description | BACKGROUND: The association between prenatal pesticide exposures and a higher incidence of small-for-gestational-age (SGA) births has been reported. No prediction model has been developed for SGA neonates in pregnant women exposed to pesticides prior to pregnancy. METHODS: A retrospective cohort study was conducted using information from the National Free Preconception Health Examination Project between 2010 and 2012. A development set (n = 606) and a validation set (n = 151) of the dataset were split at random. Traditional logistic regression (LR) method and six machine learning classifiers were used to develop prediction models for SGA neonates. The Shapley Additive Explanation (SHAP) model was applied to determine the most influential variables that contributed to the outcome of the prediction. RESULTS: 757 neonates in total were analyzed. SGA occurred in 12.9% (n = 98) of cases overall. With an area under the receiver-operating-characteristic curve (AUC) of 0.855 [95% confidence interval (CI): 0.752–0.959], the model based on category boosting (CatBoost) algorithm obtained the best performance in the validation set. With the exception of the LR model (AUC: 0.691, 95% CI: 0.554–0.828), all models had good AUCs. Using recursive feature elimination (RFE) approach to perform the feature selection, we included 15 variables in the final model based on CatBoost classifier, achieving the AUC of 0.811 (95% CI: 0.675–0.947). CONCLUSIONS: Machine learning algorithms can develop satisfactory tools for SGA prediction in mothers exposed to pesticides prior to pregnancy, which might become a tool to predict SGA neonates in the high-risk population. |
format | Online Article Text |
id | pubmed-9394741 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93947412022-08-23 Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms Bai, Xi Zhou, Zhibo Su, Mingliang Li, Yansheng Yang, Liuqing Liu, Kejia Yang, Hongbo Zhu, Huijuan Chen, Shi Pan, Hui Front Public Health Public Health BACKGROUND: The association between prenatal pesticide exposures and a higher incidence of small-for-gestational-age (SGA) births has been reported. No prediction model has been developed for SGA neonates in pregnant women exposed to pesticides prior to pregnancy. METHODS: A retrospective cohort study was conducted using information from the National Free Preconception Health Examination Project between 2010 and 2012. A development set (n = 606) and a validation set (n = 151) of the dataset were split at random. Traditional logistic regression (LR) method and six machine learning classifiers were used to develop prediction models for SGA neonates. The Shapley Additive Explanation (SHAP) model was applied to determine the most influential variables that contributed to the outcome of the prediction. RESULTS: 757 neonates in total were analyzed. SGA occurred in 12.9% (n = 98) of cases overall. With an area under the receiver-operating-characteristic curve (AUC) of 0.855 [95% confidence interval (CI): 0.752–0.959], the model based on category boosting (CatBoost) algorithm obtained the best performance in the validation set. With the exception of the LR model (AUC: 0.691, 95% CI: 0.554–0.828), all models had good AUCs. Using recursive feature elimination (RFE) approach to perform the feature selection, we included 15 variables in the final model based on CatBoost classifier, achieving the AUC of 0.811 (95% CI: 0.675–0.947). CONCLUSIONS: Machine learning algorithms can develop satisfactory tools for SGA prediction in mothers exposed to pesticides prior to pregnancy, which might become a tool to predict SGA neonates in the high-risk population. Frontiers Media S.A. 2022-08-08 /pmc/articles/PMC9394741/ /pubmed/36003638 http://dx.doi.org/10.3389/fpubh.2022.940182 Text en Copyright © 2022 Bai, Zhou, Su, Li, Yang, Liu, Yang, Zhu, Chen and Pan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Public Health Bai, Xi Zhou, Zhibo Su, Mingliang Li, Yansheng Yang, Liuqing Liu, Kejia Yang, Hongbo Zhu, Huijuan Chen, Shi Pan, Hui Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms |
title | Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms |
title_full | Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms |
title_fullStr | Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms |
title_full_unstemmed | Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms |
title_short | Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms |
title_sort | predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms |
topic | Public Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9394741/ https://www.ncbi.nlm.nih.gov/pubmed/36003638 http://dx.doi.org/10.3389/fpubh.2022.940182 |
work_keys_str_mv | AT baixi predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT zhouzhibo predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT sumingliang predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT liyansheng predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT yangliuqing predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT liukejia predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT yanghongbo predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT zhuhuijuan predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT chenshi predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms AT panhui predictivemodelsforsmallforgestationalagebirthsinwomenexposedtopesticidesbeforepregnancybasedonmultiplemachinelearningalgorithms |