Cargando…
Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015
Preterm birth is a global public health problem with a significant burden on the individuals affected. The study aimed to extend current research on preterm birth prognostic model development by developing and internally validating models using machine learning classification algorithms and populati...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9646808/ https://www.ncbi.nlm.nih.gov/pubmed/36352095 http://dx.doi.org/10.1038/s41598-022-23782-w |
_version_ | 1784827249293262848 |
---|---|
author | Wong, Kingsley Tessema, Gizachew A. Chai, Kevin Pereira, Gavin |
author_facet | Wong, Kingsley Tessema, Gizachew A. Chai, Kevin Pereira, Gavin |
author_sort | Wong, Kingsley |
collection | PubMed |
description | Preterm birth is a global public health problem with a significant burden on the individuals affected. The study aimed to extend current research on preterm birth prognostic model development by developing and internally validating models using machine learning classification algorithms and population-based routinely collected data in Western Australia. The longitudinal retrospective cohort study involved all births in Western Australia between 1980 and 2015, and the analytic sample contains 81,974 (8.6%) preterm births (< 37 weeks of gestation). Prediction models for preterm birth were developed using regularised logistic regression, decision trees, Random Forests, extreme gradient boosting, and multi-layer perceptron (MLP). Predictors included maternal socio-demographics and medical conditions, current and past pregnancy complications, and family history. Class weight was applied to handle imbalanced outcomes and stratified tenfold cross-validation was used to reduce overfitting. Close to half of the preterm births (49.1% at 5% FPR, 95% CI 48.9%,49.5%) were correctly classified by the best performing classifier (MLP) for all women when current pregnancy information was available. The sensitivity was boosted to 52.7% (95% CI 52.1%,53.3%) after including past obstetric history in a sub-population of births from multiparous women. Around half of the preterm birth can be identified antenatally at high specificity using population-based routinely collected maternal and pregnancy data. The performance of the prediction models depends on the available predictor pool that is individual and time specific. |
format | Online Article Text |
id | pubmed-9646808 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-96468082022-11-15 Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 Wong, Kingsley Tessema, Gizachew A. Chai, Kevin Pereira, Gavin Sci Rep Article Preterm birth is a global public health problem with a significant burden on the individuals affected. The study aimed to extend current research on preterm birth prognostic model development by developing and internally validating models using machine learning classification algorithms and population-based routinely collected data in Western Australia. The longitudinal retrospective cohort study involved all births in Western Australia between 1980 and 2015, and the analytic sample contains 81,974 (8.6%) preterm births (< 37 weeks of gestation). Prediction models for preterm birth were developed using regularised logistic regression, decision trees, Random Forests, extreme gradient boosting, and multi-layer perceptron (MLP). Predictors included maternal socio-demographics and medical conditions, current and past pregnancy complications, and family history. Class weight was applied to handle imbalanced outcomes and stratified tenfold cross-validation was used to reduce overfitting. Close to half of the preterm births (49.1% at 5% FPR, 95% CI 48.9%,49.5%) were correctly classified by the best performing classifier (MLP) for all women when current pregnancy information was available. The sensitivity was boosted to 52.7% (95% CI 52.1%,53.3%) after including past obstetric history in a sub-population of births from multiparous women. Around half of the preterm birth can be identified antenatally at high specificity using population-based routinely collected maternal and pregnancy data. The performance of the prediction models depends on the available predictor pool that is individual and time specific. Nature Publishing Group UK 2022-11-09 /pmc/articles/PMC9646808/ /pubmed/36352095 http://dx.doi.org/10.1038/s41598-022-23782-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Wong, Kingsley Tessema, Gizachew A. Chai, Kevin Pereira, Gavin Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 |
title | Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 |
title_full | Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 |
title_fullStr | Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 |
title_full_unstemmed | Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 |
title_short | Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 |
title_sort | development of prognostic model for preterm birth using machine learning in a population-based cohort of western australia births between 1980 and 2015 |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9646808/ https://www.ncbi.nlm.nih.gov/pubmed/36352095 http://dx.doi.org/10.1038/s41598-022-23782-w |
work_keys_str_mv | AT wongkingsley developmentofprognosticmodelforpretermbirthusingmachinelearninginapopulationbasedcohortofwesternaustraliabirthsbetween1980and2015 AT tessemagizachewa developmentofprognosticmodelforpretermbirthusingmachinelearninginapopulationbasedcohortofwesternaustraliabirthsbetween1980and2015 AT chaikevin developmentofprognosticmodelforpretermbirthusingmachinelearninginapopulationbasedcohortofwesternaustraliabirthsbetween1980and2015 AT pereiragavin developmentofprognosticmodelforpretermbirthusingmachinelearninginapopulationbasedcohortofwesternaustraliabirthsbetween1980and2015 |