Cargando…

Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015

Preterm birth is a global public health problem with a significant burden on the individuals affected. The study aimed to extend current research on preterm birth prognostic model development by developing and internally validating models using machine learning classification algorithms and populati...

Descripción completa

Detalles Bibliográficos
Autores principales: Wong, Kingsley, Tessema, Gizachew A., Chai, Kevin, Pereira, Gavin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9646808/
https://www.ncbi.nlm.nih.gov/pubmed/36352095
http://dx.doi.org/10.1038/s41598-022-23782-w
_version_ 1784827249293262848
author Wong, Kingsley
Tessema, Gizachew A.
Chai, Kevin
Pereira, Gavin
author_facet Wong, Kingsley
Tessema, Gizachew A.
Chai, Kevin
Pereira, Gavin
author_sort Wong, Kingsley
collection PubMed
description Preterm birth is a global public health problem with a significant burden on the individuals affected. The study aimed to extend current research on preterm birth prognostic model development by developing and internally validating models using machine learning classification algorithms and population-based routinely collected data in Western Australia. The longitudinal retrospective cohort study involved all births in Western Australia between 1980 and 2015, and the analytic sample contains 81,974 (8.6%) preterm births (< 37 weeks of gestation). Prediction models for preterm birth were developed using regularised logistic regression, decision trees, Random Forests, extreme gradient boosting, and multi-layer perceptron (MLP). Predictors included maternal socio-demographics and medical conditions, current and past pregnancy complications, and family history. Class weight was applied to handle imbalanced outcomes and stratified tenfold cross-validation was used to reduce overfitting. Close to half of the preterm births (49.1% at 5% FPR, 95% CI 48.9%,49.5%) were correctly classified by the best performing classifier (MLP) for all women when current pregnancy information was available. The sensitivity was boosted to 52.7% (95% CI 52.1%,53.3%) after including past obstetric history in a sub-population of births from multiparous women. Around half of the preterm birth can be identified antenatally at high specificity using population-based routinely collected maternal and pregnancy data. The performance of the prediction models depends on the available predictor pool that is individual and time specific.
format Online
Article
Text
id pubmed-9646808
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96468082022-11-15 Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 Wong, Kingsley Tessema, Gizachew A. Chai, Kevin Pereira, Gavin Sci Rep Article Preterm birth is a global public health problem with a significant burden on the individuals affected. The study aimed to extend current research on preterm birth prognostic model development by developing and internally validating models using machine learning classification algorithms and population-based routinely collected data in Western Australia. The longitudinal retrospective cohort study involved all births in Western Australia between 1980 and 2015, and the analytic sample contains 81,974 (8.6%) preterm births (< 37 weeks of gestation). Prediction models for preterm birth were developed using regularised logistic regression, decision trees, Random Forests, extreme gradient boosting, and multi-layer perceptron (MLP). Predictors included maternal socio-demographics and medical conditions, current and past pregnancy complications, and family history. Class weight was applied to handle imbalanced outcomes and stratified tenfold cross-validation was used to reduce overfitting. Close to half of the preterm births (49.1% at 5% FPR, 95% CI 48.9%,49.5%) were correctly classified by the best performing classifier (MLP) for all women when current pregnancy information was available. The sensitivity was boosted to 52.7% (95% CI 52.1%,53.3%) after including past obstetric history in a sub-population of births from multiparous women. Around half of the preterm birth can be identified antenatally at high specificity using population-based routinely collected maternal and pregnancy data. The performance of the prediction models depends on the available predictor pool that is individual and time specific. Nature Publishing Group UK 2022-11-09 /pmc/articles/PMC9646808/ /pubmed/36352095 http://dx.doi.org/10.1038/s41598-022-23782-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Wong, Kingsley
Tessema, Gizachew A.
Chai, Kevin
Pereira, Gavin
Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015
title Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015
title_full Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015
title_fullStr Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015
title_full_unstemmed Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015
title_short Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015
title_sort development of prognostic model for preterm birth using machine learning in a population-based cohort of western australia births between 1980 and 2015
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9646808/
https://www.ncbi.nlm.nih.gov/pubmed/36352095
http://dx.doi.org/10.1038/s41598-022-23782-w
work_keys_str_mv AT wongkingsley developmentofprognosticmodelforpretermbirthusingmachinelearninginapopulationbasedcohortofwesternaustraliabirthsbetween1980and2015
AT tessemagizachewa developmentofprognosticmodelforpretermbirthusingmachinelearninginapopulationbasedcohortofwesternaustraliabirthsbetween1980and2015
AT chaikevin developmentofprognosticmodelforpretermbirthusingmachinelearninginapopulationbasedcohortofwesternaustraliabirthsbetween1980and2015
AT pereiragavin developmentofprognosticmodelforpretermbirthusingmachinelearninginapopulationbasedcohortofwesternaustraliabirthsbetween1980and2015