Cargando…

Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa

BACKGROUND: Babies born early and/or small for gestational age in Low and Middle-income countries (LMICs) contribute substantially to global neonatal and infant mortality. Tracking this metric is critical at a population level for informed policy, advocacy, resources allocation and program evaluatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Sazawal, Sunil, Ryckman, Kelli K., Das, Sayan, Khanam, Rasheda, Nisar, Imran, Jasper, Elizabeth, Dutta, Arup, Rahman, Sayedur, Mehmood, Usma, Bedell, Bruce, Deb, Saikat, Chowdhury, Nabidul Haque, Barkat, Amina, Mittal, Harshita, Ahmed, Salahuddin, Khalid, Farah, Raqib, Rubhana, Manu, Alexander, Yoshida, Sachiyo, Ilyas, Muhammad, Nizar, Ambreen, Ali, Said Mohammed, Baqui, Abdullah H., Jehan, Fyezah, Dhingra, Usha, Bahl, Rajiv
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8424940/
https://www.ncbi.nlm.nih.gov/pubmed/34493237
http://dx.doi.org/10.1186/s12884-021-04067-y
_version_ 1783749757095641088
author Sazawal, Sunil
Ryckman, Kelli K.
Das, Sayan
Khanam, Rasheda
Nisar, Imran
Jasper, Elizabeth
Dutta, Arup
Rahman, Sayedur
Mehmood, Usma
Bedell, Bruce
Deb, Saikat
Chowdhury, Nabidul Haque
Barkat, Amina
Mittal, Harshita
Ahmed, Salahuddin
Khalid, Farah
Raqib, Rubhana
Manu, Alexander
Yoshida, Sachiyo
Ilyas, Muhammad
Nizar, Ambreen
Ali, Said Mohammed
Baqui, Abdullah H.
Jehan, Fyezah
Dhingra, Usha
Bahl, Rajiv
author_facet Sazawal, Sunil
Ryckman, Kelli K.
Das, Sayan
Khanam, Rasheda
Nisar, Imran
Jasper, Elizabeth
Dutta, Arup
Rahman, Sayedur
Mehmood, Usma
Bedell, Bruce
Deb, Saikat
Chowdhury, Nabidul Haque
Barkat, Amina
Mittal, Harshita
Ahmed, Salahuddin
Khalid, Farah
Raqib, Rubhana
Manu, Alexander
Yoshida, Sachiyo
Ilyas, Muhammad
Nizar, Ambreen
Ali, Said Mohammed
Baqui, Abdullah H.
Jehan, Fyezah
Dhingra, Usha
Bahl, Rajiv
author_sort Sazawal, Sunil
collection PubMed
description BACKGROUND: Babies born early and/or small for gestational age in Low and Middle-income countries (LMICs) contribute substantially to global neonatal and infant mortality. Tracking this metric is critical at a population level for informed policy, advocacy, resources allocation and program evaluation and at an individual level for targeted care. Early prenatal ultrasound examination is not available in these settings, gestational age (GA) is estimated using new-born assessment, last menstrual period (LMP) recalls and birth weight, which are unreliable. Algorithms in developed settings, using metabolic screen data, provided GA estimates within 1–2 weeks of ultrasonography-based GA. We sought to leverage machine learning algorithms to improve accuracy and applicability of this approach to LMICs settings. METHODS: This study uses data from AMANHI-ACT, a prospective pregnancy cohorts in Asia and Africa where early pregnancy ultrasonography estimated GA and birth weight are available and metabolite screening data in a subset of 1318 new-borns were also available. We utilized this opportunity to develop machine learning (ML) algorithms. Random Forest Regressor was used where data was randomly split into model-building and model-testing dataset. Mean absolute error (MAE) and root mean square error (RMSE) were used to evaluate performance. Bootstrap procedures were used to estimate confidence intervals (CI) for RMSE and MAE. For pre-term birth identification ROC analysis with bootstrap and exact estimation of CI for area under curve (AUC) were performed. RESULTS: Overall model estimated GA had MAE of 5.2 days (95% CI 4.6–6.8), which was similar to performance in SGA, MAE 5.3 days (95% CI 4.6–6.2). GA was correctly estimated to within 1 week for 85.21% (95% CI 72.31–94.65). For preterm birth classification, AUC in ROC analysis was 98.1% (95% CI 96.0–99.0; p < 0.001). This model performed better than Iowa regression, AUC Difference 14.4% (95% CI 5–23.7; p = 0.002). CONCLUSIONS: Machine learning algorithms and models applied to metabolomic gestational age dating offer a ladder of opportunity for providing accurate population-level gestational age estimates in LMICs settings. These findings also point to an opportunity for investigation of region-specific models, more focused feasible analyte models, and broad untargeted metabolome investigation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12884-021-04067-y.
format Online
Article
Text
id pubmed-8424940
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-84249402021-09-10 Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa Sazawal, Sunil Ryckman, Kelli K. Das, Sayan Khanam, Rasheda Nisar, Imran Jasper, Elizabeth Dutta, Arup Rahman, Sayedur Mehmood, Usma Bedell, Bruce Deb, Saikat Chowdhury, Nabidul Haque Barkat, Amina Mittal, Harshita Ahmed, Salahuddin Khalid, Farah Raqib, Rubhana Manu, Alexander Yoshida, Sachiyo Ilyas, Muhammad Nizar, Ambreen Ali, Said Mohammed Baqui, Abdullah H. Jehan, Fyezah Dhingra, Usha Bahl, Rajiv BMC Pregnancy Childbirth Research BACKGROUND: Babies born early and/or small for gestational age in Low and Middle-income countries (LMICs) contribute substantially to global neonatal and infant mortality. Tracking this metric is critical at a population level for informed policy, advocacy, resources allocation and program evaluation and at an individual level for targeted care. Early prenatal ultrasound examination is not available in these settings, gestational age (GA) is estimated using new-born assessment, last menstrual period (LMP) recalls and birth weight, which are unreliable. Algorithms in developed settings, using metabolic screen data, provided GA estimates within 1–2 weeks of ultrasonography-based GA. We sought to leverage machine learning algorithms to improve accuracy and applicability of this approach to LMICs settings. METHODS: This study uses data from AMANHI-ACT, a prospective pregnancy cohorts in Asia and Africa where early pregnancy ultrasonography estimated GA and birth weight are available and metabolite screening data in a subset of 1318 new-borns were also available. We utilized this opportunity to develop machine learning (ML) algorithms. Random Forest Regressor was used where data was randomly split into model-building and model-testing dataset. Mean absolute error (MAE) and root mean square error (RMSE) were used to evaluate performance. Bootstrap procedures were used to estimate confidence intervals (CI) for RMSE and MAE. For pre-term birth identification ROC analysis with bootstrap and exact estimation of CI for area under curve (AUC) were performed. RESULTS: Overall model estimated GA had MAE of 5.2 days (95% CI 4.6–6.8), which was similar to performance in SGA, MAE 5.3 days (95% CI 4.6–6.2). GA was correctly estimated to within 1 week for 85.21% (95% CI 72.31–94.65). For preterm birth classification, AUC in ROC analysis was 98.1% (95% CI 96.0–99.0; p < 0.001). This model performed better than Iowa regression, AUC Difference 14.4% (95% CI 5–23.7; p = 0.002). CONCLUSIONS: Machine learning algorithms and models applied to metabolomic gestational age dating offer a ladder of opportunity for providing accurate population-level gestational age estimates in LMICs settings. These findings also point to an opportunity for investigation of region-specific models, more focused feasible analyte models, and broad untargeted metabolome investigation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12884-021-04067-y. BioMed Central 2021-09-07 /pmc/articles/PMC8424940/ /pubmed/34493237 http://dx.doi.org/10.1186/s12884-021-04067-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Sazawal, Sunil
Ryckman, Kelli K.
Das, Sayan
Khanam, Rasheda
Nisar, Imran
Jasper, Elizabeth
Dutta, Arup
Rahman, Sayedur
Mehmood, Usma
Bedell, Bruce
Deb, Saikat
Chowdhury, Nabidul Haque
Barkat, Amina
Mittal, Harshita
Ahmed, Salahuddin
Khalid, Farah
Raqib, Rubhana
Manu, Alexander
Yoshida, Sachiyo
Ilyas, Muhammad
Nizar, Ambreen
Ali, Said Mohammed
Baqui, Abdullah H.
Jehan, Fyezah
Dhingra, Usha
Bahl, Rajiv
Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa
title Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa
title_full Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa
title_fullStr Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa
title_full_unstemmed Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa
title_short Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa
title_sort machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in south asia and sub-saharan africa
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8424940/
https://www.ncbi.nlm.nih.gov/pubmed/34493237
http://dx.doi.org/10.1186/s12884-021-04067-y
work_keys_str_mv AT sazawalsunil machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT ryckmankellik machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT dassayan machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT khanamrasheda machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT nisarimran machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT jasperelizabeth machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT duttaarup machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT rahmansayedur machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT mehmoodusma machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT bedellbruce machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT debsaikat machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT chowdhurynabidulhaque machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT barkatamina machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT mittalharshita machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT ahmedsalahuddin machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT khalidfarah machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT raqibrubhana machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT manualexander machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT yoshidasachiyo machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT ilyasmuhammad machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT nizarambreen machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT alisaidmohammed machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT baquiabdullahh machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT jehanfyezah machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT dhingrausha machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica
AT bahlrajiv machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica