Cargando…
Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa
BACKGROUND: Babies born early and/or small for gestational age in Low and Middle-income countries (LMICs) contribute substantially to global neonatal and infant mortality. Tracking this metric is critical at a population level for informed policy, advocacy, resources allocation and program evaluatio...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8424940/ https://www.ncbi.nlm.nih.gov/pubmed/34493237 http://dx.doi.org/10.1186/s12884-021-04067-y |
_version_ | 1783749757095641088 |
---|---|
author | Sazawal, Sunil Ryckman, Kelli K. Das, Sayan Khanam, Rasheda Nisar, Imran Jasper, Elizabeth Dutta, Arup Rahman, Sayedur Mehmood, Usma Bedell, Bruce Deb, Saikat Chowdhury, Nabidul Haque Barkat, Amina Mittal, Harshita Ahmed, Salahuddin Khalid, Farah Raqib, Rubhana Manu, Alexander Yoshida, Sachiyo Ilyas, Muhammad Nizar, Ambreen Ali, Said Mohammed Baqui, Abdullah H. Jehan, Fyezah Dhingra, Usha Bahl, Rajiv |
author_facet | Sazawal, Sunil Ryckman, Kelli K. Das, Sayan Khanam, Rasheda Nisar, Imran Jasper, Elizabeth Dutta, Arup Rahman, Sayedur Mehmood, Usma Bedell, Bruce Deb, Saikat Chowdhury, Nabidul Haque Barkat, Amina Mittal, Harshita Ahmed, Salahuddin Khalid, Farah Raqib, Rubhana Manu, Alexander Yoshida, Sachiyo Ilyas, Muhammad Nizar, Ambreen Ali, Said Mohammed Baqui, Abdullah H. Jehan, Fyezah Dhingra, Usha Bahl, Rajiv |
author_sort | Sazawal, Sunil |
collection | PubMed |
description | BACKGROUND: Babies born early and/or small for gestational age in Low and Middle-income countries (LMICs) contribute substantially to global neonatal and infant mortality. Tracking this metric is critical at a population level for informed policy, advocacy, resources allocation and program evaluation and at an individual level for targeted care. Early prenatal ultrasound examination is not available in these settings, gestational age (GA) is estimated using new-born assessment, last menstrual period (LMP) recalls and birth weight, which are unreliable. Algorithms in developed settings, using metabolic screen data, provided GA estimates within 1–2 weeks of ultrasonography-based GA. We sought to leverage machine learning algorithms to improve accuracy and applicability of this approach to LMICs settings. METHODS: This study uses data from AMANHI-ACT, a prospective pregnancy cohorts in Asia and Africa where early pregnancy ultrasonography estimated GA and birth weight are available and metabolite screening data in a subset of 1318 new-borns were also available. We utilized this opportunity to develop machine learning (ML) algorithms. Random Forest Regressor was used where data was randomly split into model-building and model-testing dataset. Mean absolute error (MAE) and root mean square error (RMSE) were used to evaluate performance. Bootstrap procedures were used to estimate confidence intervals (CI) for RMSE and MAE. For pre-term birth identification ROC analysis with bootstrap and exact estimation of CI for area under curve (AUC) were performed. RESULTS: Overall model estimated GA had MAE of 5.2 days (95% CI 4.6–6.8), which was similar to performance in SGA, MAE 5.3 days (95% CI 4.6–6.2). GA was correctly estimated to within 1 week for 85.21% (95% CI 72.31–94.65). For preterm birth classification, AUC in ROC analysis was 98.1% (95% CI 96.0–99.0; p < 0.001). This model performed better than Iowa regression, AUC Difference 14.4% (95% CI 5–23.7; p = 0.002). CONCLUSIONS: Machine learning algorithms and models applied to metabolomic gestational age dating offer a ladder of opportunity for providing accurate population-level gestational age estimates in LMICs settings. These findings also point to an opportunity for investigation of region-specific models, more focused feasible analyte models, and broad untargeted metabolome investigation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12884-021-04067-y. |
format | Online Article Text |
id | pubmed-8424940 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-84249402021-09-10 Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa Sazawal, Sunil Ryckman, Kelli K. Das, Sayan Khanam, Rasheda Nisar, Imran Jasper, Elizabeth Dutta, Arup Rahman, Sayedur Mehmood, Usma Bedell, Bruce Deb, Saikat Chowdhury, Nabidul Haque Barkat, Amina Mittal, Harshita Ahmed, Salahuddin Khalid, Farah Raqib, Rubhana Manu, Alexander Yoshida, Sachiyo Ilyas, Muhammad Nizar, Ambreen Ali, Said Mohammed Baqui, Abdullah H. Jehan, Fyezah Dhingra, Usha Bahl, Rajiv BMC Pregnancy Childbirth Research BACKGROUND: Babies born early and/or small for gestational age in Low and Middle-income countries (LMICs) contribute substantially to global neonatal and infant mortality. Tracking this metric is critical at a population level for informed policy, advocacy, resources allocation and program evaluation and at an individual level for targeted care. Early prenatal ultrasound examination is not available in these settings, gestational age (GA) is estimated using new-born assessment, last menstrual period (LMP) recalls and birth weight, which are unreliable. Algorithms in developed settings, using metabolic screen data, provided GA estimates within 1–2 weeks of ultrasonography-based GA. We sought to leverage machine learning algorithms to improve accuracy and applicability of this approach to LMICs settings. METHODS: This study uses data from AMANHI-ACT, a prospective pregnancy cohorts in Asia and Africa where early pregnancy ultrasonography estimated GA and birth weight are available and metabolite screening data in a subset of 1318 new-borns were also available. We utilized this opportunity to develop machine learning (ML) algorithms. Random Forest Regressor was used where data was randomly split into model-building and model-testing dataset. Mean absolute error (MAE) and root mean square error (RMSE) were used to evaluate performance. Bootstrap procedures were used to estimate confidence intervals (CI) for RMSE and MAE. For pre-term birth identification ROC analysis with bootstrap and exact estimation of CI for area under curve (AUC) were performed. RESULTS: Overall model estimated GA had MAE of 5.2 days (95% CI 4.6–6.8), which was similar to performance in SGA, MAE 5.3 days (95% CI 4.6–6.2). GA was correctly estimated to within 1 week for 85.21% (95% CI 72.31–94.65). For preterm birth classification, AUC in ROC analysis was 98.1% (95% CI 96.0–99.0; p < 0.001). This model performed better than Iowa regression, AUC Difference 14.4% (95% CI 5–23.7; p = 0.002). CONCLUSIONS: Machine learning algorithms and models applied to metabolomic gestational age dating offer a ladder of opportunity for providing accurate population-level gestational age estimates in LMICs settings. These findings also point to an opportunity for investigation of region-specific models, more focused feasible analyte models, and broad untargeted metabolome investigation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12884-021-04067-y. BioMed Central 2021-09-07 /pmc/articles/PMC8424940/ /pubmed/34493237 http://dx.doi.org/10.1186/s12884-021-04067-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Sazawal, Sunil Ryckman, Kelli K. Das, Sayan Khanam, Rasheda Nisar, Imran Jasper, Elizabeth Dutta, Arup Rahman, Sayedur Mehmood, Usma Bedell, Bruce Deb, Saikat Chowdhury, Nabidul Haque Barkat, Amina Mittal, Harshita Ahmed, Salahuddin Khalid, Farah Raqib, Rubhana Manu, Alexander Yoshida, Sachiyo Ilyas, Muhammad Nizar, Ambreen Ali, Said Mohammed Baqui, Abdullah H. Jehan, Fyezah Dhingra, Usha Bahl, Rajiv Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa |
title | Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa |
title_full | Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa |
title_fullStr | Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa |
title_full_unstemmed | Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa |
title_short | Machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in South Asia and sub-Saharan Africa |
title_sort | machine learning guided postnatal gestational age assessment using new-born screening metabolomic data in south asia and sub-saharan africa |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8424940/ https://www.ncbi.nlm.nih.gov/pubmed/34493237 http://dx.doi.org/10.1186/s12884-021-04067-y |
work_keys_str_mv | AT sazawalsunil machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT ryckmankellik machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT dassayan machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT khanamrasheda machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT nisarimran machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT jasperelizabeth machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT duttaarup machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT rahmansayedur machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT mehmoodusma machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT bedellbruce machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT debsaikat machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT chowdhurynabidulhaque machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT barkatamina machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT mittalharshita machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT ahmedsalahuddin machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT khalidfarah machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT raqibrubhana machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT manualexander machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT yoshidasachiyo machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT ilyasmuhammad machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT nizarambreen machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT alisaidmohammed machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT baquiabdullahh machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT jehanfyezah machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT dhingrausha machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica AT bahlrajiv machinelearningguidedpostnatalgestationalageassessmentusingnewbornscreeningmetabolomicdatainsouthasiaandsubsaharanafrica |