Cargando…
External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants
Background: Postnatal gestational age (GA) algorithms derived from newborn metabolic profiles have emerged as a novel method of acquiring population-level preterm birth estimates in low resource settings. To date, model development and validation have been carried out in North American settings. Val...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8160452/ https://www.ncbi.nlm.nih.gov/pubmed/34104876 http://dx.doi.org/10.12688/gatesopenres.13131.2 |
_version_ | 1783700290303688704 |
---|---|
author | Hawken, Steven Murphy, Malia S. Q. Ducharme, Robin Bota, A. Brianne Wilson, Lindsay A. Cheng, Wei Tumulak, Ma-Am Joy Alcausin, Maria Melanie Liberty Reyes, Ma Elouisa Qiu, Wenjuan Potter, Beth K. Little, Julian Walker, Mark Zhang, Lin Padilla, Carmencita Chakraborty, Pranesh Wilson, Kumanan |
author_facet | Hawken, Steven Murphy, Malia S. Q. Ducharme, Robin Bota, A. Brianne Wilson, Lindsay A. Cheng, Wei Tumulak, Ma-Am Joy Alcausin, Maria Melanie Liberty Reyes, Ma Elouisa Qiu, Wenjuan Potter, Beth K. Little, Julian Walker, Mark Zhang, Lin Padilla, Carmencita Chakraborty, Pranesh Wilson, Kumanan |
author_sort | Hawken, Steven |
collection | PubMed |
description | Background: Postnatal gestational age (GA) algorithms derived from newborn metabolic profiles have emerged as a novel method of acquiring population-level preterm birth estimates in low resource settings. To date, model development and validation have been carried out in North American settings. Validation outside of these settings is warranted. Methods: This was a retrospective database study using data from newborn screening programs in Canada, the Philippines and China. ELASTICNET machine learning models were developed to estimate GA in a cohort of infants from Canada using sex, birth weight and metabolomic markers from newborn heel prick blood samples. Final models were internally validated in an independent sample of Canadian infants, and externally validated in infant cohorts from the Philippines and China. Results: Cohorts included 39,666 infants from Canada, 82,909 from the Philippines and 4,448 from China. For the full model including sex, birth weight and metabolomic markers, GA estimates were within ±5 days of ultrasound values in the Canadian internal validation (mean absolute error (MAE) 0.71, 95% CI: 0.71, 0.72), and within ±6 days of ultrasound GA in both the Filipino (0.90 (0.90, 0.91)) and Chinese cohorts (0.89 (0.86, 0.92)). Despite the decreased accuracy in external settings, our models incorporating metabolomic markers performed better than the baseline model, which relied on sex and birth weight alone. In preterm and growth-restricted infants, the accuracy of metabolomic models was markedly higher than the baseline model. Conclusions: Accuracy of metabolic GA algorithms was attenuated when applied in external settings. Models including metabolomic markers demonstrated higher accuracy than models using sex and birth weight alone. As innovators look to take this work to scale, further investigation of modeling and data normalization techniques will be needed to improve robustness and generalizability of metabolomic GA estimates in low resource settings, where this could have the most clinical utility |
format | Online Article Text |
id | pubmed-8160452 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-81604522021-06-07 External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants Hawken, Steven Murphy, Malia S. Q. Ducharme, Robin Bota, A. Brianne Wilson, Lindsay A. Cheng, Wei Tumulak, Ma-Am Joy Alcausin, Maria Melanie Liberty Reyes, Ma Elouisa Qiu, Wenjuan Potter, Beth K. Little, Julian Walker, Mark Zhang, Lin Padilla, Carmencita Chakraborty, Pranesh Wilson, Kumanan Gates Open Res Research Article Background: Postnatal gestational age (GA) algorithms derived from newborn metabolic profiles have emerged as a novel method of acquiring population-level preterm birth estimates in low resource settings. To date, model development and validation have been carried out in North American settings. Validation outside of these settings is warranted. Methods: This was a retrospective database study using data from newborn screening programs in Canada, the Philippines and China. ELASTICNET machine learning models were developed to estimate GA in a cohort of infants from Canada using sex, birth weight and metabolomic markers from newborn heel prick blood samples. Final models were internally validated in an independent sample of Canadian infants, and externally validated in infant cohorts from the Philippines and China. Results: Cohorts included 39,666 infants from Canada, 82,909 from the Philippines and 4,448 from China. For the full model including sex, birth weight and metabolomic markers, GA estimates were within ±5 days of ultrasound values in the Canadian internal validation (mean absolute error (MAE) 0.71, 95% CI: 0.71, 0.72), and within ±6 days of ultrasound GA in both the Filipino (0.90 (0.90, 0.91)) and Chinese cohorts (0.89 (0.86, 0.92)). Despite the decreased accuracy in external settings, our models incorporating metabolomic markers performed better than the baseline model, which relied on sex and birth weight alone. In preterm and growth-restricted infants, the accuracy of metabolomic models was markedly higher than the baseline model. Conclusions: Accuracy of metabolic GA algorithms was attenuated when applied in external settings. Models including metabolomic markers demonstrated higher accuracy than models using sex and birth weight alone. As innovators look to take this work to scale, further investigation of modeling and data normalization techniques will be needed to improve robustness and generalizability of metabolomic GA estimates in low resource settings, where this could have the most clinical utility F1000 Research Limited 2021-06-21 /pmc/articles/PMC8160452/ /pubmed/34104876 http://dx.doi.org/10.12688/gatesopenres.13131.2 Text en Copyright: © 2021 Hawken S et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Hawken, Steven Murphy, Malia S. Q. Ducharme, Robin Bota, A. Brianne Wilson, Lindsay A. Cheng, Wei Tumulak, Ma-Am Joy Alcausin, Maria Melanie Liberty Reyes, Ma Elouisa Qiu, Wenjuan Potter, Beth K. Little, Julian Walker, Mark Zhang, Lin Padilla, Carmencita Chakraborty, Pranesh Wilson, Kumanan External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants |
title | External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants |
title_full | External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants |
title_fullStr | External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants |
title_full_unstemmed | External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants |
title_short | External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants |
title_sort | external validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in east and south-east asian infants |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8160452/ https://www.ncbi.nlm.nih.gov/pubmed/34104876 http://dx.doi.org/10.12688/gatesopenres.13131.2 |
work_keys_str_mv | AT hawkensteven externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT murphymaliasq externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT ducharmerobin externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT botaabrianne externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT wilsonlindsaya externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT chengwei externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT tumulakmaamjoy externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT alcausinmariamelanieliberty externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT reyesmaelouisa externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT qiuwenjuan externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT potterbethk externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT littlejulian externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT walkermark externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT zhanglin externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT padillacarmencita externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT chakrabortypranesh externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants AT wilsonkumanan externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants |