Cargando…

External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants

Background: Postnatal gestational age (GA) algorithms derived from newborn metabolic profiles have emerged as a novel method of acquiring population-level preterm birth estimates in low resource settings. To date, model development and validation have been carried out in North American settings. Val...

Descripción completa

Detalles Bibliográficos
Autores principales: Hawken, Steven, Murphy, Malia S. Q., Ducharme, Robin, Bota, A. Brianne, Wilson, Lindsay A., Cheng, Wei, Tumulak, Ma-Am Joy, Alcausin, Maria Melanie Liberty, Reyes, Ma Elouisa, Qiu, Wenjuan, Potter, Beth K., Little, Julian, Walker, Mark, Zhang, Lin, Padilla, Carmencita, Chakraborty, Pranesh, Wilson, Kumanan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8160452/
https://www.ncbi.nlm.nih.gov/pubmed/34104876
http://dx.doi.org/10.12688/gatesopenres.13131.2
_version_ 1783700290303688704
author Hawken, Steven
Murphy, Malia S. Q.
Ducharme, Robin
Bota, A. Brianne
Wilson, Lindsay A.
Cheng, Wei
Tumulak, Ma-Am Joy
Alcausin, Maria Melanie Liberty
Reyes, Ma Elouisa
Qiu, Wenjuan
Potter, Beth K.
Little, Julian
Walker, Mark
Zhang, Lin
Padilla, Carmencita
Chakraborty, Pranesh
Wilson, Kumanan
author_facet Hawken, Steven
Murphy, Malia S. Q.
Ducharme, Robin
Bota, A. Brianne
Wilson, Lindsay A.
Cheng, Wei
Tumulak, Ma-Am Joy
Alcausin, Maria Melanie Liberty
Reyes, Ma Elouisa
Qiu, Wenjuan
Potter, Beth K.
Little, Julian
Walker, Mark
Zhang, Lin
Padilla, Carmencita
Chakraborty, Pranesh
Wilson, Kumanan
author_sort Hawken, Steven
collection PubMed
description Background: Postnatal gestational age (GA) algorithms derived from newborn metabolic profiles have emerged as a novel method of acquiring population-level preterm birth estimates in low resource settings. To date, model development and validation have been carried out in North American settings. Validation outside of these settings is warranted.   Methods: This was a retrospective database study using data from newborn screening programs in Canada, the Philippines and China. ELASTICNET machine learning models were developed to estimate GA in a cohort of infants from Canada using sex, birth weight and metabolomic markers from newborn heel prick blood samples. Final models were internally validated in an independent sample of Canadian infants, and externally validated in infant cohorts from the Philippines and China.  Results: Cohorts included 39,666 infants from Canada, 82,909 from the Philippines and 4,448 from China.  For the full model including sex, birth weight and metabolomic markers, GA estimates were within ±5 days of ultrasound values in the Canadian internal validation (mean absolute error (MAE) 0.71, 95% CI: 0.71, 0.72), and within ±6 days of ultrasound GA in both the Filipino (0.90 (0.90, 0.91)) and Chinese cohorts (0.89 (0.86, 0.92)). Despite the decreased accuracy in external settings, our models incorporating metabolomic markers performed better than the baseline model, which relied on sex and birth weight alone. In preterm and growth-restricted infants, the accuracy of metabolomic models was markedly higher than the baseline model. Conclusions: Accuracy of metabolic GA algorithms was attenuated when applied in external settings.  Models including metabolomic markers demonstrated higher accuracy than models using sex and birth weight alone. As innovators look to take this work to scale, further investigation of modeling and data normalization techniques will be needed to improve robustness and generalizability of metabolomic GA estimates in low resource settings, where this could have the most clinical utility
format Online
Article
Text
id pubmed-8160452
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-81604522021-06-07 External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants Hawken, Steven Murphy, Malia S. Q. Ducharme, Robin Bota, A. Brianne Wilson, Lindsay A. Cheng, Wei Tumulak, Ma-Am Joy Alcausin, Maria Melanie Liberty Reyes, Ma Elouisa Qiu, Wenjuan Potter, Beth K. Little, Julian Walker, Mark Zhang, Lin Padilla, Carmencita Chakraborty, Pranesh Wilson, Kumanan Gates Open Res Research Article Background: Postnatal gestational age (GA) algorithms derived from newborn metabolic profiles have emerged as a novel method of acquiring population-level preterm birth estimates in low resource settings. To date, model development and validation have been carried out in North American settings. Validation outside of these settings is warranted.   Methods: This was a retrospective database study using data from newborn screening programs in Canada, the Philippines and China. ELASTICNET machine learning models were developed to estimate GA in a cohort of infants from Canada using sex, birth weight and metabolomic markers from newborn heel prick blood samples. Final models were internally validated in an independent sample of Canadian infants, and externally validated in infant cohorts from the Philippines and China.  Results: Cohorts included 39,666 infants from Canada, 82,909 from the Philippines and 4,448 from China.  For the full model including sex, birth weight and metabolomic markers, GA estimates were within ±5 days of ultrasound values in the Canadian internal validation (mean absolute error (MAE) 0.71, 95% CI: 0.71, 0.72), and within ±6 days of ultrasound GA in both the Filipino (0.90 (0.90, 0.91)) and Chinese cohorts (0.89 (0.86, 0.92)). Despite the decreased accuracy in external settings, our models incorporating metabolomic markers performed better than the baseline model, which relied on sex and birth weight alone. In preterm and growth-restricted infants, the accuracy of metabolomic models was markedly higher than the baseline model. Conclusions: Accuracy of metabolic GA algorithms was attenuated when applied in external settings.  Models including metabolomic markers demonstrated higher accuracy than models using sex and birth weight alone. As innovators look to take this work to scale, further investigation of modeling and data normalization techniques will be needed to improve robustness and generalizability of metabolomic GA estimates in low resource settings, where this could have the most clinical utility F1000 Research Limited 2021-06-21 /pmc/articles/PMC8160452/ /pubmed/34104876 http://dx.doi.org/10.12688/gatesopenres.13131.2 Text en Copyright: © 2021 Hawken S et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hawken, Steven
Murphy, Malia S. Q.
Ducharme, Robin
Bota, A. Brianne
Wilson, Lindsay A.
Cheng, Wei
Tumulak, Ma-Am Joy
Alcausin, Maria Melanie Liberty
Reyes, Ma Elouisa
Qiu, Wenjuan
Potter, Beth K.
Little, Julian
Walker, Mark
Zhang, Lin
Padilla, Carmencita
Chakraborty, Pranesh
Wilson, Kumanan
External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants
title External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants
title_full External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants
title_fullStr External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants
title_full_unstemmed External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants
title_short External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants
title_sort external validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in east and south-east asian infants
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8160452/
https://www.ncbi.nlm.nih.gov/pubmed/34104876
http://dx.doi.org/10.12688/gatesopenres.13131.2
work_keys_str_mv AT hawkensteven externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT murphymaliasq externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT ducharmerobin externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT botaabrianne externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT wilsonlindsaya externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT chengwei externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT tumulakmaamjoy externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT alcausinmariamelanieliberty externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT reyesmaelouisa externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT qiuwenjuan externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT potterbethk externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT littlejulian externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT walkermark externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT zhanglin externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT padillacarmencita externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT chakrabortypranesh externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants
AT wilsonkumanan externalvalidationofmachinelearningmodelsincludingnewbornmetabolomicmarkersforpostnatalgestationalageestimationineastandsoutheastasianinfants