Cargando…

Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis

BACKGROUND: Predictions in pregnancy care are complex because of interactions among multiple factors. Hence, pregnancy outcomes are not easily predicted by a single predictor using only one algorithm or modeling method. OBJECTIVE: This study aims to review and compare the predictive performances bet...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sufriyana, Herdiantri, Husnayain, Atina, Chen, Ya-Lin, Kuo, Chao-Yang, Singh, Onkar, Yeh, Tso-Yang, Wu, Yu-Wei, Su, Emily Chia-Yu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2020
Materias:	Review
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708089/ https://www.ncbi.nlm.nih.gov/pubmed/33200995 http://dx.doi.org/10.2196/16503

_version_	1783617492455784448
author	Sufriyana, Herdiantri Husnayain, Atina Chen, Ya-Lin Kuo, Chao-Yang Singh, Onkar Yeh, Tso-Yang Wu, Yu-Wei Su, Emily Chia-Yu
author_facet	Sufriyana, Herdiantri Husnayain, Atina Chen, Ya-Lin Kuo, Chao-Yang Singh, Onkar Yeh, Tso-Yang Wu, Yu-Wei Su, Emily Chia-Yu
author_sort	Sufriyana, Herdiantri
collection	PubMed
description	BACKGROUND: Predictions in pregnancy care are complex because of interactions among multiple factors. Hence, pregnancy outcomes are not easily predicted by a single predictor using only one algorithm or modeling method. OBJECTIVE: This study aims to review and compare the predictive performances between logistic regression (LR) and other machine learning algorithms for developing or validating a multivariable prognostic prediction model for pregnancy care to inform clinicians’ decision making. METHODS: Research articles from MEDLINE, Scopus, Web of Science, and Google Scholar were reviewed following several guidelines for a prognostic prediction study, including a risk of bias (ROB) assessment. We report the results based on the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Studies were primarily framed as PICOTS (population, index, comparator, outcomes, timing, and setting): Population: men or women in procreative management, pregnant women, and fetuses or newborns; Index: multivariable prognostic prediction models using non-LR algorithms for risk classification to inform clinicians’ decision making; Comparator: the models applying an LR; Outcomes: pregnancy-related outcomes of procreation or pregnancy outcomes for pregnant women and fetuses or newborns; Timing: pre-, inter-, and peripregnancy periods (predictors), at the pregnancy, delivery, and either puerperal or neonatal period (outcome), and either short- or long-term prognoses (time interval); and Setting: primary care or hospital. The results were synthesized by reporting study characteristics and ROBs and by random effects modeling of the difference of the logit area under the receiver operating characteristic curve of each non-LR model compared with the LR model for the same pregnancy outcomes. We also reported between-study heterogeneity by using τ(2) and I(2). RESULTS: Of the 2093 records, we included 142 studies for the systematic review and 62 studies for a meta-analysis. Most prediction models used LR (92/142, 64.8%) and artificial neural networks (20/142, 14.1%) among non-LR algorithms. Only 16.9% (24/142) of studies had a low ROB. A total of 2 non-LR algorithms from low ROB studies significantly outperformed LR. The first algorithm was a random forest for preterm delivery (logit AUROC 2.51, 95% CI 1.49-3.53; I(2)=86%; τ(2)=0.77) and pre-eclampsia (logit AUROC 1.2, 95% CI 0.72-1.67; I(2)=75%; τ(2)=0.09). The second algorithm was gradient boosting for cesarean section (logit AUROC 2.26, 95% CI 1.39-3.13; I(2)=75%; τ(2)=0.43) and gestational diabetes (logit AUROC 1.03, 95% CI 0.69-1.37; I(2)=83%; τ(2)=0.07). CONCLUSIONS: Prediction models with the best performances across studies were not necessarily those that used LR but also used random forest and gradient boosting that also performed well. We recommend a reanalysis of existing LR models for several pregnancy outcomes by comparing them with those algorithms that apply standard guidelines. TRIAL REGISTRATION: PROSPERO (International Prospective Register of Systematic Reviews) CRD42019136106; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=136106
format	Online Article Text
id	pubmed-7708089
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-77080892020-12-04 Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis Sufriyana, Herdiantri Husnayain, Atina Chen, Ya-Lin Kuo, Chao-Yang Singh, Onkar Yeh, Tso-Yang Wu, Yu-Wei Su, Emily Chia-Yu JMIR Med Inform Review BACKGROUND: Predictions in pregnancy care are complex because of interactions among multiple factors. Hence, pregnancy outcomes are not easily predicted by a single predictor using only one algorithm or modeling method. OBJECTIVE: This study aims to review and compare the predictive performances between logistic regression (LR) and other machine learning algorithms for developing or validating a multivariable prognostic prediction model for pregnancy care to inform clinicians’ decision making. METHODS: Research articles from MEDLINE, Scopus, Web of Science, and Google Scholar were reviewed following several guidelines for a prognostic prediction study, including a risk of bias (ROB) assessment. We report the results based on the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Studies were primarily framed as PICOTS (population, index, comparator, outcomes, timing, and setting): Population: men or women in procreative management, pregnant women, and fetuses or newborns; Index: multivariable prognostic prediction models using non-LR algorithms for risk classification to inform clinicians’ decision making; Comparator: the models applying an LR; Outcomes: pregnancy-related outcomes of procreation or pregnancy outcomes for pregnant women and fetuses or newborns; Timing: pre-, inter-, and peripregnancy periods (predictors), at the pregnancy, delivery, and either puerperal or neonatal period (outcome), and either short- or long-term prognoses (time interval); and Setting: primary care or hospital. The results were synthesized by reporting study characteristics and ROBs and by random effects modeling of the difference of the logit area under the receiver operating characteristic curve of each non-LR model compared with the LR model for the same pregnancy outcomes. We also reported between-study heterogeneity by using τ(2) and I(2). RESULTS: Of the 2093 records, we included 142 studies for the systematic review and 62 studies for a meta-analysis. Most prediction models used LR (92/142, 64.8%) and artificial neural networks (20/142, 14.1%) among non-LR algorithms. Only 16.9% (24/142) of studies had a low ROB. A total of 2 non-LR algorithms from low ROB studies significantly outperformed LR. The first algorithm was a random forest for preterm delivery (logit AUROC 2.51, 95% CI 1.49-3.53; I(2)=86%; τ(2)=0.77) and pre-eclampsia (logit AUROC 1.2, 95% CI 0.72-1.67; I(2)=75%; τ(2)=0.09). The second algorithm was gradient boosting for cesarean section (logit AUROC 2.26, 95% CI 1.39-3.13; I(2)=75%; τ(2)=0.43) and gestational diabetes (logit AUROC 1.03, 95% CI 0.69-1.37; I(2)=83%; τ(2)=0.07). CONCLUSIONS: Prediction models with the best performances across studies were not necessarily those that used LR but also used random forest and gradient boosting that also performed well. We recommend a reanalysis of existing LR models for several pregnancy outcomes by comparing them with those algorithms that apply standard guidelines. TRIAL REGISTRATION: PROSPERO (International Prospective Register of Systematic Reviews) CRD42019136106; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=136106 JMIR Publications 2020-11-17 /pmc/articles/PMC7708089/ /pubmed/33200995 http://dx.doi.org/10.2196/16503 Text en ©Herdiantri Sufriyana, Atina Husnayain, Ya-Lin Chen, Chao-Yang Kuo, Onkar Singh, Tso-Yang Yeh, Yu-Wei Wu, Emily Chia-Yu Su. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 17.11.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Review Sufriyana, Herdiantri Husnayain, Atina Chen, Ya-Lin Kuo, Chao-Yang Singh, Onkar Yeh, Tso-Yang Wu, Yu-Wei Su, Emily Chia-Yu Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title	Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_full	Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_fullStr	Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_full_unstemmed	Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_short	Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_sort	comparison of multivariable logistic regression and other machine learning algorithms for prognostic prediction studies in pregnancy care: systematic review and meta-analysis
topic	Review
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708089/ https://www.ncbi.nlm.nih.gov/pubmed/33200995 http://dx.doi.org/10.2196/16503
work_keys_str_mv	AT sufriyanaherdiantri comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis AT husnayainatina comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis AT chenyalin comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis AT kuochaoyang comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis AT singhonkar comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis AT yehtsoyang comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis AT wuyuwei comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis AT suemilychiayu comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis

Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis

Ejemplares similares