Cargando…

Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2)

Introduction: Preterm babies are a vulnerable population that experience significant short and long-term morbidity. Rehospitalisations constitute an important, potentially modifiable adverse event in this population. Improving the ability of clinicians to identify those patients at the greatest risk...

Descripción completa

Detalles Bibliográficos
Autores principales:	Reed, Robert A., Morgan, Andrei S., Zeitlin, Jennifer, Jarreau, Pierre-Henri, Torchin, Héloïse, Pierrat, Véronique, Ancel, Pierre-Yves, Khoshnood, Babak
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Pediatrics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7886676/ https://www.ncbi.nlm.nih.gov/pubmed/33614539 http://dx.doi.org/10.3389/fped.2020.585868

_version_	1783651846076760064
author	Reed, Robert A. Morgan, Andrei S. Zeitlin, Jennifer Jarreau, Pierre-Henri Torchin, Héloïse Pierrat, Véronique Ancel, Pierre-Yves Khoshnood, Babak
author_facet	Reed, Robert A. Morgan, Andrei S. Zeitlin, Jennifer Jarreau, Pierre-Henri Torchin, Héloïse Pierrat, Véronique Ancel, Pierre-Yves Khoshnood, Babak
author_sort	Reed, Robert A.
collection	PubMed
description	Introduction: Preterm babies are a vulnerable population that experience significant short and long-term morbidity. Rehospitalisations constitute an important, potentially modifiable adverse event in this population. Improving the ability of clinicians to identify those patients at the greatest risk of rehospitalisation has the potential to improve outcomes and reduce costs. Machine-learning algorithms can provide potentially advantageous methods of prediction compared to conventional approaches like logistic regression. Objective: To compare two machine-learning methods (least absolute shrinkage and selection operator (LASSO) and random forest) to expert-opinion driven logistic regression modelling for predicting unplanned rehospitalisation within 30 days in a large French cohort of preterm babies. Design, Setting and Participants: This study used data derived exclusively from the population-based prospective cohort study of French preterm babies, EPIPAGE 2. Only those babies discharged home alive and whose parents completed the 1-year survey were eligible for inclusion in our study. All predictive models used a binary outcome, denoting a baby's status for an unplanned rehospitalisation within 30 days of discharge. Predictors included those quantifying clinical, treatment, maternal and socio-demographic factors. The predictive abilities of models constructed using LASSO and random forest algorithms were compared with a traditional logistic regression model. The logistic regression model comprised 10 predictors, selected by expert clinicians, while the LASSO and random forest included 75 predictors. Performance measures were derived using 10-fold cross-validation. Performance was quantified using area under the receiver operator characteristic curve, sensitivity, specificity, Tjur's coefficient of determination and calibration measures. Results: The rate of 30-day unplanned rehospitalisation in the eligible population used to construct the models was 9.1% (95% CI 8.2–10.1) (350/3,841). The random forest model demonstrated both an improved AUROC (0.65; 95% CI 0.59–0.7; p = 0.03) and specificity vs. logistic regression (AUROC 0.57; 95% CI 0.51–0.62, p = 0.04). The LASSO performed similarly (AUROC 0.59; 95% CI 0.53–0.65; p = 0.68) to logistic regression. Conclusions: Compared to an expert-specified logistic regression model, random forest offered improved prediction of 30-day unplanned rehospitalisation in preterm babies. However, all models offered relatively low levels of predictive ability, regardless of modelling method.
format	Online Article Text
id	pubmed-7886676
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-78866762021-02-18 Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2) Reed, Robert A. Morgan, Andrei S. Zeitlin, Jennifer Jarreau, Pierre-Henri Torchin, Héloïse Pierrat, Véronique Ancel, Pierre-Yves Khoshnood, Babak Front Pediatr Pediatrics Introduction: Preterm babies are a vulnerable population that experience significant short and long-term morbidity. Rehospitalisations constitute an important, potentially modifiable adverse event in this population. Improving the ability of clinicians to identify those patients at the greatest risk of rehospitalisation has the potential to improve outcomes and reduce costs. Machine-learning algorithms can provide potentially advantageous methods of prediction compared to conventional approaches like logistic regression. Objective: To compare two machine-learning methods (least absolute shrinkage and selection operator (LASSO) and random forest) to expert-opinion driven logistic regression modelling for predicting unplanned rehospitalisation within 30 days in a large French cohort of preterm babies. Design, Setting and Participants: This study used data derived exclusively from the population-based prospective cohort study of French preterm babies, EPIPAGE 2. Only those babies discharged home alive and whose parents completed the 1-year survey were eligible for inclusion in our study. All predictive models used a binary outcome, denoting a baby's status for an unplanned rehospitalisation within 30 days of discharge. Predictors included those quantifying clinical, treatment, maternal and socio-demographic factors. The predictive abilities of models constructed using LASSO and random forest algorithms were compared with a traditional logistic regression model. The logistic regression model comprised 10 predictors, selected by expert clinicians, while the LASSO and random forest included 75 predictors. Performance measures were derived using 10-fold cross-validation. Performance was quantified using area under the receiver operator characteristic curve, sensitivity, specificity, Tjur's coefficient of determination and calibration measures. Results: The rate of 30-day unplanned rehospitalisation in the eligible population used to construct the models was 9.1% (95% CI 8.2–10.1) (350/3,841). The random forest model demonstrated both an improved AUROC (0.65; 95% CI 0.59–0.7; p = 0.03) and specificity vs. logistic regression (AUROC 0.57; 95% CI 0.51–0.62, p = 0.04). The LASSO performed similarly (AUROC 0.59; 95% CI 0.53–0.65; p = 0.68) to logistic regression. Conclusions: Compared to an expert-specified logistic regression model, random forest offered improved prediction of 30-day unplanned rehospitalisation in preterm babies. However, all models offered relatively low levels of predictive ability, regardless of modelling method. Frontiers Media S.A. 2021-02-03 /pmc/articles/PMC7886676/ /pubmed/33614539 http://dx.doi.org/10.3389/fped.2020.585868 Text en Copyright © 2021 Reed, Morgan, Zeitlin, Jarreau, Torchin, Pierrat, Ancel and Khoshnood. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Pediatrics Reed, Robert A. Morgan, Andrei S. Zeitlin, Jennifer Jarreau, Pierre-Henri Torchin, Héloïse Pierrat, Véronique Ancel, Pierre-Yves Khoshnood, Babak Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2)
title	Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2)
title_full	Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2)
title_fullStr	Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2)
title_full_unstemmed	Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2)
title_short	Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2)
title_sort	machine-learning vs. expert-opinion driven logistic regression modelling for predicting 30-day unplanned rehospitalisation in preterm babies: a prospective, population-based study (epipage 2)
topic	Pediatrics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7886676/ https://www.ncbi.nlm.nih.gov/pubmed/33614539 http://dx.doi.org/10.3389/fped.2020.585868
work_keys_str_mv	AT reedroberta machinelearningvsexpertopiniondrivenlogisticregressionmodellingforpredicting30dayunplannedrehospitalisationinpretermbabiesaprospectivepopulationbasedstudyepipage2 AT morganandreis machinelearningvsexpertopiniondrivenlogisticregressionmodellingforpredicting30dayunplannedrehospitalisationinpretermbabiesaprospectivepopulationbasedstudyepipage2 AT zeitlinjennifer machinelearningvsexpertopiniondrivenlogisticregressionmodellingforpredicting30dayunplannedrehospitalisationinpretermbabiesaprospectivepopulationbasedstudyepipage2 AT jarreaupierrehenri machinelearningvsexpertopiniondrivenlogisticregressionmodellingforpredicting30dayunplannedrehospitalisationinpretermbabiesaprospectivepopulationbasedstudyepipage2 AT torchinheloise machinelearningvsexpertopiniondrivenlogisticregressionmodellingforpredicting30dayunplannedrehospitalisationinpretermbabiesaprospectivepopulationbasedstudyepipage2 AT pierratveronique machinelearningvsexpertopiniondrivenlogisticregressionmodellingforpredicting30dayunplannedrehospitalisationinpretermbabiesaprospectivepopulationbasedstudyepipage2 AT ancelpierreyves machinelearningvsexpertopiniondrivenlogisticregressionmodellingforpredicting30dayunplannedrehospitalisationinpretermbabiesaprospectivepopulationbasedstudyepipage2 AT khoshnoodbabak machinelearningvsexpertopiniondrivenlogisticregressionmodellingforpredicting30dayunplannedrehospitalisationinpretermbabiesaprospectivepopulationbasedstudyepipage2

Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2)

Ejemplares similares