Cargando…

Machine Learning Readmission Risk Modeling: A Pediatric Case Study

BACKGROUND: Hospital readmission prediction in pediatric hospitals has received little attention. Studies have focused on the readmission frequency analysis stratified by disease and demographic/geographic characteristics but there are no predictive modeling approaches, which may be useful to identi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wolff, Patricio, Graña, Manuel, Ríos, Sebastián A., Yarza, Maria Begoña
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6500604/
https://www.ncbi.nlm.nih.gov/pubmed/31139655
http://dx.doi.org/10.1155/2019/8532892
_version_ 1783415978243129344
author Wolff, Patricio
Graña, Manuel
Ríos, Sebastián A.
Yarza, Maria Begoña
author_facet Wolff, Patricio
Graña, Manuel
Ríos, Sebastián A.
Yarza, Maria Begoña
author_sort Wolff, Patricio
collection PubMed
description BACKGROUND: Hospital readmission prediction in pediatric hospitals has received little attention. Studies have focused on the readmission frequency analysis stratified by disease and demographic/geographic characteristics but there are no predictive modeling approaches, which may be useful to identify preventable readmissions that constitute a major portion of the cost attributed to readmissions. OBJECTIVE: To assess the all-cause readmission predictive performance achieved by machine learning techniques in the emergency department of a pediatric hospital in Santiago, Chile. MATERIALS: An all-cause admissions dataset has been collected along six consecutive years in a pediatric hospital in Santiago, Chile. The variables collected are the same used for the determination of the child's treatment administrative cost. METHODS: Retrospective predictive analysis of 30-day readmission was formulated as a binary classification problem. We report classification results achieved with various model building approaches after data curation and preprocessing for correction of class imbalance. We compute repeated cross-validation (RCV) with decreasing number of folders to assess performance and sensitivity to effect of imbalance in the test set and training set size. RESULTS: Increase in recall due to SMOTE class imbalance correction is large and statistically significant. The Naive Bayes (NB) approach achieves the best AUC (0.65); however the shallow multilayer perceptron has the best PPV and f-score (5.6 and 10.2, resp.). The NB and support vector machines (SVM) give comparable results if we consider AUC, PPV, and f-score ranking for all RCV experiments. High recall of deep multilayer perceptron is due to high false positive ratio. There is no detectable effect of the number of folds in the RCV on the predictive performance of the algorithms. CONCLUSIONS: We recommend the use of Naive Bayes (NB) with Gaussian distribution model as the most robust modeling approach for pediatric readmission prediction, achieving the best results across all training dataset sizes. The results show that the approach could be applied to detect preventable readmissions.
format Online
Article
Text
id pubmed-6500604
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-65006042019-05-28 Machine Learning Readmission Risk Modeling: A Pediatric Case Study Wolff, Patricio Graña, Manuel Ríos, Sebastián A. Yarza, Maria Begoña Biomed Res Int Research Article BACKGROUND: Hospital readmission prediction in pediatric hospitals has received little attention. Studies have focused on the readmission frequency analysis stratified by disease and demographic/geographic characteristics but there are no predictive modeling approaches, which may be useful to identify preventable readmissions that constitute a major portion of the cost attributed to readmissions. OBJECTIVE: To assess the all-cause readmission predictive performance achieved by machine learning techniques in the emergency department of a pediatric hospital in Santiago, Chile. MATERIALS: An all-cause admissions dataset has been collected along six consecutive years in a pediatric hospital in Santiago, Chile. The variables collected are the same used for the determination of the child's treatment administrative cost. METHODS: Retrospective predictive analysis of 30-day readmission was formulated as a binary classification problem. We report classification results achieved with various model building approaches after data curation and preprocessing for correction of class imbalance. We compute repeated cross-validation (RCV) with decreasing number of folders to assess performance and sensitivity to effect of imbalance in the test set and training set size. RESULTS: Increase in recall due to SMOTE class imbalance correction is large and statistically significant. The Naive Bayes (NB) approach achieves the best AUC (0.65); however the shallow multilayer perceptron has the best PPV and f-score (5.6 and 10.2, resp.). The NB and support vector machines (SVM) give comparable results if we consider AUC, PPV, and f-score ranking for all RCV experiments. High recall of deep multilayer perceptron is due to high false positive ratio. There is no detectable effect of the number of folds in the RCV on the predictive performance of the algorithms. CONCLUSIONS: We recommend the use of Naive Bayes (NB) with Gaussian distribution model as the most robust modeling approach for pediatric readmission prediction, achieving the best results across all training dataset sizes. The results show that the approach could be applied to detect preventable readmissions. Hindawi 2019-04-15 /pmc/articles/PMC6500604/ /pubmed/31139655 http://dx.doi.org/10.1155/2019/8532892 Text en Copyright © 2019 Patricio Wolff et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wolff, Patricio
Graña, Manuel
Ríos, Sebastián A.
Yarza, Maria Begoña
Machine Learning Readmission Risk Modeling: A Pediatric Case Study
title Machine Learning Readmission Risk Modeling: A Pediatric Case Study
title_full Machine Learning Readmission Risk Modeling: A Pediatric Case Study
title_fullStr Machine Learning Readmission Risk Modeling: A Pediatric Case Study
title_full_unstemmed Machine Learning Readmission Risk Modeling: A Pediatric Case Study
title_short Machine Learning Readmission Risk Modeling: A Pediatric Case Study
title_sort machine learning readmission risk modeling: a pediatric case study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6500604/
https://www.ncbi.nlm.nih.gov/pubmed/31139655
http://dx.doi.org/10.1155/2019/8532892
work_keys_str_mv AT wolffpatricio machinelearningreadmissionriskmodelingapediatriccasestudy
AT granamanuel machinelearningreadmissionriskmodelingapediatriccasestudy
AT riossebastiana machinelearningreadmissionriskmodelingapediatriccasestudy
AT yarzamariabegona machinelearningreadmissionriskmodelingapediatriccasestudy