Cargando…

Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach

BACKGROUND: Advanced machine learning models have received wide attention in assisting medical decision making due to the greater accuracy they can achieve. However, their limited interpretability imposes barriers for practitioners to adopt them. Recent advancements in interpretable machine learning...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Xiaoquan, Alam, Sabriya, Shi, Pengyi, Dexter, Franklin, Kong, Nan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243084/
https://www.ncbi.nlm.nih.gov/pubmed/37277767
http://dx.doi.org/10.1186/s12911-023-02193-5
_version_ 1785054355316015104
author Gao, Xiaoquan
Alam, Sabriya
Shi, Pengyi
Dexter, Franklin
Kong, Nan
author_facet Gao, Xiaoquan
Alam, Sabriya
Shi, Pengyi
Dexter, Franklin
Kong, Nan
author_sort Gao, Xiaoquan
collection PubMed
description BACKGROUND: Advanced machine learning models have received wide attention in assisting medical decision making due to the greater accuracy they can achieve. However, their limited interpretability imposes barriers for practitioners to adopt them. Recent advancements in interpretable machine learning tools allow us to look inside the black box of advanced prediction methods to extract interpretable models while maintaining similar prediction accuracy, but few studies have investigated the specific hospital readmission prediction problem with this spirit. METHODS: Our goal is to develop a machine-learning (ML) algorithm that can predict 30- and 90- day hospital readmissions as accurately as black box algorithms while providing medically interpretable insights into readmission risk factors. Leveraging a state-of-art interpretable ML model, we use a two-step Extracted Regression Tree approach to achieve this goal. In the first step, we train a black box prediction algorithm. In the second step, we extract a regression tree from the output of the black box algorithm that allows direct interpretation of medically relevant risk factors. We use data from a large teaching hospital in Asia to learn the ML model and verify our two-step approach. RESULTS: The two-step method can obtain similar prediction performance as the best black box model, such as Neural Networks, measured by three metrics: accuracy, the Area Under the Curve (AUC) and the Area Under the Precision-Recall Curve (AUPRC), while maintaining interpretability. Further, to examine whether the prediction results match the known medical insights (i.e., the model is truly interpretable and produces reasonable results), we show that key readmission risk factors extracted by the two-step approach are consistent with those found in the medical literature. CONCLUSIONS: The proposed two-step approach yields meaningful prediction results that are both accurate and interpretable. This study suggests a viable means to improve the trust of machine learning based models in clinical practice for predicting readmissions through the two-step approach. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02193-5.
format Online
Article
Text
id pubmed-10243084
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102430842023-06-07 Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach Gao, Xiaoquan Alam, Sabriya Shi, Pengyi Dexter, Franklin Kong, Nan BMC Med Inform Decis Mak Research BACKGROUND: Advanced machine learning models have received wide attention in assisting medical decision making due to the greater accuracy they can achieve. However, their limited interpretability imposes barriers for practitioners to adopt them. Recent advancements in interpretable machine learning tools allow us to look inside the black box of advanced prediction methods to extract interpretable models while maintaining similar prediction accuracy, but few studies have investigated the specific hospital readmission prediction problem with this spirit. METHODS: Our goal is to develop a machine-learning (ML) algorithm that can predict 30- and 90- day hospital readmissions as accurately as black box algorithms while providing medically interpretable insights into readmission risk factors. Leveraging a state-of-art interpretable ML model, we use a two-step Extracted Regression Tree approach to achieve this goal. In the first step, we train a black box prediction algorithm. In the second step, we extract a regression tree from the output of the black box algorithm that allows direct interpretation of medically relevant risk factors. We use data from a large teaching hospital in Asia to learn the ML model and verify our two-step approach. RESULTS: The two-step method can obtain similar prediction performance as the best black box model, such as Neural Networks, measured by three metrics: accuracy, the Area Under the Curve (AUC) and the Area Under the Precision-Recall Curve (AUPRC), while maintaining interpretability. Further, to examine whether the prediction results match the known medical insights (i.e., the model is truly interpretable and produces reasonable results), we show that key readmission risk factors extracted by the two-step approach are consistent with those found in the medical literature. CONCLUSIONS: The proposed two-step approach yields meaningful prediction results that are both accurate and interpretable. This study suggests a viable means to improve the trust of machine learning based models in clinical practice for predicting readmissions through the two-step approach. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02193-5. BioMed Central 2023-06-05 /pmc/articles/PMC10243084/ /pubmed/37277767 http://dx.doi.org/10.1186/s12911-023-02193-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Gao, Xiaoquan
Alam, Sabriya
Shi, Pengyi
Dexter, Franklin
Kong, Nan
Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach
title Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach
title_full Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach
title_fullStr Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach
title_full_unstemmed Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach
title_short Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach
title_sort interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243084/
https://www.ncbi.nlm.nih.gov/pubmed/37277767
http://dx.doi.org/10.1186/s12911-023-02193-5
work_keys_str_mv AT gaoxiaoquan interpretablemachinelearningmodelsforhospitalreadmissionpredictionatwostepextractedregressiontreeapproach
AT alamsabriya interpretablemachinelearningmodelsforhospitalreadmissionpredictionatwostepextractedregressiontreeapproach
AT shipengyi interpretablemachinelearningmodelsforhospitalreadmissionpredictionatwostepextractedregressiontreeapproach
AT dexterfranklin interpretablemachinelearningmodelsforhospitalreadmissionpredictionatwostepextractedregressiontreeapproach
AT kongnan interpretablemachinelearningmodelsforhospitalreadmissionpredictionatwostepextractedregressiontreeapproach