Cargando…

Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms

OBJECTIVES: The study aimed to develop and compare predictive models based on supervised machine learning algorithms for predicting the prolonged length of stay (LOS) of hospitalized patients diagnosed with five different chronic conditions. METHODS: An administrative claim dataset (2008–2012) of a...

Descripción completa

Detalles Bibliográficos
Autores principales: Symum, Hasan, Zayas-Castro, José L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Medical Informatics 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7010949/
https://www.ncbi.nlm.nih.gov/pubmed/32082697
http://dx.doi.org/10.4258/hir.2020.26.1.20
_version_ 1783495973703516160
author Symum, Hasan
Zayas-Castro, José L.
author_facet Symum, Hasan
Zayas-Castro, José L.
author_sort Symum, Hasan
collection PubMed
description OBJECTIVES: The study aimed to develop and compare predictive models based on supervised machine learning algorithms for predicting the prolonged length of stay (LOS) of hospitalized patients diagnosed with five different chronic conditions. METHODS: An administrative claim dataset (2008–2012) of a regional network of nine hospitals in the Tampa Bay area, Florida, USA, was used to develop the prediction models. Features were extracted from the dataset using the International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) codes. Five learning algorithms, namely, decision tree C5.0, linear support vector machine (LSVM), k-nearest neighbors, random forest, and multi-layered artificial neural networks, were used to build the model with semi-supervised anomaly detection and two feature selection methods. Issues with the unbalanced nature of the dataset were resolved using the Synthetic Minority Over-sampling Technique (SMOTE). RESULTS: LSVM with wrapper feature selection performed moderately well for all patient cohorts. Using SMOTE to counter data imbalances triggered a tradeoff between the model's sensitivity and specificity, which can be masked under a similar area under the curve. The proposed aggregate rank selection approach resulted in a balanced performing model compared to other criteria. Finally, factors such as comorbidity conditions, source of admission, and payer types were associated with the increased risk of a prolonged LOS. CONCLUSIONS: Prolonged LOS is mostly associated with pre-intraoperative clinical and patient socioeconomic factors. Accurate patient identification with the risk of prolonged LOS using the selected model can provide hospitals a better tool for planning early discharge and resource allocation, thus reducing avoidable hospitalization costs.
format Online
Article
Text
id pubmed-7010949
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Korean Society of Medical Informatics
record_format MEDLINE/PubMed
spelling pubmed-70109492020-02-20 Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms Symum, Hasan Zayas-Castro, José L. Healthc Inform Res Original Article OBJECTIVES: The study aimed to develop and compare predictive models based on supervised machine learning algorithms for predicting the prolonged length of stay (LOS) of hospitalized patients diagnosed with five different chronic conditions. METHODS: An administrative claim dataset (2008–2012) of a regional network of nine hospitals in the Tampa Bay area, Florida, USA, was used to develop the prediction models. Features were extracted from the dataset using the International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) codes. Five learning algorithms, namely, decision tree C5.0, linear support vector machine (LSVM), k-nearest neighbors, random forest, and multi-layered artificial neural networks, were used to build the model with semi-supervised anomaly detection and two feature selection methods. Issues with the unbalanced nature of the dataset were resolved using the Synthetic Minority Over-sampling Technique (SMOTE). RESULTS: LSVM with wrapper feature selection performed moderately well for all patient cohorts. Using SMOTE to counter data imbalances triggered a tradeoff between the model's sensitivity and specificity, which can be masked under a similar area under the curve. The proposed aggregate rank selection approach resulted in a balanced performing model compared to other criteria. Finally, factors such as comorbidity conditions, source of admission, and payer types were associated with the increased risk of a prolonged LOS. CONCLUSIONS: Prolonged LOS is mostly associated with pre-intraoperative clinical and patient socioeconomic factors. Accurate patient identification with the risk of prolonged LOS using the selected model can provide hospitals a better tool for planning early discharge and resource allocation, thus reducing avoidable hospitalization costs. Korean Society of Medical Informatics 2020-01 2020-01-31 /pmc/articles/PMC7010949/ /pubmed/32082697 http://dx.doi.org/10.4258/hir.2020.26.1.20 Text en © 2020 The Korean Society of Medical Informatics http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Symum, Hasan
Zayas-Castro, José L.
Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms
title Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms
title_full Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms
title_fullStr Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms
title_full_unstemmed Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms
title_short Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms
title_sort prediction of chronic disease-related inpatient prolonged length of stay using machine learning algorithms
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7010949/
https://www.ncbi.nlm.nih.gov/pubmed/32082697
http://dx.doi.org/10.4258/hir.2020.26.1.20
work_keys_str_mv AT symumhasan predictionofchronicdiseaserelatedinpatientprolongedlengthofstayusingmachinelearningalgorithms
AT zayascastrojosel predictionofchronicdiseaserelatedinpatientprolongedlengthofstayusingmachinelearningalgorithms