Cargando…

Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm

The length of hospital stay (LOS) is a significant indicator of the quality of patient care, hospital efficiency, and operational resilience. Considering the importance of LOS in hospital resource management, this research aims to improve the accuracy of LOS prediction using hyperparameter optimizat...

Descripción completa

Detalles Bibliográficos
Autores principales: Mansoori, Atefeh, Zeinalnezhad, Masoomeh, Nazarimanesh, Leila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9943622/
https://www.ncbi.nlm.nih.gov/pubmed/36824405
http://dx.doi.org/10.1155/2023/9673395
_version_ 1784891746515156992
author Mansoori, Atefeh
Zeinalnezhad, Masoomeh
Nazarimanesh, Leila
author_facet Mansoori, Atefeh
Zeinalnezhad, Masoomeh
Nazarimanesh, Leila
author_sort Mansoori, Atefeh
collection PubMed
description The length of hospital stay (LOS) is a significant indicator of the quality of patient care, hospital efficiency, and operational resilience. Considering the importance of LOS in hospital resource management, this research aims to improve the accuracy of LOS prediction using hyperparameter optimization (HPO). Expert physicians and related studies were reviewed to determine the variables affecting LOS. The electronic medical records of 200 patients in the department of internal medicine of a hospital in Iran were collected randomly. As the performance of machine learning (ML) models can vary based on the characteristics of the features, several models were applied and evaluated in this study. In particular, k-nearest neighbors (KNN), multivariate regression, decision tree (DT), random forest (RF), artificial neural network (ANN), and XGBoost have been evaluated and improved. The genetic algorithm (GA) was applied to optimize the tree-based models. In addition, the dummy coding technique, sometimes called the One-Hot encoding, was used to encode categorical features to increase prediction accuracy. Compared with other algorithms, the XGBoost model optimized by GA (XGB_GA) achieved higher accuracy and better prediction performance. The mean and median of absolute errors in the test dataset for this model were 1.54 and 1.14 days, respectively. In other words, the XGB_GA model reduced the mean absolute error by 37%, which is beneficial in the reliable design of a clinical decision support system.
format Online
Article
Text
id pubmed-9943622
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-99436222023-02-22 Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm Mansoori, Atefeh Zeinalnezhad, Masoomeh Nazarimanesh, Leila J Healthc Eng Research Article The length of hospital stay (LOS) is a significant indicator of the quality of patient care, hospital efficiency, and operational resilience. Considering the importance of LOS in hospital resource management, this research aims to improve the accuracy of LOS prediction using hyperparameter optimization (HPO). Expert physicians and related studies were reviewed to determine the variables affecting LOS. The electronic medical records of 200 patients in the department of internal medicine of a hospital in Iran were collected randomly. As the performance of machine learning (ML) models can vary based on the characteristics of the features, several models were applied and evaluated in this study. In particular, k-nearest neighbors (KNN), multivariate regression, decision tree (DT), random forest (RF), artificial neural network (ANN), and XGBoost have been evaluated and improved. The genetic algorithm (GA) was applied to optimize the tree-based models. In addition, the dummy coding technique, sometimes called the One-Hot encoding, was used to encode categorical features to increase prediction accuracy. Compared with other algorithms, the XGBoost model optimized by GA (XGB_GA) achieved higher accuracy and better prediction performance. The mean and median of absolute errors in the test dataset for this model were 1.54 and 1.14 days, respectively. In other words, the XGB_GA model reduced the mean absolute error by 37%, which is beneficial in the reliable design of a clinical decision support system. Hindawi 2023-02-14 /pmc/articles/PMC9943622/ /pubmed/36824405 http://dx.doi.org/10.1155/2023/9673395 Text en Copyright © 2023 Atefeh Mansoori et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mansoori, Atefeh
Zeinalnezhad, Masoomeh
Nazarimanesh, Leila
Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm
title Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm
title_full Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm
title_fullStr Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm
title_full_unstemmed Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm
title_short Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm
title_sort optimization of tree-based machine learning models to predict the length of hospital stay using genetic algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9943622/
https://www.ncbi.nlm.nih.gov/pubmed/36824405
http://dx.doi.org/10.1155/2023/9673395
work_keys_str_mv AT mansooriatefeh optimizationoftreebasedmachinelearningmodelstopredictthelengthofhospitalstayusinggeneticalgorithm
AT zeinalnezhadmasoomeh optimizationoftreebasedmachinelearningmodelstopredictthelengthofhospitalstayusinggeneticalgorithm
AT nazarimaneshleila optimizationoftreebasedmachinelearningmodelstopredictthelengthofhospitalstayusinggeneticalgorithm