Cargando…

Predicting Postoperative Length of Stay for Isolated Coronary Artery Bypass Graft Patients Using Machine Learning

PURPOSE: Predictive analytics (PA) is a new trending approach in the field of healthcare that uses machine learning to build a prediction model using supervised learning algorithms. Isolated coronary artery bypass grafting (iCABG), an open-heart surgery, is commonly performed in the treatment of cor...

Descripción completa

Detalles Bibliográficos
Autores principales: Alshakhs, Fatima, Alharthi, Hana, Aslam, Nida, Khan, Irfan Ullah, Elasheri, Mohamed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7537993/
https://www.ncbi.nlm.nih.gov/pubmed/33061545
http://dx.doi.org/10.2147/IJGM.S250334
_version_ 1783590777817923584
author Alshakhs, Fatima
Alharthi, Hana
Aslam, Nida
Khan, Irfan Ullah
Elasheri, Mohamed
author_facet Alshakhs, Fatima
Alharthi, Hana
Aslam, Nida
Khan, Irfan Ullah
Elasheri, Mohamed
author_sort Alshakhs, Fatima
collection PubMed
description PURPOSE: Predictive analytics (PA) is a new trending approach in the field of healthcare that uses machine learning to build a prediction model using supervised learning algorithms. Isolated coronary artery bypass grafting (iCABG), an open-heart surgery, is commonly performed in the treatment of coronary heart disease. AIM: The aim of this study was to develop and evaluate a model to predict postoperative length of stay (PLoS) for iCABG patients using supervised machine learning techniques, and to identify the features with the highest contribution to the model. METHODS: This is a retrospective study that uses historic data of adult patients who underwent isolated CABG (iCABG). After initial data pre-processing, data imputation using the kNN method was applied. The study used five prediction models using Naïve Bayes, Decision Tree, Random Forest, Logistic Regression and k Nearest Neighbor algorithms. Data imbalance was managed using the following widely used methods: oversampling, undersampling, “Both”, and random over-sampling examples (ROSE). The features selection process was conducted using the Boruta method. Two techniques were applied to examine the performance of the models, (70%, 30%) split and cross-validation, respectively. Models were evaluated by comparing their performance using AUC and other metrics. RESULTS: In the final dataset, six distinct features and 621 instances were used to develop the models. A total of 20 models were developed using R statistical software. The model generated using Random Forest with “Both” resampling method and cross-validation technique was deemed the best fit (AUC=0.81; F1 score=0.82; and recall=0.82). Attributes found to be highly predictive of PLoS were pulmonary artery systolic, age, height, EuroScore II, intra-aortic balloon pump used, and complications during operation. CONCLUSION: This study demonstrates the significance and effectiveness of building a model that predicts PLoS for iCABG patients using patient specifications and pre-/intra-operative measures.
format Online
Article
Text
id pubmed-7537993
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-75379932020-10-14 Predicting Postoperative Length of Stay for Isolated Coronary Artery Bypass Graft Patients Using Machine Learning Alshakhs, Fatima Alharthi, Hana Aslam, Nida Khan, Irfan Ullah Elasheri, Mohamed Int J Gen Med Original Research PURPOSE: Predictive analytics (PA) is a new trending approach in the field of healthcare that uses machine learning to build a prediction model using supervised learning algorithms. Isolated coronary artery bypass grafting (iCABG), an open-heart surgery, is commonly performed in the treatment of coronary heart disease. AIM: The aim of this study was to develop and evaluate a model to predict postoperative length of stay (PLoS) for iCABG patients using supervised machine learning techniques, and to identify the features with the highest contribution to the model. METHODS: This is a retrospective study that uses historic data of adult patients who underwent isolated CABG (iCABG). After initial data pre-processing, data imputation using the kNN method was applied. The study used five prediction models using Naïve Bayes, Decision Tree, Random Forest, Logistic Regression and k Nearest Neighbor algorithms. Data imbalance was managed using the following widely used methods: oversampling, undersampling, “Both”, and random over-sampling examples (ROSE). The features selection process was conducted using the Boruta method. Two techniques were applied to examine the performance of the models, (70%, 30%) split and cross-validation, respectively. Models were evaluated by comparing their performance using AUC and other metrics. RESULTS: In the final dataset, six distinct features and 621 instances were used to develop the models. A total of 20 models were developed using R statistical software. The model generated using Random Forest with “Both” resampling method and cross-validation technique was deemed the best fit (AUC=0.81; F1 score=0.82; and recall=0.82). Attributes found to be highly predictive of PLoS were pulmonary artery systolic, age, height, EuroScore II, intra-aortic balloon pump used, and complications during operation. CONCLUSION: This study demonstrates the significance and effectiveness of building a model that predicts PLoS for iCABG patients using patient specifications and pre-/intra-operative measures. Dove 2020-10-02 /pmc/articles/PMC7537993/ /pubmed/33061545 http://dx.doi.org/10.2147/IJGM.S250334 Text en © 2020 Alshakhs et al. http://creativecommons.org/licenses/by-nc/3.0/ This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Alshakhs, Fatima
Alharthi, Hana
Aslam, Nida
Khan, Irfan Ullah
Elasheri, Mohamed
Predicting Postoperative Length of Stay for Isolated Coronary Artery Bypass Graft Patients Using Machine Learning
title Predicting Postoperative Length of Stay for Isolated Coronary Artery Bypass Graft Patients Using Machine Learning
title_full Predicting Postoperative Length of Stay for Isolated Coronary Artery Bypass Graft Patients Using Machine Learning
title_fullStr Predicting Postoperative Length of Stay for Isolated Coronary Artery Bypass Graft Patients Using Machine Learning
title_full_unstemmed Predicting Postoperative Length of Stay for Isolated Coronary Artery Bypass Graft Patients Using Machine Learning
title_short Predicting Postoperative Length of Stay for Isolated Coronary Artery Bypass Graft Patients Using Machine Learning
title_sort predicting postoperative length of stay for isolated coronary artery bypass graft patients using machine learning
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7537993/
https://www.ncbi.nlm.nih.gov/pubmed/33061545
http://dx.doi.org/10.2147/IJGM.S250334
work_keys_str_mv AT alshakhsfatima predictingpostoperativelengthofstayforisolatedcoronaryarterybypassgraftpatientsusingmachinelearning
AT alharthihana predictingpostoperativelengthofstayforisolatedcoronaryarterybypassgraftpatientsusingmachinelearning
AT aslamnida predictingpostoperativelengthofstayforisolatedcoronaryarterybypassgraftpatientsusingmachinelearning
AT khanirfanullah predictingpostoperativelengthofstayforisolatedcoronaryarterybypassgraftpatientsusingmachinelearning
AT elasherimohamed predictingpostoperativelengthofstayforisolatedcoronaryarterybypassgraftpatientsusingmachinelearning