Cargando…
Prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: A machine learning approach
BACKGROUND: Intensive care unit (ICU) patients demand continuous monitoring of several clinical and laboratory parameters that directly influence their medical progress and the staff’s decision-making. Those data are vital in the assistance of these patients, being already used by several scoring sy...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Baishideng Publishing Group Inc
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483004/ https://www.ncbi.nlm.nih.gov/pubmed/36160934 http://dx.doi.org/10.5492/wjccm.v11.i5.317 |
_version_ | 1784791576821628928 |
---|---|
author | Caires Silveira, Elena Mattos Pretti, Soraya Santos, Bruna Almeida Santos Corrêa, Caio Fellipe Madureira Silva, Leonardo Freire de Melo, Fabrício |
author_facet | Caires Silveira, Elena Mattos Pretti, Soraya Santos, Bruna Almeida Santos Corrêa, Caio Fellipe Madureira Silva, Leonardo Freire de Melo, Fabrício |
author_sort | Caires Silveira, Elena |
collection | PubMed |
description | BACKGROUND: Intensive care unit (ICU) patients demand continuous monitoring of several clinical and laboratory parameters that directly influence their medical progress and the staff’s decision-making. Those data are vital in the assistance of these patients, being already used by several scoring systems. In this context, machine learning approaches have been used for medical predictions based on clinical data, which includes patient outcomes. AIM: To develop a binary classifier for the outcome of death in ICU patients based on clinical and laboratory parameters, a set formed by 1087 instances and 50 variables from ICU patients admitted to the emergency department was obtained in the “WiDS (Women in Data Science) Datathon 2020: ICU Mortality Prediction” dataset. METHODS: For categorical variables, frequencies and risk ratios were calculated. Numerical variables were computed as means and standard deviations and Mann-Whitney U tests were performed. We then divided the data into a training (80%) and test (20%) set. The training set was used to train a predictive model based on the Random Forest algorithm and the test set was used to evaluate the predictive effectiveness of the model. RESULTS: A statistically significant association was identified between need for intubation, as well predominant systemic cardiovascular involvement, and hospital death. A number of the numerical variables analyzed (for instance Glasgow Coma Score punctuations, mean arterial pressure, temperature, pH, and lactate, creatinine, albumin and bilirubin values) were also significantly associated with death outcome. The proposed binary Random Forest classifier obtained on the test set (n = 218) had an accuracy of 80.28%, sensitivity of 81.82%, specificity of 79.43%, positive predictive value of 73.26%, negative predictive value of 84.85%, F1 score of 0.74, and area under the curve score of 0.85. The predictive variables of the greatest importance were the maximum and minimum lactate values, adding up to a predictive importance of 15.54%. CONCLUSION: We demonstrated the efficacy of a Random Forest machine learning algorithm for handling clinical and laboratory data from patients under intensive monitoring. Therefore, we endorse the emerging notion that machine learning has great potential to provide us support to critically question existing methodologies, allowing improvements that reduce mortality. |
format | Online Article Text |
id | pubmed-9483004 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Baishideng Publishing Group Inc |
record_format | MEDLINE/PubMed |
spelling | pubmed-94830042022-09-23 Prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: A machine learning approach Caires Silveira, Elena Mattos Pretti, Soraya Santos, Bruna Almeida Santos Corrêa, Caio Fellipe Madureira Silva, Leonardo Freire de Melo, Fabrício World J Crit Care Med Retrospective Study BACKGROUND: Intensive care unit (ICU) patients demand continuous monitoring of several clinical and laboratory parameters that directly influence their medical progress and the staff’s decision-making. Those data are vital in the assistance of these patients, being already used by several scoring systems. In this context, machine learning approaches have been used for medical predictions based on clinical data, which includes patient outcomes. AIM: To develop a binary classifier for the outcome of death in ICU patients based on clinical and laboratory parameters, a set formed by 1087 instances and 50 variables from ICU patients admitted to the emergency department was obtained in the “WiDS (Women in Data Science) Datathon 2020: ICU Mortality Prediction” dataset. METHODS: For categorical variables, frequencies and risk ratios were calculated. Numerical variables were computed as means and standard deviations and Mann-Whitney U tests were performed. We then divided the data into a training (80%) and test (20%) set. The training set was used to train a predictive model based on the Random Forest algorithm and the test set was used to evaluate the predictive effectiveness of the model. RESULTS: A statistically significant association was identified between need for intubation, as well predominant systemic cardiovascular involvement, and hospital death. A number of the numerical variables analyzed (for instance Glasgow Coma Score punctuations, mean arterial pressure, temperature, pH, and lactate, creatinine, albumin and bilirubin values) were also significantly associated with death outcome. The proposed binary Random Forest classifier obtained on the test set (n = 218) had an accuracy of 80.28%, sensitivity of 81.82%, specificity of 79.43%, positive predictive value of 73.26%, negative predictive value of 84.85%, F1 score of 0.74, and area under the curve score of 0.85. The predictive variables of the greatest importance were the maximum and minimum lactate values, adding up to a predictive importance of 15.54%. CONCLUSION: We demonstrated the efficacy of a Random Forest machine learning algorithm for handling clinical and laboratory data from patients under intensive monitoring. Therefore, we endorse the emerging notion that machine learning has great potential to provide us support to critically question existing methodologies, allowing improvements that reduce mortality. Baishideng Publishing Group Inc 2022-09-09 /pmc/articles/PMC9483004/ /pubmed/36160934 http://dx.doi.org/10.5492/wjccm.v11.i5.317 Text en ©The Author(s) 2022. Published by Baishideng Publishing Group Inc. All rights reserved. https://creativecommons.org/licenses/by-nc/4.0/This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. |
spellingShingle | Retrospective Study Caires Silveira, Elena Mattos Pretti, Soraya Santos, Bruna Almeida Santos Corrêa, Caio Fellipe Madureira Silva, Leonardo Freire de Melo, Fabrício Prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: A machine learning approach |
title | Prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: A machine learning approach |
title_full | Prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: A machine learning approach |
title_fullStr | Prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: A machine learning approach |
title_full_unstemmed | Prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: A machine learning approach |
title_short | Prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: A machine learning approach |
title_sort | prediction of hospital mortality in intensive care unit patients from clinical and laboratory data: a machine learning approach |
topic | Retrospective Study |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483004/ https://www.ncbi.nlm.nih.gov/pubmed/36160934 http://dx.doi.org/10.5492/wjccm.v11.i5.317 |
work_keys_str_mv | AT cairessilveiraelena predictionofhospitalmortalityinintensivecareunitpatientsfromclinicalandlaboratorydataamachinelearningapproach AT mattosprettisoraya predictionofhospitalmortalityinintensivecareunitpatientsfromclinicalandlaboratorydataamachinelearningapproach AT santosbrunaalmeida predictionofhospitalmortalityinintensivecareunitpatientsfromclinicalandlaboratorydataamachinelearningapproach AT santoscorreacaiofellipe predictionofhospitalmortalityinintensivecareunitpatientsfromclinicalandlaboratorydataamachinelearningapproach AT madureirasilvaleonardo predictionofhospitalmortalityinintensivecareunitpatientsfromclinicalandlaboratorydataamachinelearningapproach AT freiredemelofabricio predictionofhospitalmortalityinintensivecareunitpatientsfromclinicalandlaboratorydataamachinelearningapproach |