Cargando…

Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study

BACKGROUND: Traditional scoring systems for patients’ outcome prediction in intensive care units such as Oxygenation Saturation Index (OSI) and Oxygenation Index (OI) may not reliably predict the clinical prognosis of patients with acute respiratory distress syndrome (ARDS). Thus, none of them have...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Bingsheng, Liang, Dong, Zou, Rushi, Yu, Xiaxia, Dan, Guo, Huang, Haofan, Liu, Heng, Liu, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246239/
https://www.ncbi.nlm.nih.gov/pubmed/34268407
http://dx.doi.org/10.21037/atm-20-6624
_version_ 1783716273907040256
author Huang, Bingsheng
Liang, Dong
Zou, Rushi
Yu, Xiaxia
Dan, Guo
Huang, Haofan
Liu, Heng
Liu, Yong
author_facet Huang, Bingsheng
Liang, Dong
Zou, Rushi
Yu, Xiaxia
Dan, Guo
Huang, Haofan
Liu, Heng
Liu, Yong
author_sort Huang, Bingsheng
collection PubMed
description BACKGROUND: Traditional scoring systems for patients’ outcome prediction in intensive care units such as Oxygenation Saturation Index (OSI) and Oxygenation Index (OI) may not reliably predict the clinical prognosis of patients with acute respiratory distress syndrome (ARDS). Thus, none of them have been widely accepted for mortality prediction in ARDS. This study aimed to develop and validate a mortality prediction method for patients with ARDS based on machine learning using the Medical Information Mart for Intensive Care (MIMIC-III) and Telehealth Intensive Care Unit (eICU) Collaborative Research Database (eICU-CRD) databases. METHODS: Patients with ARDS were selected based on the Berlin definition in MIMIC-III and eICU-CRD databases. The APPS score (using age, PaO(2)/FiO(2), and plateau pressure), Simplified Acute Physiology Score II (SAPS-II), Sepsis-related Organ Failure Assessment (SOFA), OSI, and OI were calculated. With MIMIC-III data, a mortality prediction model was built based on the random forest (RF) algorithm, and the performance was compared to those of existing scoring systems based on logistic regression. The performance of the proposed RF method was also validated with the combined MIMIC-III and eICU-CRD data. The performance of mortality prediction was evaluated by using the area under the receiver operating characteristics curve (AUROC) and performing calibration using the Hosmer-Lemeshow test. RESULTS: With the MIMIC-III dataset (308 patients, for comparisons with the existing scoring systems), the RF model predicted the in-hospital mortality, 30-day mortality, and 1-year mortality with an AUROC of 0.891, 0.883, and 0.892, respectively, which were significantly higher than those of the SAPS-II, APPS, OSI, and OI (all P<0.001). In the multi-source validation (the combined dataset of 2,235 patients in MIMIC-III and 331 patients in eICU-CRD), the RF model achieved an AUROC of 0.905 and 0.736 for predicting in-hospital mortality for the MIMIC-III and eICU-CRD datasets, respectively. The calibration plots suggested good fits for our RF model and these scoring systems for predicting mortality. The platelet count and lactate level were the strongest predictive variables for predicting in-hospital mortality. CONCLUSIONS: Compared to the existing scoring systems, machine learning significantly improved performance for predicting ARDS mortality. Validation with multi-source datasets showed a relatively robust generalisation ability of our prediction model.
format Online
Article
Text
id pubmed-8246239
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-82462392021-07-14 Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study Huang, Bingsheng Liang, Dong Zou, Rushi Yu, Xiaxia Dan, Guo Huang, Haofan Liu, Heng Liu, Yong Ann Transl Med Original Article BACKGROUND: Traditional scoring systems for patients’ outcome prediction in intensive care units such as Oxygenation Saturation Index (OSI) and Oxygenation Index (OI) may not reliably predict the clinical prognosis of patients with acute respiratory distress syndrome (ARDS). Thus, none of them have been widely accepted for mortality prediction in ARDS. This study aimed to develop and validate a mortality prediction method for patients with ARDS based on machine learning using the Medical Information Mart for Intensive Care (MIMIC-III) and Telehealth Intensive Care Unit (eICU) Collaborative Research Database (eICU-CRD) databases. METHODS: Patients with ARDS were selected based on the Berlin definition in MIMIC-III and eICU-CRD databases. The APPS score (using age, PaO(2)/FiO(2), and plateau pressure), Simplified Acute Physiology Score II (SAPS-II), Sepsis-related Organ Failure Assessment (SOFA), OSI, and OI were calculated. With MIMIC-III data, a mortality prediction model was built based on the random forest (RF) algorithm, and the performance was compared to those of existing scoring systems based on logistic regression. The performance of the proposed RF method was also validated with the combined MIMIC-III and eICU-CRD data. The performance of mortality prediction was evaluated by using the area under the receiver operating characteristics curve (AUROC) and performing calibration using the Hosmer-Lemeshow test. RESULTS: With the MIMIC-III dataset (308 patients, for comparisons with the existing scoring systems), the RF model predicted the in-hospital mortality, 30-day mortality, and 1-year mortality with an AUROC of 0.891, 0.883, and 0.892, respectively, which were significantly higher than those of the SAPS-II, APPS, OSI, and OI (all P<0.001). In the multi-source validation (the combined dataset of 2,235 patients in MIMIC-III and 331 patients in eICU-CRD), the RF model achieved an AUROC of 0.905 and 0.736 for predicting in-hospital mortality for the MIMIC-III and eICU-CRD datasets, respectively. The calibration plots suggested good fits for our RF model and these scoring systems for predicting mortality. The platelet count and lactate level were the strongest predictive variables for predicting in-hospital mortality. CONCLUSIONS: Compared to the existing scoring systems, machine learning significantly improved performance for predicting ARDS mortality. Validation with multi-source datasets showed a relatively robust generalisation ability of our prediction model. AME Publishing Company 2021-05 /pmc/articles/PMC8246239/ /pubmed/34268407 http://dx.doi.org/10.21037/atm-20-6624 Text en 2021 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Huang, Bingsheng
Liang, Dong
Zou, Rushi
Yu, Xiaxia
Dan, Guo
Huang, Haofan
Liu, Heng
Liu, Yong
Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study
title Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study
title_full Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study
title_fullStr Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study
title_full_unstemmed Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study
title_short Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study
title_sort mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246239/
https://www.ncbi.nlm.nih.gov/pubmed/34268407
http://dx.doi.org/10.21037/atm-20-6624
work_keys_str_mv AT huangbingsheng mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy
AT liangdong mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy
AT zourushi mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy
AT yuxiaxia mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy
AT danguo mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy
AT huanghaofan mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy
AT liuheng mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy
AT liuyong mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy