Cargando…
Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study
BACKGROUND: Traditional scoring systems for patients’ outcome prediction in intensive care units such as Oxygenation Saturation Index (OSI) and Oxygenation Index (OI) may not reliably predict the clinical prognosis of patients with acute respiratory distress syndrome (ARDS). Thus, none of them have...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246239/ https://www.ncbi.nlm.nih.gov/pubmed/34268407 http://dx.doi.org/10.21037/atm-20-6624 |
_version_ | 1783716273907040256 |
---|---|
author | Huang, Bingsheng Liang, Dong Zou, Rushi Yu, Xiaxia Dan, Guo Huang, Haofan Liu, Heng Liu, Yong |
author_facet | Huang, Bingsheng Liang, Dong Zou, Rushi Yu, Xiaxia Dan, Guo Huang, Haofan Liu, Heng Liu, Yong |
author_sort | Huang, Bingsheng |
collection | PubMed |
description | BACKGROUND: Traditional scoring systems for patients’ outcome prediction in intensive care units such as Oxygenation Saturation Index (OSI) and Oxygenation Index (OI) may not reliably predict the clinical prognosis of patients with acute respiratory distress syndrome (ARDS). Thus, none of them have been widely accepted for mortality prediction in ARDS. This study aimed to develop and validate a mortality prediction method for patients with ARDS based on machine learning using the Medical Information Mart for Intensive Care (MIMIC-III) and Telehealth Intensive Care Unit (eICU) Collaborative Research Database (eICU-CRD) databases. METHODS: Patients with ARDS were selected based on the Berlin definition in MIMIC-III and eICU-CRD databases. The APPS score (using age, PaO(2)/FiO(2), and plateau pressure), Simplified Acute Physiology Score II (SAPS-II), Sepsis-related Organ Failure Assessment (SOFA), OSI, and OI were calculated. With MIMIC-III data, a mortality prediction model was built based on the random forest (RF) algorithm, and the performance was compared to those of existing scoring systems based on logistic regression. The performance of the proposed RF method was also validated with the combined MIMIC-III and eICU-CRD data. The performance of mortality prediction was evaluated by using the area under the receiver operating characteristics curve (AUROC) and performing calibration using the Hosmer-Lemeshow test. RESULTS: With the MIMIC-III dataset (308 patients, for comparisons with the existing scoring systems), the RF model predicted the in-hospital mortality, 30-day mortality, and 1-year mortality with an AUROC of 0.891, 0.883, and 0.892, respectively, which were significantly higher than those of the SAPS-II, APPS, OSI, and OI (all P<0.001). In the multi-source validation (the combined dataset of 2,235 patients in MIMIC-III and 331 patients in eICU-CRD), the RF model achieved an AUROC of 0.905 and 0.736 for predicting in-hospital mortality for the MIMIC-III and eICU-CRD datasets, respectively. The calibration plots suggested good fits for our RF model and these scoring systems for predicting mortality. The platelet count and lactate level were the strongest predictive variables for predicting in-hospital mortality. CONCLUSIONS: Compared to the existing scoring systems, machine learning significantly improved performance for predicting ARDS mortality. Validation with multi-source datasets showed a relatively robust generalisation ability of our prediction model. |
format | Online Article Text |
id | pubmed-8246239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-82462392021-07-14 Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study Huang, Bingsheng Liang, Dong Zou, Rushi Yu, Xiaxia Dan, Guo Huang, Haofan Liu, Heng Liu, Yong Ann Transl Med Original Article BACKGROUND: Traditional scoring systems for patients’ outcome prediction in intensive care units such as Oxygenation Saturation Index (OSI) and Oxygenation Index (OI) may not reliably predict the clinical prognosis of patients with acute respiratory distress syndrome (ARDS). Thus, none of them have been widely accepted for mortality prediction in ARDS. This study aimed to develop and validate a mortality prediction method for patients with ARDS based on machine learning using the Medical Information Mart for Intensive Care (MIMIC-III) and Telehealth Intensive Care Unit (eICU) Collaborative Research Database (eICU-CRD) databases. METHODS: Patients with ARDS were selected based on the Berlin definition in MIMIC-III and eICU-CRD databases. The APPS score (using age, PaO(2)/FiO(2), and plateau pressure), Simplified Acute Physiology Score II (SAPS-II), Sepsis-related Organ Failure Assessment (SOFA), OSI, and OI were calculated. With MIMIC-III data, a mortality prediction model was built based on the random forest (RF) algorithm, and the performance was compared to those of existing scoring systems based on logistic regression. The performance of the proposed RF method was also validated with the combined MIMIC-III and eICU-CRD data. The performance of mortality prediction was evaluated by using the area under the receiver operating characteristics curve (AUROC) and performing calibration using the Hosmer-Lemeshow test. RESULTS: With the MIMIC-III dataset (308 patients, for comparisons with the existing scoring systems), the RF model predicted the in-hospital mortality, 30-day mortality, and 1-year mortality with an AUROC of 0.891, 0.883, and 0.892, respectively, which were significantly higher than those of the SAPS-II, APPS, OSI, and OI (all P<0.001). In the multi-source validation (the combined dataset of 2,235 patients in MIMIC-III and 331 patients in eICU-CRD), the RF model achieved an AUROC of 0.905 and 0.736 for predicting in-hospital mortality for the MIMIC-III and eICU-CRD datasets, respectively. The calibration plots suggested good fits for our RF model and these scoring systems for predicting mortality. The platelet count and lactate level were the strongest predictive variables for predicting in-hospital mortality. CONCLUSIONS: Compared to the existing scoring systems, machine learning significantly improved performance for predicting ARDS mortality. Validation with multi-source datasets showed a relatively robust generalisation ability of our prediction model. AME Publishing Company 2021-05 /pmc/articles/PMC8246239/ /pubmed/34268407 http://dx.doi.org/10.21037/atm-20-6624 Text en 2021 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Huang, Bingsheng Liang, Dong Zou, Rushi Yu, Xiaxia Dan, Guo Huang, Haofan Liu, Heng Liu, Yong Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study |
title | Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study |
title_full | Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study |
title_fullStr | Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study |
title_full_unstemmed | Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study |
title_short | Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study |
title_sort | mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246239/ https://www.ncbi.nlm.nih.gov/pubmed/34268407 http://dx.doi.org/10.21037/atm-20-6624 |
work_keys_str_mv | AT huangbingsheng mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy AT liangdong mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy AT zourushi mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy AT yuxiaxia mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy AT danguo mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy AT huanghaofan mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy AT liuheng mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy AT liuyong mortalitypredictionforpatientswithacuterespiratorydistresssyndromebasedonmachinelearningapopulationbasedstudy |