Cargando…
Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database
Background: Mechanically ventilated patients in the intensive care unit (ICU) have high mortality rates. There are multiple prediction scores, such as the Simplified Acute Physiology Score II (SAPS II), Oxford Acute Severity of Illness Score (OASIS), and Sequential Organ Failure Assessment (SOFA), w...
Autores principales: | , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8280779/ https://www.ncbi.nlm.nih.gov/pubmed/34277655 http://dx.doi.org/10.3389/fmed.2021.662340 |
_version_ | 1783722711934042112 |
---|---|
author | Zhu, Yibing Zhang, Jin Wang, Guowei Yao, Renqi Ren, Chao Chen, Ge Jin, Xin Guo, Junyang Liu, Shi Zheng, Hua Chen, Yan Guo, Qianqian Li, Lin Du, Bin Xi, Xiuming Li, Wei Huang, Huibin Li, Yang Yu, Qian |
author_facet | Zhu, Yibing Zhang, Jin Wang, Guowei Yao, Renqi Ren, Chao Chen, Ge Jin, Xin Guo, Junyang Liu, Shi Zheng, Hua Chen, Yan Guo, Qianqian Li, Lin Du, Bin Xi, Xiuming Li, Wei Huang, Huibin Li, Yang Yu, Qian |
author_sort | Zhu, Yibing |
collection | PubMed |
description | Background: Mechanically ventilated patients in the intensive care unit (ICU) have high mortality rates. There are multiple prediction scores, such as the Simplified Acute Physiology Score II (SAPS II), Oxford Acute Severity of Illness Score (OASIS), and Sequential Organ Failure Assessment (SOFA), widely used in the general ICU population. We aimed to establish prediction scores on mechanically ventilated patients with the combination of these disease severity scores and other features available on the first day of admission. Methods: A retrospective administrative database study from the Medical Information Mart for Intensive Care (MIMIC-III) database was conducted. The exposures of interest consisted of the demographics, pre-ICU comorbidity, ICU diagnosis, disease severity scores, vital signs, and laboratory test results on the first day of ICU admission. Hospital mortality was used as the outcome. We used the machine learning methods of k-nearest neighbors (KNN), logistic regression, bagging, decision tree, random forest, Extreme Gradient Boosting (XGBoost), and neural network for model establishment. A sample of 70% of the cohort was used for the training set; the remaining 30% was applied for testing. Areas under the receiver operating characteristic curves (AUCs) and calibration plots would be constructed for the evaluation and comparison of the models' performance. The significance of the risk factors was identified through models and the top factors were reported. Results: A total of 28,530 subjects were enrolled through the screening of the MIMIC-III database. After data preprocessing, 25,659 adult patients with 66 predictors were included in the model analyses. With the training set, the models of KNN, logistic regression, decision tree, random forest, neural network, bagging, and XGBoost were established and the testing set obtained AUCs of 0.806, 0.818, 0.743, 0.819, 0.780, 0.803, and 0.821, respectively. The calibration curves of all the models, except for the neural network, performed well. The XGBoost model performed best among the seven models. The top five predictors were age, respiratory dysfunction, SAPS II score, maximum hemoglobin, and minimum lactate. Conclusion: The current study indicates that models with the risk of factors on the first day could be successfully established for predicting mortality in ventilated patients. The XGBoost model performs best among the seven machine learning models. |
format | Online Article Text |
id | pubmed-8280779 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82807792021-07-16 Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database Zhu, Yibing Zhang, Jin Wang, Guowei Yao, Renqi Ren, Chao Chen, Ge Jin, Xin Guo, Junyang Liu, Shi Zheng, Hua Chen, Yan Guo, Qianqian Li, Lin Du, Bin Xi, Xiuming Li, Wei Huang, Huibin Li, Yang Yu, Qian Front Med (Lausanne) Medicine Background: Mechanically ventilated patients in the intensive care unit (ICU) have high mortality rates. There are multiple prediction scores, such as the Simplified Acute Physiology Score II (SAPS II), Oxford Acute Severity of Illness Score (OASIS), and Sequential Organ Failure Assessment (SOFA), widely used in the general ICU population. We aimed to establish prediction scores on mechanically ventilated patients with the combination of these disease severity scores and other features available on the first day of admission. Methods: A retrospective administrative database study from the Medical Information Mart for Intensive Care (MIMIC-III) database was conducted. The exposures of interest consisted of the demographics, pre-ICU comorbidity, ICU diagnosis, disease severity scores, vital signs, and laboratory test results on the first day of ICU admission. Hospital mortality was used as the outcome. We used the machine learning methods of k-nearest neighbors (KNN), logistic regression, bagging, decision tree, random forest, Extreme Gradient Boosting (XGBoost), and neural network for model establishment. A sample of 70% of the cohort was used for the training set; the remaining 30% was applied for testing. Areas under the receiver operating characteristic curves (AUCs) and calibration plots would be constructed for the evaluation and comparison of the models' performance. The significance of the risk factors was identified through models and the top factors were reported. Results: A total of 28,530 subjects were enrolled through the screening of the MIMIC-III database. After data preprocessing, 25,659 adult patients with 66 predictors were included in the model analyses. With the training set, the models of KNN, logistic regression, decision tree, random forest, neural network, bagging, and XGBoost were established and the testing set obtained AUCs of 0.806, 0.818, 0.743, 0.819, 0.780, 0.803, and 0.821, respectively. The calibration curves of all the models, except for the neural network, performed well. The XGBoost model performed best among the seven models. The top five predictors were age, respiratory dysfunction, SAPS II score, maximum hemoglobin, and minimum lactate. Conclusion: The current study indicates that models with the risk of factors on the first day could be successfully established for predicting mortality in ventilated patients. The XGBoost model performs best among the seven machine learning models. Frontiers Media S.A. 2021-07-01 /pmc/articles/PMC8280779/ /pubmed/34277655 http://dx.doi.org/10.3389/fmed.2021.662340 Text en Copyright © 2021 Zhu, Zhang, Wang, Yao, Ren, Chen, Jin, Guo, Liu, Zheng, Chen, Guo, Li, Du, Xi, Li, Huang, Li and Yu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Medicine Zhu, Yibing Zhang, Jin Wang, Guowei Yao, Renqi Ren, Chao Chen, Ge Jin, Xin Guo, Junyang Liu, Shi Zheng, Hua Chen, Yan Guo, Qianqian Li, Lin Du, Bin Xi, Xiuming Li, Wei Huang, Huibin Li, Yang Yu, Qian Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database |
title | Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database |
title_full | Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database |
title_fullStr | Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database |
title_full_unstemmed | Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database |
title_short | Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database |
title_sort | machine learning prediction models for mechanically ventilated patients: analyses of the mimic-iii database |
topic | Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8280779/ https://www.ncbi.nlm.nih.gov/pubmed/34277655 http://dx.doi.org/10.3389/fmed.2021.662340 |
work_keys_str_mv | AT zhuyibing machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT zhangjin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT wangguowei machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT yaorenqi machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT renchao machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT chenge machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT jinxin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT guojunyang machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT liushi machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT zhenghua machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT chenyan machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT guoqianqian machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT lilin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT dubin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT xixiuming machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT liwei machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT huanghuibin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT liyang machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase AT yuqian machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase |