Cargando…

Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database

Background: Mechanically ventilated patients in the intensive care unit (ICU) have high mortality rates. There are multiple prediction scores, such as the Simplified Acute Physiology Score II (SAPS II), Oxford Acute Severity of Illness Score (OASIS), and Sequential Organ Failure Assessment (SOFA), w...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yibing, Zhang, Jin, Wang, Guowei, Yao, Renqi, Ren, Chao, Chen, Ge, Jin, Xin, Guo, Junyang, Liu, Shi, Zheng, Hua, Chen, Yan, Guo, Qianqian, Li, Lin, Du, Bin, Xi, Xiuming, Li, Wei, Huang, Huibin, Li, Yang, Yu, Qian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8280779/
https://www.ncbi.nlm.nih.gov/pubmed/34277655
http://dx.doi.org/10.3389/fmed.2021.662340
_version_ 1783722711934042112
author Zhu, Yibing
Zhang, Jin
Wang, Guowei
Yao, Renqi
Ren, Chao
Chen, Ge
Jin, Xin
Guo, Junyang
Liu, Shi
Zheng, Hua
Chen, Yan
Guo, Qianqian
Li, Lin
Du, Bin
Xi, Xiuming
Li, Wei
Huang, Huibin
Li, Yang
Yu, Qian
author_facet Zhu, Yibing
Zhang, Jin
Wang, Guowei
Yao, Renqi
Ren, Chao
Chen, Ge
Jin, Xin
Guo, Junyang
Liu, Shi
Zheng, Hua
Chen, Yan
Guo, Qianqian
Li, Lin
Du, Bin
Xi, Xiuming
Li, Wei
Huang, Huibin
Li, Yang
Yu, Qian
author_sort Zhu, Yibing
collection PubMed
description Background: Mechanically ventilated patients in the intensive care unit (ICU) have high mortality rates. There are multiple prediction scores, such as the Simplified Acute Physiology Score II (SAPS II), Oxford Acute Severity of Illness Score (OASIS), and Sequential Organ Failure Assessment (SOFA), widely used in the general ICU population. We aimed to establish prediction scores on mechanically ventilated patients with the combination of these disease severity scores and other features available on the first day of admission. Methods: A retrospective administrative database study from the Medical Information Mart for Intensive Care (MIMIC-III) database was conducted. The exposures of interest consisted of the demographics, pre-ICU comorbidity, ICU diagnosis, disease severity scores, vital signs, and laboratory test results on the first day of ICU admission. Hospital mortality was used as the outcome. We used the machine learning methods of k-nearest neighbors (KNN), logistic regression, bagging, decision tree, random forest, Extreme Gradient Boosting (XGBoost), and neural network for model establishment. A sample of 70% of the cohort was used for the training set; the remaining 30% was applied for testing. Areas under the receiver operating characteristic curves (AUCs) and calibration plots would be constructed for the evaluation and comparison of the models' performance. The significance of the risk factors was identified through models and the top factors were reported. Results: A total of 28,530 subjects were enrolled through the screening of the MIMIC-III database. After data preprocessing, 25,659 adult patients with 66 predictors were included in the model analyses. With the training set, the models of KNN, logistic regression, decision tree, random forest, neural network, bagging, and XGBoost were established and the testing set obtained AUCs of 0.806, 0.818, 0.743, 0.819, 0.780, 0.803, and 0.821, respectively. The calibration curves of all the models, except for the neural network, performed well. The XGBoost model performed best among the seven models. The top five predictors were age, respiratory dysfunction, SAPS II score, maximum hemoglobin, and minimum lactate. Conclusion: The current study indicates that models with the risk of factors on the first day could be successfully established for predicting mortality in ventilated patients. The XGBoost model performs best among the seven machine learning models.
format Online
Article
Text
id pubmed-8280779
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82807792021-07-16 Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database Zhu, Yibing Zhang, Jin Wang, Guowei Yao, Renqi Ren, Chao Chen, Ge Jin, Xin Guo, Junyang Liu, Shi Zheng, Hua Chen, Yan Guo, Qianqian Li, Lin Du, Bin Xi, Xiuming Li, Wei Huang, Huibin Li, Yang Yu, Qian Front Med (Lausanne) Medicine Background: Mechanically ventilated patients in the intensive care unit (ICU) have high mortality rates. There are multiple prediction scores, such as the Simplified Acute Physiology Score II (SAPS II), Oxford Acute Severity of Illness Score (OASIS), and Sequential Organ Failure Assessment (SOFA), widely used in the general ICU population. We aimed to establish prediction scores on mechanically ventilated patients with the combination of these disease severity scores and other features available on the first day of admission. Methods: A retrospective administrative database study from the Medical Information Mart for Intensive Care (MIMIC-III) database was conducted. The exposures of interest consisted of the demographics, pre-ICU comorbidity, ICU diagnosis, disease severity scores, vital signs, and laboratory test results on the first day of ICU admission. Hospital mortality was used as the outcome. We used the machine learning methods of k-nearest neighbors (KNN), logistic regression, bagging, decision tree, random forest, Extreme Gradient Boosting (XGBoost), and neural network for model establishment. A sample of 70% of the cohort was used for the training set; the remaining 30% was applied for testing. Areas under the receiver operating characteristic curves (AUCs) and calibration plots would be constructed for the evaluation and comparison of the models' performance. The significance of the risk factors was identified through models and the top factors were reported. Results: A total of 28,530 subjects were enrolled through the screening of the MIMIC-III database. After data preprocessing, 25,659 adult patients with 66 predictors were included in the model analyses. With the training set, the models of KNN, logistic regression, decision tree, random forest, neural network, bagging, and XGBoost were established and the testing set obtained AUCs of 0.806, 0.818, 0.743, 0.819, 0.780, 0.803, and 0.821, respectively. The calibration curves of all the models, except for the neural network, performed well. The XGBoost model performed best among the seven models. The top five predictors were age, respiratory dysfunction, SAPS II score, maximum hemoglobin, and minimum lactate. Conclusion: The current study indicates that models with the risk of factors on the first day could be successfully established for predicting mortality in ventilated patients. The XGBoost model performs best among the seven machine learning models. Frontiers Media S.A. 2021-07-01 /pmc/articles/PMC8280779/ /pubmed/34277655 http://dx.doi.org/10.3389/fmed.2021.662340 Text en Copyright © 2021 Zhu, Zhang, Wang, Yao, Ren, Chen, Jin, Guo, Liu, Zheng, Chen, Guo, Li, Du, Xi, Li, Huang, Li and Yu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Zhu, Yibing
Zhang, Jin
Wang, Guowei
Yao, Renqi
Ren, Chao
Chen, Ge
Jin, Xin
Guo, Junyang
Liu, Shi
Zheng, Hua
Chen, Yan
Guo, Qianqian
Li, Lin
Du, Bin
Xi, Xiuming
Li, Wei
Huang, Huibin
Li, Yang
Yu, Qian
Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database
title Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database
title_full Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database
title_fullStr Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database
title_full_unstemmed Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database
title_short Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database
title_sort machine learning prediction models for mechanically ventilated patients: analyses of the mimic-iii database
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8280779/
https://www.ncbi.nlm.nih.gov/pubmed/34277655
http://dx.doi.org/10.3389/fmed.2021.662340
work_keys_str_mv AT zhuyibing machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT zhangjin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT wangguowei machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT yaorenqi machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT renchao machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT chenge machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT jinxin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT guojunyang machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT liushi machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT zhenghua machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT chenyan machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT guoqianqian machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT lilin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT dubin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT xixiuming machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT liwei machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT huanghuibin machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT liyang machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase
AT yuqian machinelearningpredictionmodelsformechanicallyventilatedpatientsanalysesofthemimiciiidatabase