Cargando…

Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database

Background: Mechanically ventilated patients are susceptible to nosocomial infections such as ventilator-associated pneumonia. To treat ventilated patients with suspected infection, clinicians select appropriate antibiotics. However, decision-making regarding the use of antibiotics for methicillin-r...

Descripción completa

Detalles Bibliográficos
Autores principales: Hirano, Yohei, Shinmoto, Keito, Okada, Yohei, Suga, Kazuhiro, Bombard, Jeffrey, Murahata, Shogo, Shrestha, Manoj, Ocheja, Patrick, Tanaka, Aiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8635043/
https://www.ncbi.nlm.nih.gov/pubmed/34869405
http://dx.doi.org/10.3389/fmed.2021.694520
_version_ 1784608225226653696
author Hirano, Yohei
Shinmoto, Keito
Okada, Yohei
Suga, Kazuhiro
Bombard, Jeffrey
Murahata, Shogo
Shrestha, Manoj
Ocheja, Patrick
Tanaka, Aiko
author_facet Hirano, Yohei
Shinmoto, Keito
Okada, Yohei
Suga, Kazuhiro
Bombard, Jeffrey
Murahata, Shogo
Shrestha, Manoj
Ocheja, Patrick
Tanaka, Aiko
author_sort Hirano, Yohei
collection PubMed
description Background: Mechanically ventilated patients are susceptible to nosocomial infections such as ventilator-associated pneumonia. To treat ventilated patients with suspected infection, clinicians select appropriate antibiotics. However, decision-making regarding the use of antibiotics for methicillin-resistant Staphylococcus aureus (MRSA) is challenging, because of the lack of evidence-supported criteria. This study aims to derive a machine learning model to predict MRSA as a possible pathogen responsible for infection in mechanically ventilated patients. Methods: Data were collected from the Medical Information Mart for Intensive Care (MIMIC)-IV database (an openly available database of patients treated at the Beth Israel Deaconess Medical Center in the period 2008–2019). Of 26,409 mechanically ventilated patients, 809 were screened for MRSA during the mechanical ventilation period and included in the study. The outcome was positivity to MRSA on screening, which was highly imbalanced in the dataset, with 93.9% positive outcomes. Therefore, after dividing the dataset into a training set (n = 566) and a test set (n = 243) for validation by stratified random sampling with a 7:3 allocation ratio, synthetic datasets with 50% positive outcomes were created by synthetic minority over-sampling for both sets individually (synthetic training set: n = 1,064; synthetic test set: n = 456). Using these synthetic datasets, we trained and validated an XGBoost machine learning model using 28 predictor variables for outcome prediction. Model performance was evaluated by area under the receiver operating characteristic (AUROC), sensitivity, specificity, and other statistical measurements. Feature importance was computed by the Gini method. Results: In validation, the XGBoost model demonstrated reliable outcome prediction with an AUROC value of 0.89 [95% confidence interval (CI): 0.83–0.95]. The model showed a high sensitivity of 0.98 [CI: 0.95–0.99], but a low specificity of 0.47 [CI: 0.41–0.54] and a positive predictive value of 0.65 [CI: 0.62–0.68]. Important predictor variables included admission from the emergency department, insertion of arterial lines, prior quinolone use, hemodialysis, and admission to a surgical intensive care unit. Conclusions: We were able to develop an effective machine learning model to predict positive MRSA screening during mechanical ventilation using synthetic datasets, thus encouraging further research to develop a clinically relevant machine learning model for antibiotics stewardship.
format Online
Article
Text
id pubmed-8635043
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86350432021-12-02 Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database Hirano, Yohei Shinmoto, Keito Okada, Yohei Suga, Kazuhiro Bombard, Jeffrey Murahata, Shogo Shrestha, Manoj Ocheja, Patrick Tanaka, Aiko Front Med (Lausanne) Medicine Background: Mechanically ventilated patients are susceptible to nosocomial infections such as ventilator-associated pneumonia. To treat ventilated patients with suspected infection, clinicians select appropriate antibiotics. However, decision-making regarding the use of antibiotics for methicillin-resistant Staphylococcus aureus (MRSA) is challenging, because of the lack of evidence-supported criteria. This study aims to derive a machine learning model to predict MRSA as a possible pathogen responsible for infection in mechanically ventilated patients. Methods: Data were collected from the Medical Information Mart for Intensive Care (MIMIC)-IV database (an openly available database of patients treated at the Beth Israel Deaconess Medical Center in the period 2008–2019). Of 26,409 mechanically ventilated patients, 809 were screened for MRSA during the mechanical ventilation period and included in the study. The outcome was positivity to MRSA on screening, which was highly imbalanced in the dataset, with 93.9% positive outcomes. Therefore, after dividing the dataset into a training set (n = 566) and a test set (n = 243) for validation by stratified random sampling with a 7:3 allocation ratio, synthetic datasets with 50% positive outcomes were created by synthetic minority over-sampling for both sets individually (synthetic training set: n = 1,064; synthetic test set: n = 456). Using these synthetic datasets, we trained and validated an XGBoost machine learning model using 28 predictor variables for outcome prediction. Model performance was evaluated by area under the receiver operating characteristic (AUROC), sensitivity, specificity, and other statistical measurements. Feature importance was computed by the Gini method. Results: In validation, the XGBoost model demonstrated reliable outcome prediction with an AUROC value of 0.89 [95% confidence interval (CI): 0.83–0.95]. The model showed a high sensitivity of 0.98 [CI: 0.95–0.99], but a low specificity of 0.47 [CI: 0.41–0.54] and a positive predictive value of 0.65 [CI: 0.62–0.68]. Important predictor variables included admission from the emergency department, insertion of arterial lines, prior quinolone use, hemodialysis, and admission to a surgical intensive care unit. Conclusions: We were able to develop an effective machine learning model to predict positive MRSA screening during mechanical ventilation using synthetic datasets, thus encouraging further research to develop a clinically relevant machine learning model for antibiotics stewardship. Frontiers Media S.A. 2021-11-16 /pmc/articles/PMC8635043/ /pubmed/34869405 http://dx.doi.org/10.3389/fmed.2021.694520 Text en Copyright © 2021 Hirano, Shinmoto, Okada, Suga, Bombard, Murahata, Shrestha, Ocheja and Tanaka. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Hirano, Yohei
Shinmoto, Keito
Okada, Yohei
Suga, Kazuhiro
Bombard, Jeffrey
Murahata, Shogo
Shrestha, Manoj
Ocheja, Patrick
Tanaka, Aiko
Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_full Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_fullStr Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_full_unstemmed Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_short Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_sort machine learning approach to predict positive screening of methicillin-resistant staphylococcus aureus during mechanical ventilation using synthetic dataset from mimic-iv database
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8635043/
https://www.ncbi.nlm.nih.gov/pubmed/34869405
http://dx.doi.org/10.3389/fmed.2021.694520
work_keys_str_mv AT hiranoyohei machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT shinmotokeito machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT okadayohei machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT sugakazuhiro machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT bombardjeffrey machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT murahatashogo machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT shresthamanoj machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT ochejapatrick machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT tanakaaiko machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase