Cargando…

Machine learning in data abstraction: A computable phenotype for sepsis and septic shock diagnosis in the intensive care unit

BACKGROUND: With the recent change in the definition (Sepsis-3 Definition) of sepsis and septic shock, an electronic search algorithm was required to identify the cases for data automation. This supervised machine learning method would help screen a large amount of electronic medical records (EMR) f...

Descripción completa

Detalles Bibliográficos
Autores principales: Dhungana, Prabij, Serafim, Laura Piccolo, Ruiz, Arnaldo Lopez, Bruns, Danette, Weister, Timothy J, Smischney, Nathan Jerome, Kashyap, Rahul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Baishideng Publishing Group Inc 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6918045/
https://www.ncbi.nlm.nih.gov/pubmed/31853447
http://dx.doi.org/10.5492/wjccm.v8.i7.120
_version_ 1783480501982461952
author Dhungana, Prabij
Serafim, Laura Piccolo
Ruiz, Arnaldo Lopez
Bruns, Danette
Weister, Timothy J
Smischney, Nathan Jerome
Kashyap, Rahul
author_facet Dhungana, Prabij
Serafim, Laura Piccolo
Ruiz, Arnaldo Lopez
Bruns, Danette
Weister, Timothy J
Smischney, Nathan Jerome
Kashyap, Rahul
author_sort Dhungana, Prabij
collection PubMed
description BACKGROUND: With the recent change in the definition (Sepsis-3 Definition) of sepsis and septic shock, an electronic search algorithm was required to identify the cases for data automation. This supervised machine learning method would help screen a large amount of electronic medical records (EMR) for efficient research purposes. AIM: To develop and validate a computable phenotype via supervised machine learning method for retrospectively identifying sepsis and septic shock in critical care patients. METHODS: A supervised machine learning method was developed based on culture orders, Sequential Organ Failure Assessment (SOFA) scores, serum lactate levels and vasopressor use in the intensive care units (ICUs). The computable phenotype was derived from a retrospective analysis of a random cohort of 100 patients admitted to the medical ICU. This was then validated in an independent cohort of 100 patients. We compared the results from computable phenotype to a gold standard by manual review of EMR by 2 blinded reviewers. Disagreement was resolved by a critical care clinician. A SOFA score ≥ 2 during the ICU stay with a culture 72 h before or after the time of admission was identified. Sepsis versions as V1 was defined as blood cultures with SOFA ≥ 2 and Sepsis V2 was defined as any culture with SOFA score ≥ 2. A serum lactate level ≥ 2 mmol/L from 24 h before admission till their stay in the ICU and vasopressor use with Sepsis-1 and-2 were identified as Septic Shock-V1 and-V2 respectively. RESULTS: In the derivation subset of 100 random patients, the final machine learning strategy achieved a sensitivity-specificity of 100% and 84% for Sepsis-1, 100% and 95% for Sepsis-2, 78% and 80% for Septic Shock-1, and 80% and 90% for Septic Shock-2. An overall percent of agreement between two blinded reviewers had a k = 0.86 and 0.90 for Sepsis 2 and Septic shock 2 respectively. In validation of the algorithm through a separate 100 random patient subset, the reported sensitivity and specificity for all 4 diagnoses were 100%-100% each. CONCLUSION: Supervised machine learning for identification of sepsis and septic shock is reliable and an efficient alternative to manual chart review.
format Online
Article
Text
id pubmed-6918045
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Baishideng Publishing Group Inc
record_format MEDLINE/PubMed
spelling pubmed-69180452019-12-18 Machine learning in data abstraction: A computable phenotype for sepsis and septic shock diagnosis in the intensive care unit Dhungana, Prabij Serafim, Laura Piccolo Ruiz, Arnaldo Lopez Bruns, Danette Weister, Timothy J Smischney, Nathan Jerome Kashyap, Rahul World J Crit Care Med Retrospective Cohort Study BACKGROUND: With the recent change in the definition (Sepsis-3 Definition) of sepsis and septic shock, an electronic search algorithm was required to identify the cases for data automation. This supervised machine learning method would help screen a large amount of electronic medical records (EMR) for efficient research purposes. AIM: To develop and validate a computable phenotype via supervised machine learning method for retrospectively identifying sepsis and septic shock in critical care patients. METHODS: A supervised machine learning method was developed based on culture orders, Sequential Organ Failure Assessment (SOFA) scores, serum lactate levels and vasopressor use in the intensive care units (ICUs). The computable phenotype was derived from a retrospective analysis of a random cohort of 100 patients admitted to the medical ICU. This was then validated in an independent cohort of 100 patients. We compared the results from computable phenotype to a gold standard by manual review of EMR by 2 blinded reviewers. Disagreement was resolved by a critical care clinician. A SOFA score ≥ 2 during the ICU stay with a culture 72 h before or after the time of admission was identified. Sepsis versions as V1 was defined as blood cultures with SOFA ≥ 2 and Sepsis V2 was defined as any culture with SOFA score ≥ 2. A serum lactate level ≥ 2 mmol/L from 24 h before admission till their stay in the ICU and vasopressor use with Sepsis-1 and-2 were identified as Septic Shock-V1 and-V2 respectively. RESULTS: In the derivation subset of 100 random patients, the final machine learning strategy achieved a sensitivity-specificity of 100% and 84% for Sepsis-1, 100% and 95% for Sepsis-2, 78% and 80% for Septic Shock-1, and 80% and 90% for Septic Shock-2. An overall percent of agreement between two blinded reviewers had a k = 0.86 and 0.90 for Sepsis 2 and Septic shock 2 respectively. In validation of the algorithm through a separate 100 random patient subset, the reported sensitivity and specificity for all 4 diagnoses were 100%-100% each. CONCLUSION: Supervised machine learning for identification of sepsis and septic shock is reliable and an efficient alternative to manual chart review. Baishideng Publishing Group Inc 2019-11-19 /pmc/articles/PMC6918045/ /pubmed/31853447 http://dx.doi.org/10.5492/wjccm.v8.i7.120 Text en ©The Author(s) 2019. Published by Baishideng Publishing Group Inc. All rights reserved. http://creativecommons.org/licenses/by-nc/4.0/ This article is an open-access article which was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial.
spellingShingle Retrospective Cohort Study
Dhungana, Prabij
Serafim, Laura Piccolo
Ruiz, Arnaldo Lopez
Bruns, Danette
Weister, Timothy J
Smischney, Nathan Jerome
Kashyap, Rahul
Machine learning in data abstraction: A computable phenotype for sepsis and septic shock diagnosis in the intensive care unit
title Machine learning in data abstraction: A computable phenotype for sepsis and septic shock diagnosis in the intensive care unit
title_full Machine learning in data abstraction: A computable phenotype for sepsis and septic shock diagnosis in the intensive care unit
title_fullStr Machine learning in data abstraction: A computable phenotype for sepsis and septic shock diagnosis in the intensive care unit
title_full_unstemmed Machine learning in data abstraction: A computable phenotype for sepsis and septic shock diagnosis in the intensive care unit
title_short Machine learning in data abstraction: A computable phenotype for sepsis and septic shock diagnosis in the intensive care unit
title_sort machine learning in data abstraction: a computable phenotype for sepsis and septic shock diagnosis in the intensive care unit
topic Retrospective Cohort Study
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6918045/
https://www.ncbi.nlm.nih.gov/pubmed/31853447
http://dx.doi.org/10.5492/wjccm.v8.i7.120
work_keys_str_mv AT dhunganaprabij machinelearningindataabstractionacomputablephenotypeforsepsisandsepticshockdiagnosisintheintensivecareunit
AT serafimlaurapiccolo machinelearningindataabstractionacomputablephenotypeforsepsisandsepticshockdiagnosisintheintensivecareunit
AT ruizarnaldolopez machinelearningindataabstractionacomputablephenotypeforsepsisandsepticshockdiagnosisintheintensivecareunit
AT brunsdanette machinelearningindataabstractionacomputablephenotypeforsepsisandsepticshockdiagnosisintheintensivecareunit
AT weistertimothyj machinelearningindataabstractionacomputablephenotypeforsepsisandsepticshockdiagnosisintheintensivecareunit
AT smischneynathanjerome machinelearningindataabstractionacomputablephenotypeforsepsisandsepticshockdiagnosisintheintensivecareunit
AT kashyaprahul machinelearningindataabstractionacomputablephenotypeforsepsisandsepticshockdiagnosisintheintensivecareunit