Cargando…

Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile

BACKGROUND:  Although many risk factors are well known, Clostridium difficile infection (CDI) continues to be a significant problem throughout the world. The purpose of this study was to develop and validate a data-driven, hospital-specific risk stratification procedure for estimating the probabilit...

Descripción completa

Detalles Bibliográficos
Autores principales: Wiens, Jenna, Campbell, Wayne N., Franklin, Ella S., Guttag, John V., Horvitz, Eric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4281796/
https://www.ncbi.nlm.nih.gov/pubmed/25734117
http://dx.doi.org/10.1093/ofid/ofu045
_version_ 1782351036655075328
author Wiens, Jenna
Campbell, Wayne N.
Franklin, Ella S.
Guttag, John V.
Horvitz, Eric
author_facet Wiens, Jenna
Campbell, Wayne N.
Franklin, Ella S.
Guttag, John V.
Horvitz, Eric
author_sort Wiens, Jenna
collection PubMed
description BACKGROUND:  Although many risk factors are well known, Clostridium difficile infection (CDI) continues to be a significant problem throughout the world. The purpose of this study was to develop and validate a data-driven, hospital-specific risk stratification procedure for estimating the probability that an inpatient will test positive for C difficile. METHODS:  We consider electronic medical record (EMR) data from patients admitted for ≥24 hours to a large urban hospital in the U.S. between April 2011 and April 2013. Predictive models were constructed using L2-regularized logistic regression and data from the first year. The number of observational variables considered varied from a small set of well known risk factors readily available to a physician to over 10 000 variables automatically extracted from the EMR. Each model was evaluated on holdout admission data from the following year. A total of 34 846 admissions with 372 cases of CDI was used to train the model. RESULTS:  Applied to the separate validation set of 34 722 admissions with 355 cases of CDI, the model that made use of the additional EMR data yielded an area under the receiver operating characteristic curve (AUROC) of 0.81 (95% confidence interval [CI], .79–.83), and it significantly outperformed the model that considered only the small set of known clinical risk factors, AUROC of 0.71 (95% CI, .69–.75). CONCLUSIONS:  Automated risk stratification of patients based on the contents of their EMRs can be used to accurately identify a high-risk population of patients. The proposed method holds promise for enabling the selective allocation of interventions aimed at reducing the rate of CDI.
format Online
Article
Text
id pubmed-4281796
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-42817962015-03-02 Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile Wiens, Jenna Campbell, Wayne N. Franklin, Ella S. Guttag, John V. Horvitz, Eric Open Forum Infect Dis Major Articles BACKGROUND:  Although many risk factors are well known, Clostridium difficile infection (CDI) continues to be a significant problem throughout the world. The purpose of this study was to develop and validate a data-driven, hospital-specific risk stratification procedure for estimating the probability that an inpatient will test positive for C difficile. METHODS:  We consider electronic medical record (EMR) data from patients admitted for ≥24 hours to a large urban hospital in the U.S. between April 2011 and April 2013. Predictive models were constructed using L2-regularized logistic regression and data from the first year. The number of observational variables considered varied from a small set of well known risk factors readily available to a physician to over 10 000 variables automatically extracted from the EMR. Each model was evaluated on holdout admission data from the following year. A total of 34 846 admissions with 372 cases of CDI was used to train the model. RESULTS:  Applied to the separate validation set of 34 722 admissions with 355 cases of CDI, the model that made use of the additional EMR data yielded an area under the receiver operating characteristic curve (AUROC) of 0.81 (95% confidence interval [CI], .79–.83), and it significantly outperformed the model that considered only the small set of known clinical risk factors, AUROC of 0.71 (95% CI, .69–.75). CONCLUSIONS:  Automated risk stratification of patients based on the contents of their EMRs can be used to accurately identify a high-risk population of patients. The proposed method holds promise for enabling the selective allocation of interventions aimed at reducing the rate of CDI. Oxford University Press 2014-07-15 /pmc/articles/PMC4281796/ /pubmed/25734117 http://dx.doi.org/10.1093/ofid/ofu045 Text en © The Author 2014. Published by Oxford University Press on behalf of the Infectious Diseases Society of America. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Major Articles
Wiens, Jenna
Campbell, Wayne N.
Franklin, Ella S.
Guttag, John V.
Horvitz, Eric
Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile
title Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile
title_full Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile
title_fullStr Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile
title_full_unstemmed Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile
title_short Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile
title_sort learning data-driven patient risk stratification models for clostridium difficile
topic Major Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4281796/
https://www.ncbi.nlm.nih.gov/pubmed/25734117
http://dx.doi.org/10.1093/ofid/ofu045
work_keys_str_mv AT wiensjenna learningdatadrivenpatientriskstratificationmodelsforclostridiumdifficile
AT campbellwaynen learningdatadrivenpatientriskstratificationmodelsforclostridiumdifficile
AT franklinellas learningdatadrivenpatientriskstratificationmodelsforclostridiumdifficile
AT guttagjohnv learningdatadrivenpatientriskstratificationmodelsforclostridiumdifficile
AT horvitzeric learningdatadrivenpatientriskstratificationmodelsforclostridiumdifficile