Cargando…

Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease

INTRODUCTION: Previous work had shown that machine learning models can predict inflammatory bowel disease (IBD)-related hospitalizations and outpatient corticosteroid use based on patient demographic and laboratory data in a cohort of United States Veterans. This study aimed to replicate this modeli...

Descripción completa

Detalles Bibliográficos
Autores principales: Gan, Ryan W., Sun, Diana, Tatro, Amanda R., Cohen-Mekelburg, Shirley, Wiitala, Wyndy L., Zhu, Ji, Waljee, Akbar K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8452029/
https://www.ncbi.nlm.nih.gov/pubmed/34543353
http://dx.doi.org/10.1371/journal.pone.0257520
_version_ 1784569980662054912
author Gan, Ryan W.
Sun, Diana
Tatro, Amanda R.
Cohen-Mekelburg, Shirley
Wiitala, Wyndy L.
Zhu, Ji
Waljee, Akbar K.
author_facet Gan, Ryan W.
Sun, Diana
Tatro, Amanda R.
Cohen-Mekelburg, Shirley
Wiitala, Wyndy L.
Zhu, Ji
Waljee, Akbar K.
author_sort Gan, Ryan W.
collection PubMed
description INTRODUCTION: Previous work had shown that machine learning models can predict inflammatory bowel disease (IBD)-related hospitalizations and outpatient corticosteroid use based on patient demographic and laboratory data in a cohort of United States Veterans. This study aimed to replicate this modeling framework in a nationally representative cohort. METHODS: A retrospective cohort design using Optum Electronic Health Records (EHR) were used to identify IBD patients, with at least 12 months of follow-up between 2007 and 2018. IBD flare was defined as an inpatient/emergency visit with a diagnosis of IBD or an outpatient corticosteroid prescription for IBD. Predictors included demographic and laboratory data. Logistic regression and random forest (RF) models were used to predict IBD flare within 6 months of each visit. A 70% training and 30% validation approach was used. RESULTS: A total of 95,878 patients across 780,559 visits were identified. Of these, 22,245 (23.2%) patients had at least one IBD flare. Patients were predominantly White (87.7%) and female (57.1%), with a mean age of 48.0 years. The logistic regression model had an area under the receiver operating curve (AuROC) of 0.66 (95% CI: 0.65−0.66), sensitivity of 0.69 (95% CI: 0.68−0.70), and specificity of 0.74 (95% CI: 0.73−0.74) in the validation cohort. The RF model had an AuROC of 0.80 (95% CI: 0.80−0.81), sensitivity of 0.74 (95% CI: 0.73−0.74), and specificity of 0.72 (95% CI: 0.72−0.72) in the validation cohort. Important predictors of IBD flare in the RF model were the number of previous flares, age, potassium, and white blood cell count. CONCLUSION: The machine learning modeling framework was replicated and results showed a similar predictive accuracy in a nationally representative cohort of IBD patients. This modeling framework could be embedded in routine practice as a tool to distinguish high-risk patients for disease activity.
format Online
Article
Text
id pubmed-8452029
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-84520292021-09-21 Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease Gan, Ryan W. Sun, Diana Tatro, Amanda R. Cohen-Mekelburg, Shirley Wiitala, Wyndy L. Zhu, Ji Waljee, Akbar K. PLoS One Research Article INTRODUCTION: Previous work had shown that machine learning models can predict inflammatory bowel disease (IBD)-related hospitalizations and outpatient corticosteroid use based on patient demographic and laboratory data in a cohort of United States Veterans. This study aimed to replicate this modeling framework in a nationally representative cohort. METHODS: A retrospective cohort design using Optum Electronic Health Records (EHR) were used to identify IBD patients, with at least 12 months of follow-up between 2007 and 2018. IBD flare was defined as an inpatient/emergency visit with a diagnosis of IBD or an outpatient corticosteroid prescription for IBD. Predictors included demographic and laboratory data. Logistic regression and random forest (RF) models were used to predict IBD flare within 6 months of each visit. A 70% training and 30% validation approach was used. RESULTS: A total of 95,878 patients across 780,559 visits were identified. Of these, 22,245 (23.2%) patients had at least one IBD flare. Patients were predominantly White (87.7%) and female (57.1%), with a mean age of 48.0 years. The logistic regression model had an area under the receiver operating curve (AuROC) of 0.66 (95% CI: 0.65−0.66), sensitivity of 0.69 (95% CI: 0.68−0.70), and specificity of 0.74 (95% CI: 0.73−0.74) in the validation cohort. The RF model had an AuROC of 0.80 (95% CI: 0.80−0.81), sensitivity of 0.74 (95% CI: 0.73−0.74), and specificity of 0.72 (95% CI: 0.72−0.72) in the validation cohort. Important predictors of IBD flare in the RF model were the number of previous flares, age, potassium, and white blood cell count. CONCLUSION: The machine learning modeling framework was replicated and results showed a similar predictive accuracy in a nationally representative cohort of IBD patients. This modeling framework could be embedded in routine practice as a tool to distinguish high-risk patients for disease activity. Public Library of Science 2021-09-20 /pmc/articles/PMC8452029/ /pubmed/34543353 http://dx.doi.org/10.1371/journal.pone.0257520 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Gan, Ryan W.
Sun, Diana
Tatro, Amanda R.
Cohen-Mekelburg, Shirley
Wiitala, Wyndy L.
Zhu, Ji
Waljee, Akbar K.
Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease
title Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease
title_full Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease
title_fullStr Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease
title_full_unstemmed Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease
title_short Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease
title_sort replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8452029/
https://www.ncbi.nlm.nih.gov/pubmed/34543353
http://dx.doi.org/10.1371/journal.pone.0257520
work_keys_str_mv AT ganryanw replicatingpredictionalgorithmsforhospitalizationandcorticosteroiduseinpatientswithinflammatoryboweldisease
AT sundiana replicatingpredictionalgorithmsforhospitalizationandcorticosteroiduseinpatientswithinflammatoryboweldisease
AT tatroamandar replicatingpredictionalgorithmsforhospitalizationandcorticosteroiduseinpatientswithinflammatoryboweldisease
AT cohenmekelburgshirley replicatingpredictionalgorithmsforhospitalizationandcorticosteroiduseinpatientswithinflammatoryboweldisease
AT wiitalawyndyl replicatingpredictionalgorithmsforhospitalizationandcorticosteroiduseinpatientswithinflammatoryboweldisease
AT zhuji replicatingpredictionalgorithmsforhospitalizationandcorticosteroiduseinpatientswithinflammatoryboweldisease
AT waljeeakbark replicatingpredictionalgorithmsforhospitalizationandcorticosteroiduseinpatientswithinflammatoryboweldisease