Cargando…

Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease

INTRODUCTION: Hospitalization is the primary driver of inflammatory bowel disease (IBD)-related healthcare costs and morbidity. Traditional prediction models have poor performance at identifying patients at highest risk of unplanned healthcare utilization. Identification of patients who are high-nee...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Nghia H., Patel, Sagar, Gabunilas, Jason, Qian, Alexander S., Cecil, Alan, Jairath, Vipul, Sandborn, William J., Ohno-Machado, Lucila, Chen, Peter L., Singh, Siddharth
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10476830/
https://www.ncbi.nlm.nih.gov/pubmed/35905414
http://dx.doi.org/10.14309/ctg.0000000000000507
_version_ 1785101015083646976
author Nguyen, Nghia H.
Patel, Sagar
Gabunilas, Jason
Qian, Alexander S.
Cecil, Alan
Jairath, Vipul
Sandborn, William J.
Ohno-Machado, Lucila
Chen, Peter L.
Singh, Siddharth
author_facet Nguyen, Nghia H.
Patel, Sagar
Gabunilas, Jason
Qian, Alexander S.
Cecil, Alan
Jairath, Vipul
Sandborn, William J.
Ohno-Machado, Lucila
Chen, Peter L.
Singh, Siddharth
author_sort Nguyen, Nghia H.
collection PubMed
description INTRODUCTION: Hospitalization is the primary driver of inflammatory bowel disease (IBD)-related healthcare costs and morbidity. Traditional prediction models have poor performance at identifying patients at highest risk of unplanned healthcare utilization. Identification of patients who are high-need and high-cost (HNHC) could reduce unplanned healthcare utilization and healthcare costs. METHODS: We conducted a retrospective cohort study in adult patients hospitalized with IBD using the Nationwide Readmissions Database (model derivation in the 2013 Nationwide Readmission Database and validation in the 2017 Nationwide Readmission Database). We built 2 tree-based algorithms (decision tree classifier and decision tree using gradient boosting framework [XGBoost]) and compared traditional logistic regression to identify patients at risk for becoming HNHC (patients in the highest decile of total days spent in hospital in a calendar year). RESULTS: Of 47,402 adult patients hospitalized with IBD, we identified 4,717 HNHC patients. The decision tree classifier model (length of stay, Charlson Comorbidity Index, procedure, Frailty Risk Score, and age) had a mean area under the receiver operating characteristic curve (AUC) of 0.78 ± 0.01 in the derivation data set and 0.78 ± 0.02 in the validation data set. XGBoost (length of stay, procedure, chronic pain, drug abuse, and diabetic complication) had a mean AUC of 0.79 ± 0.01 and 0.75 ± 0.02 in the derivation and validation data sets, respectively, compared with AUC 0.55 ± 0.01 and 0.56 ± 0.01 with traditional logistic regression (peptic ulcer disease, paresthesia, admission for osteomyelitis, renal failure, and lymphoma) in derivation and validation data sets, respectively. DISCUSSION: In hospitalized patients with IBD, simplified tree-based machine learning algorithms using administrative claims data can accurately predict patients at risk of progressing to HNHC.
format Online
Article
Text
id pubmed-10476830
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Wolters Kluwer
record_format MEDLINE/PubMed
spelling pubmed-104768302023-09-05 Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease Nguyen, Nghia H. Patel, Sagar Gabunilas, Jason Qian, Alexander S. Cecil, Alan Jairath, Vipul Sandborn, William J. Ohno-Machado, Lucila Chen, Peter L. Singh, Siddharth Clin Transl Gastroenterol Article INTRODUCTION: Hospitalization is the primary driver of inflammatory bowel disease (IBD)-related healthcare costs and morbidity. Traditional prediction models have poor performance at identifying patients at highest risk of unplanned healthcare utilization. Identification of patients who are high-need and high-cost (HNHC) could reduce unplanned healthcare utilization and healthcare costs. METHODS: We conducted a retrospective cohort study in adult patients hospitalized with IBD using the Nationwide Readmissions Database (model derivation in the 2013 Nationwide Readmission Database and validation in the 2017 Nationwide Readmission Database). We built 2 tree-based algorithms (decision tree classifier and decision tree using gradient boosting framework [XGBoost]) and compared traditional logistic regression to identify patients at risk for becoming HNHC (patients in the highest decile of total days spent in hospital in a calendar year). RESULTS: Of 47,402 adult patients hospitalized with IBD, we identified 4,717 HNHC patients. The decision tree classifier model (length of stay, Charlson Comorbidity Index, procedure, Frailty Risk Score, and age) had a mean area under the receiver operating characteristic curve (AUC) of 0.78 ± 0.01 in the derivation data set and 0.78 ± 0.02 in the validation data set. XGBoost (length of stay, procedure, chronic pain, drug abuse, and diabetic complication) had a mean AUC of 0.79 ± 0.01 and 0.75 ± 0.02 in the derivation and validation data sets, respectively, compared with AUC 0.55 ± 0.01 and 0.56 ± 0.01 with traditional logistic regression (peptic ulcer disease, paresthesia, admission for osteomyelitis, renal failure, and lymphoma) in derivation and validation data sets, respectively. DISCUSSION: In hospitalized patients with IBD, simplified tree-based machine learning algorithms using administrative claims data can accurately predict patients at risk of progressing to HNHC. Wolters Kluwer 2022-06-07 /pmc/articles/PMC10476830/ /pubmed/35905414 http://dx.doi.org/10.14309/ctg.0000000000000507 Text en © 2022 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of The American College of Gastroenterology https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License 4.0 (CCBY) (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Article
Nguyen, Nghia H.
Patel, Sagar
Gabunilas, Jason
Qian, Alexander S.
Cecil, Alan
Jairath, Vipul
Sandborn, William J.
Ohno-Machado, Lucila
Chen, Peter L.
Singh, Siddharth
Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease
title Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease
title_full Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease
title_fullStr Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease
title_full_unstemmed Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease
title_short Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease
title_sort simplified machine learning models can accurately identify high-need high-cost patients with inflammatory bowel disease
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10476830/
https://www.ncbi.nlm.nih.gov/pubmed/35905414
http://dx.doi.org/10.14309/ctg.0000000000000507
work_keys_str_mv AT nguyennghiah simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT patelsagar simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT gabunilasjason simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT qianalexanders simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT cecilalan simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT jairathvipul simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT sandbornwilliamj simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT ohnomachadolucila simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT chenpeterl simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease
AT singhsiddharth simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease