Cargando…
Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease
INTRODUCTION: Hospitalization is the primary driver of inflammatory bowel disease (IBD)-related healthcare costs and morbidity. Traditional prediction models have poor performance at identifying patients at highest risk of unplanned healthcare utilization. Identification of patients who are high-nee...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Wolters Kluwer
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10476830/ https://www.ncbi.nlm.nih.gov/pubmed/35905414 http://dx.doi.org/10.14309/ctg.0000000000000507 |
_version_ | 1785101015083646976 |
---|---|
author | Nguyen, Nghia H. Patel, Sagar Gabunilas, Jason Qian, Alexander S. Cecil, Alan Jairath, Vipul Sandborn, William J. Ohno-Machado, Lucila Chen, Peter L. Singh, Siddharth |
author_facet | Nguyen, Nghia H. Patel, Sagar Gabunilas, Jason Qian, Alexander S. Cecil, Alan Jairath, Vipul Sandborn, William J. Ohno-Machado, Lucila Chen, Peter L. Singh, Siddharth |
author_sort | Nguyen, Nghia H. |
collection | PubMed |
description | INTRODUCTION: Hospitalization is the primary driver of inflammatory bowel disease (IBD)-related healthcare costs and morbidity. Traditional prediction models have poor performance at identifying patients at highest risk of unplanned healthcare utilization. Identification of patients who are high-need and high-cost (HNHC) could reduce unplanned healthcare utilization and healthcare costs. METHODS: We conducted a retrospective cohort study in adult patients hospitalized with IBD using the Nationwide Readmissions Database (model derivation in the 2013 Nationwide Readmission Database and validation in the 2017 Nationwide Readmission Database). We built 2 tree-based algorithms (decision tree classifier and decision tree using gradient boosting framework [XGBoost]) and compared traditional logistic regression to identify patients at risk for becoming HNHC (patients in the highest decile of total days spent in hospital in a calendar year). RESULTS: Of 47,402 adult patients hospitalized with IBD, we identified 4,717 HNHC patients. The decision tree classifier model (length of stay, Charlson Comorbidity Index, procedure, Frailty Risk Score, and age) had a mean area under the receiver operating characteristic curve (AUC) of 0.78 ± 0.01 in the derivation data set and 0.78 ± 0.02 in the validation data set. XGBoost (length of stay, procedure, chronic pain, drug abuse, and diabetic complication) had a mean AUC of 0.79 ± 0.01 and 0.75 ± 0.02 in the derivation and validation data sets, respectively, compared with AUC 0.55 ± 0.01 and 0.56 ± 0.01 with traditional logistic regression (peptic ulcer disease, paresthesia, admission for osteomyelitis, renal failure, and lymphoma) in derivation and validation data sets, respectively. DISCUSSION: In hospitalized patients with IBD, simplified tree-based machine learning algorithms using administrative claims data can accurately predict patients at risk of progressing to HNHC. |
format | Online Article Text |
id | pubmed-10476830 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Wolters Kluwer |
record_format | MEDLINE/PubMed |
spelling | pubmed-104768302023-09-05 Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease Nguyen, Nghia H. Patel, Sagar Gabunilas, Jason Qian, Alexander S. Cecil, Alan Jairath, Vipul Sandborn, William J. Ohno-Machado, Lucila Chen, Peter L. Singh, Siddharth Clin Transl Gastroenterol Article INTRODUCTION: Hospitalization is the primary driver of inflammatory bowel disease (IBD)-related healthcare costs and morbidity. Traditional prediction models have poor performance at identifying patients at highest risk of unplanned healthcare utilization. Identification of patients who are high-need and high-cost (HNHC) could reduce unplanned healthcare utilization and healthcare costs. METHODS: We conducted a retrospective cohort study in adult patients hospitalized with IBD using the Nationwide Readmissions Database (model derivation in the 2013 Nationwide Readmission Database and validation in the 2017 Nationwide Readmission Database). We built 2 tree-based algorithms (decision tree classifier and decision tree using gradient boosting framework [XGBoost]) and compared traditional logistic regression to identify patients at risk for becoming HNHC (patients in the highest decile of total days spent in hospital in a calendar year). RESULTS: Of 47,402 adult patients hospitalized with IBD, we identified 4,717 HNHC patients. The decision tree classifier model (length of stay, Charlson Comorbidity Index, procedure, Frailty Risk Score, and age) had a mean area under the receiver operating characteristic curve (AUC) of 0.78 ± 0.01 in the derivation data set and 0.78 ± 0.02 in the validation data set. XGBoost (length of stay, procedure, chronic pain, drug abuse, and diabetic complication) had a mean AUC of 0.79 ± 0.01 and 0.75 ± 0.02 in the derivation and validation data sets, respectively, compared with AUC 0.55 ± 0.01 and 0.56 ± 0.01 with traditional logistic regression (peptic ulcer disease, paresthesia, admission for osteomyelitis, renal failure, and lymphoma) in derivation and validation data sets, respectively. DISCUSSION: In hospitalized patients with IBD, simplified tree-based machine learning algorithms using administrative claims data can accurately predict patients at risk of progressing to HNHC. Wolters Kluwer 2022-06-07 /pmc/articles/PMC10476830/ /pubmed/35905414 http://dx.doi.org/10.14309/ctg.0000000000000507 Text en © 2022 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of The American College of Gastroenterology https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License 4.0 (CCBY) (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Article Nguyen, Nghia H. Patel, Sagar Gabunilas, Jason Qian, Alexander S. Cecil, Alan Jairath, Vipul Sandborn, William J. Ohno-Machado, Lucila Chen, Peter L. Singh, Siddharth Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease |
title | Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease |
title_full | Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease |
title_fullStr | Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease |
title_full_unstemmed | Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease |
title_short | Simplified Machine Learning Models Can Accurately Identify High-Need High-Cost Patients With Inflammatory Bowel Disease |
title_sort | simplified machine learning models can accurately identify high-need high-cost patients with inflammatory bowel disease |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10476830/ https://www.ncbi.nlm.nih.gov/pubmed/35905414 http://dx.doi.org/10.14309/ctg.0000000000000507 |
work_keys_str_mv | AT nguyennghiah simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT patelsagar simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT gabunilasjason simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT qianalexanders simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT cecilalan simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT jairathvipul simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT sandbornwilliamj simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT ohnomachadolucila simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT chenpeterl simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease AT singhsiddharth simplifiedmachinelearningmodelscanaccuratelyidentifyhighneedhighcostpatientswithinflammatoryboweldisease |