Cargando…
Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning
The most accurate prognostic approach for follicular lymphoma (FL), progression of disease at 24 months (POD24), requires two years’ observation after initiating first-line therapy (L1) to predict outcomes. We applied machine learning to structured electronic health record (EHR) data to predict indi...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7967359/ https://www.ncbi.nlm.nih.gov/pubmed/33799968 http://dx.doi.org/10.3390/ijerph18052679 |
_version_ | 1783665857568702464 |
---|---|
author | Li, Chunyang Patil, Vikas Rasmussen, Kelli M. Yong, Christina Chien, Hsu-Chih Morreall, Debbie Humpherys, Jeffrey Sauer, Brian C. Burningham, Zachary Halwani, Ahmad S. |
author_facet | Li, Chunyang Patil, Vikas Rasmussen, Kelli M. Yong, Christina Chien, Hsu-Chih Morreall, Debbie Humpherys, Jeffrey Sauer, Brian C. Burningham, Zachary Halwani, Ahmad S. |
author_sort | Li, Chunyang |
collection | PubMed |
description | The most accurate prognostic approach for follicular lymphoma (FL), progression of disease at 24 months (POD24), requires two years’ observation after initiating first-line therapy (L1) to predict outcomes. We applied machine learning to structured electronic health record (EHR) data to predict individual survival at L1 initiation. We grouped 523 observations and 1933 variables from a nationwide cohort of FL patients diagnosed 2006–2014 in the Veterans Health Administration into traditionally used prognostic variables (“curated”), commonly measured labs (“labs”), and International Classification of Diseases diagnostic codes (“ICD”) sets. We compared performance of random survival forests (RSF) vs. traditional Cox model using four datasets: curated, curated + labs, curated + ICD, and curated + ICD + labs, also using Cox on curated + POD24. We evaluated variable importance and partial dependence plots with area under the receiver operating characteristic curve (AUC). RSF with curated + labs performed best, with mean AUC 0.73 (95% CI: 0.71–0.75). It approximated, but did not surpass, Cox with POD24 (mean AUC 0.74 [95% CI: 0.71–0.77]). RSF using EHR data achieved better performance than traditional prognostic variables, setting the foundation for the incorporation of our algorithm into the EHR. It also provides for possible future scenarios in which clinicians could be provided an EHR-based tool which approximates the predictive ability of the most accurate known indicator, using information available 24 months earlier. |
format | Online Article Text |
id | pubmed-7967359 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-79673592021-03-18 Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning Li, Chunyang Patil, Vikas Rasmussen, Kelli M. Yong, Christina Chien, Hsu-Chih Morreall, Debbie Humpherys, Jeffrey Sauer, Brian C. Burningham, Zachary Halwani, Ahmad S. Int J Environ Res Public Health Article The most accurate prognostic approach for follicular lymphoma (FL), progression of disease at 24 months (POD24), requires two years’ observation after initiating first-line therapy (L1) to predict outcomes. We applied machine learning to structured electronic health record (EHR) data to predict individual survival at L1 initiation. We grouped 523 observations and 1933 variables from a nationwide cohort of FL patients diagnosed 2006–2014 in the Veterans Health Administration into traditionally used prognostic variables (“curated”), commonly measured labs (“labs”), and International Classification of Diseases diagnostic codes (“ICD”) sets. We compared performance of random survival forests (RSF) vs. traditional Cox model using four datasets: curated, curated + labs, curated + ICD, and curated + ICD + labs, also using Cox on curated + POD24. We evaluated variable importance and partial dependence plots with area under the receiver operating characteristic curve (AUC). RSF with curated + labs performed best, with mean AUC 0.73 (95% CI: 0.71–0.75). It approximated, but did not surpass, Cox with POD24 (mean AUC 0.74 [95% CI: 0.71–0.77]). RSF using EHR data achieved better performance than traditional prognostic variables, setting the foundation for the incorporation of our algorithm into the EHR. It also provides for possible future scenarios in which clinicians could be provided an EHR-based tool which approximates the predictive ability of the most accurate known indicator, using information available 24 months earlier. MDPI 2021-03-07 /pmc/articles/PMC7967359/ /pubmed/33799968 http://dx.doi.org/10.3390/ijerph18052679 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, Chunyang Patil, Vikas Rasmussen, Kelli M. Yong, Christina Chien, Hsu-Chih Morreall, Debbie Humpherys, Jeffrey Sauer, Brian C. Burningham, Zachary Halwani, Ahmad S. Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning |
title | Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning |
title_full | Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning |
title_fullStr | Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning |
title_full_unstemmed | Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning |
title_short | Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning |
title_sort | predicting survival in veterans with follicular lymphoma using structured electronic health record information and machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7967359/ https://www.ncbi.nlm.nih.gov/pubmed/33799968 http://dx.doi.org/10.3390/ijerph18052679 |
work_keys_str_mv | AT lichunyang predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT patilvikas predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT rasmussenkellim predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT yongchristina predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT chienhsuchih predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT morrealldebbie predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT humpherysjeffrey predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT sauerbrianc predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT burninghamzachary predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning AT halwaniahmads predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning |