Cargando…

Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning

The most accurate prognostic approach for follicular lymphoma (FL), progression of disease at 24 months (POD24), requires two years’ observation after initiating first-line therapy (L1) to predict outcomes. We applied machine learning to structured electronic health record (EHR) data to predict indi...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Chunyang, Patil, Vikas, Rasmussen, Kelli M., Yong, Christina, Chien, Hsu-Chih, Morreall, Debbie, Humpherys, Jeffrey, Sauer, Brian C., Burningham, Zachary, Halwani, Ahmad S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7967359/
https://www.ncbi.nlm.nih.gov/pubmed/33799968
http://dx.doi.org/10.3390/ijerph18052679
_version_ 1783665857568702464
author Li, Chunyang
Patil, Vikas
Rasmussen, Kelli M.
Yong, Christina
Chien, Hsu-Chih
Morreall, Debbie
Humpherys, Jeffrey
Sauer, Brian C.
Burningham, Zachary
Halwani, Ahmad S.
author_facet Li, Chunyang
Patil, Vikas
Rasmussen, Kelli M.
Yong, Christina
Chien, Hsu-Chih
Morreall, Debbie
Humpherys, Jeffrey
Sauer, Brian C.
Burningham, Zachary
Halwani, Ahmad S.
author_sort Li, Chunyang
collection PubMed
description The most accurate prognostic approach for follicular lymphoma (FL), progression of disease at 24 months (POD24), requires two years’ observation after initiating first-line therapy (L1) to predict outcomes. We applied machine learning to structured electronic health record (EHR) data to predict individual survival at L1 initiation. We grouped 523 observations and 1933 variables from a nationwide cohort of FL patients diagnosed 2006–2014 in the Veterans Health Administration into traditionally used prognostic variables (“curated”), commonly measured labs (“labs”), and International Classification of Diseases diagnostic codes (“ICD”) sets. We compared performance of random survival forests (RSF) vs. traditional Cox model using four datasets: curated, curated + labs, curated + ICD, and curated + ICD + labs, also using Cox on curated + POD24. We evaluated variable importance and partial dependence plots with area under the receiver operating characteristic curve (AUC). RSF with curated + labs performed best, with mean AUC 0.73 (95% CI: 0.71–0.75). It approximated, but did not surpass, Cox with POD24 (mean AUC 0.74 [95% CI: 0.71–0.77]). RSF using EHR data achieved better performance than traditional prognostic variables, setting the foundation for the incorporation of our algorithm into the EHR. It also provides for possible future scenarios in which clinicians could be provided an EHR-based tool which approximates the predictive ability of the most accurate known indicator, using information available 24 months earlier.
format Online
Article
Text
id pubmed-7967359
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79673592021-03-18 Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning Li, Chunyang Patil, Vikas Rasmussen, Kelli M. Yong, Christina Chien, Hsu-Chih Morreall, Debbie Humpherys, Jeffrey Sauer, Brian C. Burningham, Zachary Halwani, Ahmad S. Int J Environ Res Public Health Article The most accurate prognostic approach for follicular lymphoma (FL), progression of disease at 24 months (POD24), requires two years’ observation after initiating first-line therapy (L1) to predict outcomes. We applied machine learning to structured electronic health record (EHR) data to predict individual survival at L1 initiation. We grouped 523 observations and 1933 variables from a nationwide cohort of FL patients diagnosed 2006–2014 in the Veterans Health Administration into traditionally used prognostic variables (“curated”), commonly measured labs (“labs”), and International Classification of Diseases diagnostic codes (“ICD”) sets. We compared performance of random survival forests (RSF) vs. traditional Cox model using four datasets: curated, curated + labs, curated + ICD, and curated + ICD + labs, also using Cox on curated + POD24. We evaluated variable importance and partial dependence plots with area under the receiver operating characteristic curve (AUC). RSF with curated + labs performed best, with mean AUC 0.73 (95% CI: 0.71–0.75). It approximated, but did not surpass, Cox with POD24 (mean AUC 0.74 [95% CI: 0.71–0.77]). RSF using EHR data achieved better performance than traditional prognostic variables, setting the foundation for the incorporation of our algorithm into the EHR. It also provides for possible future scenarios in which clinicians could be provided an EHR-based tool which approximates the predictive ability of the most accurate known indicator, using information available 24 months earlier. MDPI 2021-03-07 /pmc/articles/PMC7967359/ /pubmed/33799968 http://dx.doi.org/10.3390/ijerph18052679 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Chunyang
Patil, Vikas
Rasmussen, Kelli M.
Yong, Christina
Chien, Hsu-Chih
Morreall, Debbie
Humpherys, Jeffrey
Sauer, Brian C.
Burningham, Zachary
Halwani, Ahmad S.
Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning
title Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning
title_full Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning
title_fullStr Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning
title_full_unstemmed Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning
title_short Predicting Survival in Veterans with Follicular Lymphoma Using Structured Electronic Health Record Information and Machine Learning
title_sort predicting survival in veterans with follicular lymphoma using structured electronic health record information and machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7967359/
https://www.ncbi.nlm.nih.gov/pubmed/33799968
http://dx.doi.org/10.3390/ijerph18052679
work_keys_str_mv AT lichunyang predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT patilvikas predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT rasmussenkellim predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT yongchristina predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT chienhsuchih predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT morrealldebbie predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT humpherysjeffrey predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT sauerbrianc predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT burninghamzachary predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning
AT halwaniahmads predictingsurvivalinveteranswithfollicularlymphomausingstructuredelectronichealthrecordinformationandmachinelearning