Cargando…
Leveraging electronic health records data to predict multiple sclerosis disease activity
OBJECTIVE: No relapse risk prediction tool is currently available to guide treatment selection for multiple sclerosis (MS). Leveraging electronic health record (EHR) data readily available at the point of care, we developed a clinical tool for predicting MS relapse risk. METHODS: Using data from a c...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8045951/ https://www.ncbi.nlm.nih.gov/pubmed/33626237 http://dx.doi.org/10.1002/acn3.51324 |
_version_ | 1783678757639290880 |
---|---|
author | Ahuja, Yuri Kim, Nicole Liang, Liang Cai, Tianrun Dahal, Kumar Seyok, Thany Lin, Chen Finan, Sean Liao, Katherine Savovoa, Guergana Chitnis, Tanuja Cai, Tianxi Xia, Zongqi |
author_facet | Ahuja, Yuri Kim, Nicole Liang, Liang Cai, Tianrun Dahal, Kumar Seyok, Thany Lin, Chen Finan, Sean Liao, Katherine Savovoa, Guergana Chitnis, Tanuja Cai, Tianxi Xia, Zongqi |
author_sort | Ahuja, Yuri |
collection | PubMed |
description | OBJECTIVE: No relapse risk prediction tool is currently available to guide treatment selection for multiple sclerosis (MS). Leveraging electronic health record (EHR) data readily available at the point of care, we developed a clinical tool for predicting MS relapse risk. METHODS: Using data from a clinic‐based research registry and linked EHR system between 2006 and 2016, we developed models predicting relapse events from the registry in a training set (n = 1435) and tested the model performance in an independent validation set of MS patients (n = 186). This iterative process identified prior 1‐year relapse history as a key predictor of future relapse but ascertaining relapse history through the labor‐intensive chart review is impractical. We pursued two‐stage algorithm development: (1) L(1)‐regularized logistic regression (LASSO) to phenotype past 1‐year relapse status from contemporaneous EHR data, (2) LASSO to predict future 1‐year relapse risk using imputed prior 1‐year relapse status and other algorithm‐selected features. RESULTS: The final model, comprising age, disease duration, and imputed prior 1‐year relapse history, achieved a predictive AUC and F score of 0.707 and 0.307, respectively. The performance was significantly better than the baseline model (age, sex, race/ethnicity, and disease duration) and noninferior to a model containing actual prior 1‐year relapse history. The predicted risk probability declined with disease duration and age. CONCLUSION: Our novel machine‐learning algorithm predicts 1‐year MS relapse with accuracy comparable to other clinical prediction tools and has applicability at the point of care. This EHR‐based two‐stage approach of outcome prediction may have application to neurological disease beyond MS. |
format | Online Article Text |
id | pubmed-8045951 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80459512021-04-16 Leveraging electronic health records data to predict multiple sclerosis disease activity Ahuja, Yuri Kim, Nicole Liang, Liang Cai, Tianrun Dahal, Kumar Seyok, Thany Lin, Chen Finan, Sean Liao, Katherine Savovoa, Guergana Chitnis, Tanuja Cai, Tianxi Xia, Zongqi Ann Clin Transl Neurol Research Articles OBJECTIVE: No relapse risk prediction tool is currently available to guide treatment selection for multiple sclerosis (MS). Leveraging electronic health record (EHR) data readily available at the point of care, we developed a clinical tool for predicting MS relapse risk. METHODS: Using data from a clinic‐based research registry and linked EHR system between 2006 and 2016, we developed models predicting relapse events from the registry in a training set (n = 1435) and tested the model performance in an independent validation set of MS patients (n = 186). This iterative process identified prior 1‐year relapse history as a key predictor of future relapse but ascertaining relapse history through the labor‐intensive chart review is impractical. We pursued two‐stage algorithm development: (1) L(1)‐regularized logistic regression (LASSO) to phenotype past 1‐year relapse status from contemporaneous EHR data, (2) LASSO to predict future 1‐year relapse risk using imputed prior 1‐year relapse status and other algorithm‐selected features. RESULTS: The final model, comprising age, disease duration, and imputed prior 1‐year relapse history, achieved a predictive AUC and F score of 0.707 and 0.307, respectively. The performance was significantly better than the baseline model (age, sex, race/ethnicity, and disease duration) and noninferior to a model containing actual prior 1‐year relapse history. The predicted risk probability declined with disease duration and age. CONCLUSION: Our novel machine‐learning algorithm predicts 1‐year MS relapse with accuracy comparable to other clinical prediction tools and has applicability at the point of care. This EHR‐based two‐stage approach of outcome prediction may have application to neurological disease beyond MS. John Wiley and Sons Inc. 2021-02-24 /pmc/articles/PMC8045951/ /pubmed/33626237 http://dx.doi.org/10.1002/acn3.51324 Text en © 2021 The Authors. Annals of Clinical and Translational Neurology published by Wiley Periodicals LLC on behalf of American Neurological Association https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Research Articles Ahuja, Yuri Kim, Nicole Liang, Liang Cai, Tianrun Dahal, Kumar Seyok, Thany Lin, Chen Finan, Sean Liao, Katherine Savovoa, Guergana Chitnis, Tanuja Cai, Tianxi Xia, Zongqi Leveraging electronic health records data to predict multiple sclerosis disease activity |
title | Leveraging electronic health records data to predict multiple sclerosis disease activity |
title_full | Leveraging electronic health records data to predict multiple sclerosis disease activity |
title_fullStr | Leveraging electronic health records data to predict multiple sclerosis disease activity |
title_full_unstemmed | Leveraging electronic health records data to predict multiple sclerosis disease activity |
title_short | Leveraging electronic health records data to predict multiple sclerosis disease activity |
title_sort | leveraging electronic health records data to predict multiple sclerosis disease activity |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8045951/ https://www.ncbi.nlm.nih.gov/pubmed/33626237 http://dx.doi.org/10.1002/acn3.51324 |
work_keys_str_mv | AT ahujayuri leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT kimnicole leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT liangliang leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT caitianrun leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT dahalkumar leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT seyokthany leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT linchen leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT finansean leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT liaokatherine leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT savovoaguergana leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT chitnistanuja leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT caitianxi leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity AT xiazongqi leveragingelectronichealthrecordsdatatopredictmultiplesclerosisdiseaseactivity |