Cargando…

The impact of electronic health record discontinuity on prediction modeling

BACKGROUND: To determine the impact of electronic health record (EHR)-discontinuity on the performance of prediction models. METHODS: The study population consisted of patients with a history of cardiovascular (CV) comorbidities identified using US Medicare claims data from 2007 to 2017, linked to E...

Descripción completa

Detalles Bibliográficos
Autores principales: Kar, Shreyas, Bessette, Lily G., Wyss, Richard, Kesselheim, Aaron S., Lin, Kueiyu Joshua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10325091/
https://www.ncbi.nlm.nih.gov/pubmed/37410777
http://dx.doi.org/10.1371/journal.pone.0287985
_version_ 1785069238601383936
author Kar, Shreyas
Bessette, Lily G.
Wyss, Richard
Kesselheim, Aaron S.
Lin, Kueiyu Joshua
author_facet Kar, Shreyas
Bessette, Lily G.
Wyss, Richard
Kesselheim, Aaron S.
Lin, Kueiyu Joshua
author_sort Kar, Shreyas
collection PubMed
description BACKGROUND: To determine the impact of electronic health record (EHR)-discontinuity on the performance of prediction models. METHODS: The study population consisted of patients with a history of cardiovascular (CV) comorbidities identified using US Medicare claims data from 2007 to 2017, linked to EHR from two networks (used as model training and validation set, respectively). We built models predicting one-year risk of mortality, major CV events, and major bleeding events, stratified by high vs. low algorithm-predicted EHR-continuity. The best-performing models for each outcome were chosen among 5 commonly used machine-learning models. We compared model performance by Area under the ROC curve (AUROC) and Area under the precision-recall curve (AUPRC). RESULTS: Based on 180,950 in the training and 103,061 in the validation set, we found EHR captured only 21.0–28.1% of all the non-fatal outcomes in the low EHR-continuity cohort but 55.4–66.1% of that in the high EHR-continuity cohort. In the validation set, the best-performing model developed among high EHR-continuity patients had consistently higher AUROC than that based on low-continuity patients: AUROC was 0.849 vs. 0.743 when predicting mortality; AUROC was 0.802 vs. 0.659 predicting the CV events; AUROC was 0.635 vs. 0.567 predicting major bleeding. We observed a similar pattern when using AUPRC as the outcome metric. CONCLUSIONS: Among patients with CV comorbidities, when predicting mortality, major CV events, and bleeding outcomes, the prediction models developed in datasets with low EHR-continuity consistently had worse performance compared to models developed with high EHR-continuity.
format Online
Article
Text
id pubmed-10325091
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-103250912023-07-07 The impact of electronic health record discontinuity on prediction modeling Kar, Shreyas Bessette, Lily G. Wyss, Richard Kesselheim, Aaron S. Lin, Kueiyu Joshua PLoS One Research Article BACKGROUND: To determine the impact of electronic health record (EHR)-discontinuity on the performance of prediction models. METHODS: The study population consisted of patients with a history of cardiovascular (CV) comorbidities identified using US Medicare claims data from 2007 to 2017, linked to EHR from two networks (used as model training and validation set, respectively). We built models predicting one-year risk of mortality, major CV events, and major bleeding events, stratified by high vs. low algorithm-predicted EHR-continuity. The best-performing models for each outcome were chosen among 5 commonly used machine-learning models. We compared model performance by Area under the ROC curve (AUROC) and Area under the precision-recall curve (AUPRC). RESULTS: Based on 180,950 in the training and 103,061 in the validation set, we found EHR captured only 21.0–28.1% of all the non-fatal outcomes in the low EHR-continuity cohort but 55.4–66.1% of that in the high EHR-continuity cohort. In the validation set, the best-performing model developed among high EHR-continuity patients had consistently higher AUROC than that based on low-continuity patients: AUROC was 0.849 vs. 0.743 when predicting mortality; AUROC was 0.802 vs. 0.659 predicting the CV events; AUROC was 0.635 vs. 0.567 predicting major bleeding. We observed a similar pattern when using AUPRC as the outcome metric. CONCLUSIONS: Among patients with CV comorbidities, when predicting mortality, major CV events, and bleeding outcomes, the prediction models developed in datasets with low EHR-continuity consistently had worse performance compared to models developed with high EHR-continuity. Public Library of Science 2023-07-06 /pmc/articles/PMC10325091/ /pubmed/37410777 http://dx.doi.org/10.1371/journal.pone.0287985 Text en © 2023 Kar et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kar, Shreyas
Bessette, Lily G.
Wyss, Richard
Kesselheim, Aaron S.
Lin, Kueiyu Joshua
The impact of electronic health record discontinuity on prediction modeling
title The impact of electronic health record discontinuity on prediction modeling
title_full The impact of electronic health record discontinuity on prediction modeling
title_fullStr The impact of electronic health record discontinuity on prediction modeling
title_full_unstemmed The impact of electronic health record discontinuity on prediction modeling
title_short The impact of electronic health record discontinuity on prediction modeling
title_sort impact of electronic health record discontinuity on prediction modeling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10325091/
https://www.ncbi.nlm.nih.gov/pubmed/37410777
http://dx.doi.org/10.1371/journal.pone.0287985
work_keys_str_mv AT karshreyas theimpactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT bessettelilyg theimpactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT wyssrichard theimpactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT kesselheimaarons theimpactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT linkueiyujoshua theimpactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT karshreyas impactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT bessettelilyg impactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT wyssrichard impactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT kesselheimaarons impactofelectronichealthrecorddiscontinuityonpredictionmodeling
AT linkueiyujoshua impactofelectronichealthrecorddiscontinuityonpredictionmodeling