Cargando…

Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record

BACKGROUND: Electronic medical records (EMRs) allow identification of disease‐specific patient populations, but varying electronic cohort definitions could result in different populations. We compared the characteristics of an electronic medical record–derived atrial fibrillation (AF) patient popula...

Descripción completa

Detalles Bibliográficos
Autores principales: Shah, Rashmee U., Mukherjee, Rebeka, Zhang, Yue, Jones, Aubrey E., Springer, Jennifer, Hackett, Ian, Steinberg, Benjamin A., Lloyd‐Jones, Donald M., Chapman, Wendy W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7335556/
https://www.ncbi.nlm.nih.gov/pubmed/32098599
http://dx.doi.org/10.1161/JAHA.119.014527
_version_ 1783554162216140800
author Shah, Rashmee U.
Mukherjee, Rebeka
Zhang, Yue
Jones, Aubrey E.
Springer, Jennifer
Hackett, Ian
Steinberg, Benjamin A.
Lloyd‐Jones, Donald M.
Chapman, Wendy W.
author_facet Shah, Rashmee U.
Mukherjee, Rebeka
Zhang, Yue
Jones, Aubrey E.
Springer, Jennifer
Hackett, Ian
Steinberg, Benjamin A.
Lloyd‐Jones, Donald M.
Chapman, Wendy W.
author_sort Shah, Rashmee U.
collection PubMed
description BACKGROUND: Electronic medical records (EMRs) allow identification of disease‐specific patient populations, but varying electronic cohort definitions could result in different populations. We compared the characteristics of an electronic medical record–derived atrial fibrillation (AF) patient population using 5 different electronic cohort definitions. METHODS AND RESULTS: Adult patients with at least 1 AF billing code from January 1, 2010, to December 31, 2017, were included. Based on different electronic cohort definitions, we trained 5 different logistic regression models using a labeled training data set (n=786). Each model yielded a predicted probability; patients were classified as having AF if the probability was higher than a specified cut point. Test characteristics were calculated for each model. These models were then applied to the full cohort and resulting characteristics were compared. In the training set, the comprehensive model (including demographics, billing codes, and natural language processing results) performed best, with an area under the curve of 0.89, sensitivity of 0.90, and specificity of 0.87. Among a candidate population (n=22 000), the proportion of patients identified as having AF varied from 61% in the model using diagnosis or procedure International Classification of Diseases (ICD) billing codes to 83% in the model using natural language processing of clinical notes. Among identified AF patients, the proportion of patients with a CHA (2) DS (2)‐VASc score ≥2 varied from 69% to 85%; oral anticoagulant treatment rates varied from 50% to 66% depending on the model. CONCLUSIONS: Different electronic cohort definitions result in substantially different AF study samples. This difference threatens the quality and reproducibility of electronic medical record–based research and quality initiatives.
format Online
Article
Text
id pubmed-7335556
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-73355562020-07-08 Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record Shah, Rashmee U. Mukherjee, Rebeka Zhang, Yue Jones, Aubrey E. Springer, Jennifer Hackett, Ian Steinberg, Benjamin A. Lloyd‐Jones, Donald M. Chapman, Wendy W. J Am Heart Assoc Original Research BACKGROUND: Electronic medical records (EMRs) allow identification of disease‐specific patient populations, but varying electronic cohort definitions could result in different populations. We compared the characteristics of an electronic medical record–derived atrial fibrillation (AF) patient population using 5 different electronic cohort definitions. METHODS AND RESULTS: Adult patients with at least 1 AF billing code from January 1, 2010, to December 31, 2017, were included. Based on different electronic cohort definitions, we trained 5 different logistic regression models using a labeled training data set (n=786). Each model yielded a predicted probability; patients were classified as having AF if the probability was higher than a specified cut point. Test characteristics were calculated for each model. These models were then applied to the full cohort and resulting characteristics were compared. In the training set, the comprehensive model (including demographics, billing codes, and natural language processing results) performed best, with an area under the curve of 0.89, sensitivity of 0.90, and specificity of 0.87. Among a candidate population (n=22 000), the proportion of patients identified as having AF varied from 61% in the model using diagnosis or procedure International Classification of Diseases (ICD) billing codes to 83% in the model using natural language processing of clinical notes. Among identified AF patients, the proportion of patients with a CHA (2) DS (2)‐VASc score ≥2 varied from 69% to 85%; oral anticoagulant treatment rates varied from 50% to 66% depending on the model. CONCLUSIONS: Different electronic cohort definitions result in substantially different AF study samples. This difference threatens the quality and reproducibility of electronic medical record–based research and quality initiatives. John Wiley and Sons Inc. 2020-02-26 /pmc/articles/PMC7335556/ /pubmed/32098599 http://dx.doi.org/10.1161/JAHA.119.014527 Text en © 2020 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Original Research
Shah, Rashmee U.
Mukherjee, Rebeka
Zhang, Yue
Jones, Aubrey E.
Springer, Jennifer
Hackett, Ian
Steinberg, Benjamin A.
Lloyd‐Jones, Donald M.
Chapman, Wendy W.
Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record
title Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record
title_full Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record
title_fullStr Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record
title_full_unstemmed Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record
title_short Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record
title_sort impact of different electronic cohort definitions to identify patients with atrial fibrillation from the electronic medical record
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7335556/
https://www.ncbi.nlm.nih.gov/pubmed/32098599
http://dx.doi.org/10.1161/JAHA.119.014527
work_keys_str_mv AT shahrashmeeu impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord
AT mukherjeerebeka impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord
AT zhangyue impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord
AT jonesaubreye impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord
AT springerjennifer impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord
AT hackettian impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord
AT steinbergbenjamina impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord
AT lloydjonesdonaldm impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord
AT chapmanwendyw impactofdifferentelectroniccohortdefinitionstoidentifypatientswithatrialfibrillationfromtheelectronicmedicalrecord