Cargando…

Development and Validation of Algorithms to Identify COVID-19 Patients Using a US Electronic Health Records Database: A Retrospective Cohort Study

INTRODUCTION: In order to identify and evaluate candidate algorithms to detect COVID-19 cases in an electronic health record (EHR) database, this study examined and compared the utilization of acute respiratory disease codes from February to August 2020 versus the corresponding time period in the 3...

Descripción completa

Detalles Bibliográficos
Autores principales: Brown, Carolyn A, Londhe, Ajit A, He, Fang, Cheng, Alvan, Ma, Junjie, Zhang, Jie, Brooks, Corinne G, Sprafka, J Michael, Roehl, Kimberly A, Carlson, Katherine B, Page, John H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139367/
https://www.ncbi.nlm.nih.gov/pubmed/35633659
http://dx.doi.org/10.2147/CLEP.S355086
_version_ 1784714843057553408
author Brown, Carolyn A
Londhe, Ajit A
He, Fang
Cheng, Alvan
Ma, Junjie
Zhang, Jie
Brooks, Corinne G
Sprafka, J Michael
Roehl, Kimberly A
Carlson, Katherine B
Page, John H
author_facet Brown, Carolyn A
Londhe, Ajit A
He, Fang
Cheng, Alvan
Ma, Junjie
Zhang, Jie
Brooks, Corinne G
Sprafka, J Michael
Roehl, Kimberly A
Carlson, Katherine B
Page, John H
author_sort Brown, Carolyn A
collection PubMed
description INTRODUCTION: In order to identify and evaluate candidate algorithms to detect COVID-19 cases in an electronic health record (EHR) database, this study examined and compared the utilization of acute respiratory disease codes from February to August 2020 versus the corresponding time period in the 3 years preceding. METHODS: De-identified EHR data were used to identify codes of interest for candidate algorithms to identify COVID-19 patients. The number and proportion of patients who received a SARS-CoV-2 reverse transcriptase polymerase chain reaction (RT-PCR) within ±10 days of the occurrence of the diagnosis code and patients who tested positive among those with a test result were calculated, resulting in 11 candidate algorithms. Sensitivity, specificity, and likelihood ratios assessed the candidate algorithms by clinical setting and time period. We adjusted for potential verification bias by weighting by the reciprocal of the estimated probability of verification. RESULTS: From January to March 2020, the most commonly used diagnosis codes related to COVID-19 diagnosis were R06 (dyspnea) and R05 (cough). On or after April 1, 2020, the code with highest sensitivity for COVID-19, U07.1, had near perfect adjusted sensitivity (1.00 [95% CI 1.00, 1.00]) but low adjusted specificity (0.32 [95% CI 0.31, 0.33]) in hospitalized patients. DISCUSSION: Algorithms based on the U07.1 code had high sensitivity among hospitalized patients, but low specificity, especially after April 2020. None of the combinations of ICD-10-CM codes assessed performed with a satisfactory combination of high sensitivity and high specificity when using the SARS-CoV-2 RT-PCR as the reference standard.
format Online
Article
Text
id pubmed-9139367
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-91393672022-05-28 Development and Validation of Algorithms to Identify COVID-19 Patients Using a US Electronic Health Records Database: A Retrospective Cohort Study Brown, Carolyn A Londhe, Ajit A He, Fang Cheng, Alvan Ma, Junjie Zhang, Jie Brooks, Corinne G Sprafka, J Michael Roehl, Kimberly A Carlson, Katherine B Page, John H Clin Epidemiol Original Research INTRODUCTION: In order to identify and evaluate candidate algorithms to detect COVID-19 cases in an electronic health record (EHR) database, this study examined and compared the utilization of acute respiratory disease codes from February to August 2020 versus the corresponding time period in the 3 years preceding. METHODS: De-identified EHR data were used to identify codes of interest for candidate algorithms to identify COVID-19 patients. The number and proportion of patients who received a SARS-CoV-2 reverse transcriptase polymerase chain reaction (RT-PCR) within ±10 days of the occurrence of the diagnosis code and patients who tested positive among those with a test result were calculated, resulting in 11 candidate algorithms. Sensitivity, specificity, and likelihood ratios assessed the candidate algorithms by clinical setting and time period. We adjusted for potential verification bias by weighting by the reciprocal of the estimated probability of verification. RESULTS: From January to March 2020, the most commonly used diagnosis codes related to COVID-19 diagnosis were R06 (dyspnea) and R05 (cough). On or after April 1, 2020, the code with highest sensitivity for COVID-19, U07.1, had near perfect adjusted sensitivity (1.00 [95% CI 1.00, 1.00]) but low adjusted specificity (0.32 [95% CI 0.31, 0.33]) in hospitalized patients. DISCUSSION: Algorithms based on the U07.1 code had high sensitivity among hospitalized patients, but low specificity, especially after April 2020. None of the combinations of ICD-10-CM codes assessed performed with a satisfactory combination of high sensitivity and high specificity when using the SARS-CoV-2 RT-PCR as the reference standard. Dove 2022-05-23 /pmc/articles/PMC9139367/ /pubmed/35633659 http://dx.doi.org/10.2147/CLEP.S355086 Text en © 2022 Brown et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Brown, Carolyn A
Londhe, Ajit A
He, Fang
Cheng, Alvan
Ma, Junjie
Zhang, Jie
Brooks, Corinne G
Sprafka, J Michael
Roehl, Kimberly A
Carlson, Katherine B
Page, John H
Development and Validation of Algorithms to Identify COVID-19 Patients Using a US Electronic Health Records Database: A Retrospective Cohort Study
title Development and Validation of Algorithms to Identify COVID-19 Patients Using a US Electronic Health Records Database: A Retrospective Cohort Study
title_full Development and Validation of Algorithms to Identify COVID-19 Patients Using a US Electronic Health Records Database: A Retrospective Cohort Study
title_fullStr Development and Validation of Algorithms to Identify COVID-19 Patients Using a US Electronic Health Records Database: A Retrospective Cohort Study
title_full_unstemmed Development and Validation of Algorithms to Identify COVID-19 Patients Using a US Electronic Health Records Database: A Retrospective Cohort Study
title_short Development and Validation of Algorithms to Identify COVID-19 Patients Using a US Electronic Health Records Database: A Retrospective Cohort Study
title_sort development and validation of algorithms to identify covid-19 patients using a us electronic health records database: a retrospective cohort study
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139367/
https://www.ncbi.nlm.nih.gov/pubmed/35633659
http://dx.doi.org/10.2147/CLEP.S355086
work_keys_str_mv AT browncarolyna developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT londheajita developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT hefang developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT chengalvan developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT majunjie developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT zhangjie developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT brookscorinneg developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT sprafkajmichael developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT roehlkimberlya developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT carlsonkatherineb developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy
AT pagejohnh developmentandvalidationofalgorithmstoidentifycovid19patientsusingauselectronichealthrecordsdatabasearetrospectivecohortstudy