Cargando…

Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group

OBJECTIVE: Long-term follow-up of population-based prospective studies is often achieved through linkages to coded regional or national health care data. Our knowledge of the accuracy of such data is incomplete. To inform methods for identifying stroke cases in UK Biobank (a prospective study of 503...

Descripción completa

Detalles Bibliográficos
Autores principales: Woodfield, Rebecca, Grant, Ian, Sudlow, Cathie L. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619732/
https://www.ncbi.nlm.nih.gov/pubmed/26496350
http://dx.doi.org/10.1371/journal.pone.0140533
_version_ 1782397170544017408
author Woodfield, Rebecca
Grant, Ian
Sudlow, Cathie L. M.
author_facet Woodfield, Rebecca
Grant, Ian
Sudlow, Cathie L. M.
author_sort Woodfield, Rebecca
collection PubMed
description OBJECTIVE: Long-term follow-up of population-based prospective studies is often achieved through linkages to coded regional or national health care data. Our knowledge of the accuracy of such data is incomplete. To inform methods for identifying stroke cases in UK Biobank (a prospective study of 503,000 UK adults recruited in middle-age), we systematically evaluated the accuracy of these data for stroke and its main pathological types (ischaemic stroke, intracerebral haemorrhage, subarachnoid haemorrhage), determining the optimum codes for case identification. METHODS: We sought studies published from 1990-November 2013, which compared coded data from death certificates, hospital admissions or primary care with a reference standard for stroke or its pathological types. We extracted information on a range of study characteristics and assessed study quality with the Quality Assessment of Diagnostic Studies tool (QUADAS-2). To assess accuracy, we extracted data on positive predictive values (PPV) and—where available—on sensitivity, specificity, and negative predictive values (NPV). RESULTS: 37 of 39 eligible studies assessed accuracy of International Classification of Diseases (ICD)-coded hospital or death certificate data. They varied widely in their settings, methods, reporting, quality, and in the choice and accuracy of codes. Although PPVs for stroke and its pathological types ranged from 6–97%, appropriately selected, stroke-specific codes (rather than broad cerebrovascular codes) consistently produced PPVs >70%, and in several studies >90%. The few studies with data on sensitivity, specificity and NPV showed higher sensitivity of hospital versus death certificate data for stroke, with specificity and NPV consistently >96%. Few studies assessed either primary care data or combinations of data sources. CONCLUSIONS: Particular stroke-specific codes can yield high PPVs (>90%) for stroke/stroke types. Inclusion of primary care data and combining data sources should improve accuracy in large epidemiological studies, but there is limited published information about these strategies.
format Online
Article
Text
id pubmed-4619732
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46197322015-10-29 Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group Woodfield, Rebecca Grant, Ian Sudlow, Cathie L. M. PLoS One Research Article OBJECTIVE: Long-term follow-up of population-based prospective studies is often achieved through linkages to coded regional or national health care data. Our knowledge of the accuracy of such data is incomplete. To inform methods for identifying stroke cases in UK Biobank (a prospective study of 503,000 UK adults recruited in middle-age), we systematically evaluated the accuracy of these data for stroke and its main pathological types (ischaemic stroke, intracerebral haemorrhage, subarachnoid haemorrhage), determining the optimum codes for case identification. METHODS: We sought studies published from 1990-November 2013, which compared coded data from death certificates, hospital admissions or primary care with a reference standard for stroke or its pathological types. We extracted information on a range of study characteristics and assessed study quality with the Quality Assessment of Diagnostic Studies tool (QUADAS-2). To assess accuracy, we extracted data on positive predictive values (PPV) and—where available—on sensitivity, specificity, and negative predictive values (NPV). RESULTS: 37 of 39 eligible studies assessed accuracy of International Classification of Diseases (ICD)-coded hospital or death certificate data. They varied widely in their settings, methods, reporting, quality, and in the choice and accuracy of codes. Although PPVs for stroke and its pathological types ranged from 6–97%, appropriately selected, stroke-specific codes (rather than broad cerebrovascular codes) consistently produced PPVs >70%, and in several studies >90%. The few studies with data on sensitivity, specificity and NPV showed higher sensitivity of hospital versus death certificate data for stroke, with specificity and NPV consistently >96%. Few studies assessed either primary care data or combinations of data sources. CONCLUSIONS: Particular stroke-specific codes can yield high PPVs (>90%) for stroke/stroke types. Inclusion of primary care data and combining data sources should improve accuracy in large epidemiological studies, but there is limited published information about these strategies. Public Library of Science 2015-10-23 /pmc/articles/PMC4619732/ /pubmed/26496350 http://dx.doi.org/10.1371/journal.pone.0140533 Text en © 2015 Woodfield et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Woodfield, Rebecca
Grant, Ian
Sudlow, Cathie L. M.
Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group
title Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group
title_full Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group
title_fullStr Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group
title_full_unstemmed Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group
title_short Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group
title_sort accuracy of electronic health record data for identifying stroke cases in large-scale epidemiological studies: a systematic review from the uk biobank stroke outcomes group
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619732/
https://www.ncbi.nlm.nih.gov/pubmed/26496350
http://dx.doi.org/10.1371/journal.pone.0140533
work_keys_str_mv AT woodfieldrebecca accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup
AT grantian accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup
AT accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup
AT accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup
AT sudlowcathielm accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup