Cargando…
Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group
OBJECTIVE: Long-term follow-up of population-based prospective studies is often achieved through linkages to coded regional or national health care data. Our knowledge of the accuracy of such data is incomplete. To inform methods for identifying stroke cases in UK Biobank (a prospective study of 503...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619732/ https://www.ncbi.nlm.nih.gov/pubmed/26496350 http://dx.doi.org/10.1371/journal.pone.0140533 |
_version_ | 1782397170544017408 |
---|---|
author | Woodfield, Rebecca Grant, Ian Sudlow, Cathie L. M. |
author_facet | Woodfield, Rebecca Grant, Ian Sudlow, Cathie L. M. |
author_sort | Woodfield, Rebecca |
collection | PubMed |
description | OBJECTIVE: Long-term follow-up of population-based prospective studies is often achieved through linkages to coded regional or national health care data. Our knowledge of the accuracy of such data is incomplete. To inform methods for identifying stroke cases in UK Biobank (a prospective study of 503,000 UK adults recruited in middle-age), we systematically evaluated the accuracy of these data for stroke and its main pathological types (ischaemic stroke, intracerebral haemorrhage, subarachnoid haemorrhage), determining the optimum codes for case identification. METHODS: We sought studies published from 1990-November 2013, which compared coded data from death certificates, hospital admissions or primary care with a reference standard for stroke or its pathological types. We extracted information on a range of study characteristics and assessed study quality with the Quality Assessment of Diagnostic Studies tool (QUADAS-2). To assess accuracy, we extracted data on positive predictive values (PPV) and—where available—on sensitivity, specificity, and negative predictive values (NPV). RESULTS: 37 of 39 eligible studies assessed accuracy of International Classification of Diseases (ICD)-coded hospital or death certificate data. They varied widely in their settings, methods, reporting, quality, and in the choice and accuracy of codes. Although PPVs for stroke and its pathological types ranged from 6–97%, appropriately selected, stroke-specific codes (rather than broad cerebrovascular codes) consistently produced PPVs >70%, and in several studies >90%. The few studies with data on sensitivity, specificity and NPV showed higher sensitivity of hospital versus death certificate data for stroke, with specificity and NPV consistently >96%. Few studies assessed either primary care data or combinations of data sources. CONCLUSIONS: Particular stroke-specific codes can yield high PPVs (>90%) for stroke/stroke types. Inclusion of primary care data and combining data sources should improve accuracy in large epidemiological studies, but there is limited published information about these strategies. |
format | Online Article Text |
id | pubmed-4619732 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-46197322015-10-29 Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group Woodfield, Rebecca Grant, Ian Sudlow, Cathie L. M. PLoS One Research Article OBJECTIVE: Long-term follow-up of population-based prospective studies is often achieved through linkages to coded regional or national health care data. Our knowledge of the accuracy of such data is incomplete. To inform methods for identifying stroke cases in UK Biobank (a prospective study of 503,000 UK adults recruited in middle-age), we systematically evaluated the accuracy of these data for stroke and its main pathological types (ischaemic stroke, intracerebral haemorrhage, subarachnoid haemorrhage), determining the optimum codes for case identification. METHODS: We sought studies published from 1990-November 2013, which compared coded data from death certificates, hospital admissions or primary care with a reference standard for stroke or its pathological types. We extracted information on a range of study characteristics and assessed study quality with the Quality Assessment of Diagnostic Studies tool (QUADAS-2). To assess accuracy, we extracted data on positive predictive values (PPV) and—where available—on sensitivity, specificity, and negative predictive values (NPV). RESULTS: 37 of 39 eligible studies assessed accuracy of International Classification of Diseases (ICD)-coded hospital or death certificate data. They varied widely in their settings, methods, reporting, quality, and in the choice and accuracy of codes. Although PPVs for stroke and its pathological types ranged from 6–97%, appropriately selected, stroke-specific codes (rather than broad cerebrovascular codes) consistently produced PPVs >70%, and in several studies >90%. The few studies with data on sensitivity, specificity and NPV showed higher sensitivity of hospital versus death certificate data for stroke, with specificity and NPV consistently >96%. Few studies assessed either primary care data or combinations of data sources. CONCLUSIONS: Particular stroke-specific codes can yield high PPVs (>90%) for stroke/stroke types. Inclusion of primary care data and combining data sources should improve accuracy in large epidemiological studies, but there is limited published information about these strategies. Public Library of Science 2015-10-23 /pmc/articles/PMC4619732/ /pubmed/26496350 http://dx.doi.org/10.1371/journal.pone.0140533 Text en © 2015 Woodfield et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Woodfield, Rebecca Grant, Ian Sudlow, Cathie L. M. Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group |
title | Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group |
title_full | Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group |
title_fullStr | Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group |
title_full_unstemmed | Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group |
title_short | Accuracy of Electronic Health Record Data for Identifying Stroke Cases in Large-Scale Epidemiological Studies: A Systematic Review from the UK Biobank Stroke Outcomes Group |
title_sort | accuracy of electronic health record data for identifying stroke cases in large-scale epidemiological studies: a systematic review from the uk biobank stroke outcomes group |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619732/ https://www.ncbi.nlm.nih.gov/pubmed/26496350 http://dx.doi.org/10.1371/journal.pone.0140533 |
work_keys_str_mv | AT woodfieldrebecca accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup AT grantian accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup AT accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup AT accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup AT sudlowcathielm accuracyofelectronichealthrecorddataforidentifyingstrokecasesinlargescaleepidemiologicalstudiesasystematicreviewfromtheukbiobankstrokeoutcomesgroup |