Approach to record linkage of primary care data from Clinical Practice Research Datalink to other health-related patient data: overview and implications

Record linkage is increasingly used to expand the information available for public health research. An understanding of record linkage methods and the relevant strengths and limitations is important for robust analysis and interpretation of linked data. Here, we describe the approach used by Clinica...

Bibliographic Details
Main authors: Padmanabhan, Shivani; Carty, Lucy; Cameron, Ellen; Ghosh, Rebecca E.; Williams, Rachael; Strongman, Helen
Format: Online Article Text
Language: English
Published: Springer Netherlands 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6325980/
https://www.ncbi.nlm.nih.gov/pubmed/30219957
http://dx.doi.org/10.1007/s10654-018-0442-4
author Padmanabhan, Shivani
Carty, Lucy
Cameron, Ellen
Ghosh, Rebecca E.
Williams, Rachael
Strongman, Helen
collection PubMed
description Record linkage is increasingly used to expand the information available for public health research. An understanding of record linkage methods and the relevant strengths and limitations is important for robust analysis and interpretation of linked data. Here, we describe the approach used by Clinical Practice Research Datalink (CPRD) to link primary care data to other patient level datasets, and the potential implications of this approach for CPRD data analysis. General practice electronic health record software providers separately submit de-identified data to CPRD and patient identifiers to NHS Digital, excluding patients who have opted-out from contributing data. Data custodians for external datasets also send patient identifiers to NHS Digital. NHS Digital uses identifiers to link the datasets using an 8-stage deterministic methodology. CPRD subsequently receives a de-identified linked cohort file and provides researchers with anonymised linked data and metadata detailing the linkage process. This methodology has been used to generate routine primary care linked datasets, including data from Hospital Episode Statistics, Office for National Statistics and National Cancer Registration and Analysis Service. 10.6 million (M) patients from 411 English general practices were included in record linkage in June 2018. 9.1M (86%) patients were of research quality, of which 8.0M (88%) had a valid NHS number and were eligible for linkage in the CPRD standard linked dataset release. Linking CPRD data to other sources improves the range and validity of research studies. This manuscript, together with metadata generated on match strength and linkage eligibility, can be used to inform study design and explore potential linkage-related selection and misclassification biases.
format Online
Article
Text
id pubmed-6325980
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-6325980 2019-01-23 Eur J Epidemiol Data Resources Springer Netherlands 2018-09-15 2019 /pmc/articles/PMC6325980/ /pubmed/30219957 http://dx.doi.org/10.1007/s10654-018-0442-4 Text en © The Author(s) 2018 Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
title Approach to record linkage of primary care data from Clinical Practice Research Datalink to other health-related patient data: overview and implications
topic Data Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6325980/
https://www.ncbi.nlm.nih.gov/pubmed/30219957
http://dx.doi.org/10.1007/s10654-018-0442-4