Cargando…

Linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’

BACKGROUND: Methods for linking records between two datasets are well established. However, guidance is needed for linking more than two datasets. Using all ‘pairwise linkages’—linking each dataset to every other dataset—is the most inclusive, but resource-intensive, approach. The ‘spine’ approach l...

Descripción completa

Detalles Bibliográficos
Autores principales: Blake, Helen A, Sharples, Linda D, Harron, Katie, van der Meulen, Jan H, Walker, Kate
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908066/
https://www.ncbi.nlm.nih.gov/pubmed/35748342
http://dx.doi.org/10.1093/ije/dyac130
_version_ 1784884305243144192
author Blake, Helen A
Sharples, Linda D
Harron, Katie
van der Meulen, Jan H
Walker, Kate
author_facet Blake, Helen A
Sharples, Linda D
Harron, Katie
van der Meulen, Jan H
Walker, Kate
author_sort Blake, Helen A
collection PubMed
description BACKGROUND: Methods for linking records between two datasets are well established. However, guidance is needed for linking more than two datasets. Using all ‘pairwise linkages’—linking each dataset to every other dataset—is the most inclusive, but resource-intensive, approach. The ‘spine’ approach links each dataset to a designated ‘spine dataset’, reducing the number of linkages, but potentially reducing linkage quality. METHODS: We compared the pairwise and spine linkage approaches using real-world data on patients undergoing emergency bowel cancer surgery between 31 October 2013 and 30 April 2018. We linked an administrative hospital dataset (Hospital Episode Statistics; HES) capturing patients admitted to hospitals in England, and two clinical datasets comprising patients diagnosed with bowel cancer and patients undergoing emergency bowel surgery. RESULTS: The spine linkage approach, with HES as the spine dataset, created an analysis cohort of 15 826 patients, equating to 98.3% of the 16 100 patients identified using the pairwise linkage approach. There were no systematic differences in patient characteristics between these analysis cohorts. Associations of patient and tumour characteristics with mortality, complications and length of stay were not sensitive to the linkage approach. When eligibility criteria were applied before linkage, spine linkage included 14 509 patients (90.0% compared with pairwise linkage). CONCLUSION: Spine linkage can be used as an efficient alternative to pairwise linkage if case ascertainment in the spine dataset and data quality of linkage variables are high. These aspects should be systematically evaluated in the nominated spine dataset before spine linkage is used to create the analysis cohort.
format Online
Article
Text
id pubmed-9908066
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99080662023-02-09 Linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’ Blake, Helen A Sharples, Linda D Harron, Katie van der Meulen, Jan H Walker, Kate Int J Epidemiol Methods BACKGROUND: Methods for linking records between two datasets are well established. However, guidance is needed for linking more than two datasets. Using all ‘pairwise linkages’—linking each dataset to every other dataset—is the most inclusive, but resource-intensive, approach. The ‘spine’ approach links each dataset to a designated ‘spine dataset’, reducing the number of linkages, but potentially reducing linkage quality. METHODS: We compared the pairwise and spine linkage approaches using real-world data on patients undergoing emergency bowel cancer surgery between 31 October 2013 and 30 April 2018. We linked an administrative hospital dataset (Hospital Episode Statistics; HES) capturing patients admitted to hospitals in England, and two clinical datasets comprising patients diagnosed with bowel cancer and patients undergoing emergency bowel surgery. RESULTS: The spine linkage approach, with HES as the spine dataset, created an analysis cohort of 15 826 patients, equating to 98.3% of the 16 100 patients identified using the pairwise linkage approach. There were no systematic differences in patient characteristics between these analysis cohorts. Associations of patient and tumour characteristics with mortality, complications and length of stay were not sensitive to the linkage approach. When eligibility criteria were applied before linkage, spine linkage included 14 509 patients (90.0% compared with pairwise linkage). CONCLUSION: Spine linkage can be used as an efficient alternative to pairwise linkage if case ascertainment in the spine dataset and data quality of linkage variables are high. These aspects should be systematically evaluated in the nominated spine dataset before spine linkage is used to create the analysis cohort. Oxford University Press 2022-06-24 /pmc/articles/PMC9908066/ /pubmed/35748342 http://dx.doi.org/10.1093/ije/dyac130 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the International Epidemiological Association. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods
Blake, Helen A
Sharples, Linda D
Harron, Katie
van der Meulen, Jan H
Walker, Kate
Linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’
title Linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’
title_full Linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’
title_fullStr Linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’
title_full_unstemmed Linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’
title_short Linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’
title_sort linkage of multiple electronic health record datasets using a ‘spine linkage’ approach compared with all ‘pairwise linkages’
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908066/
https://www.ncbi.nlm.nih.gov/pubmed/35748342
http://dx.doi.org/10.1093/ije/dyac130
work_keys_str_mv AT blakehelena linkageofmultipleelectronichealthrecorddatasetsusingaspinelinkageapproachcomparedwithallpairwiselinkages
AT sharpleslindad linkageofmultipleelectronichealthrecorddatasetsusingaspinelinkageapproachcomparedwithallpairwiselinkages
AT harronkatie linkageofmultipleelectronichealthrecorddatasetsusingaspinelinkageapproachcomparedwithallpairwiselinkages
AT vandermeulenjanh linkageofmultipleelectronichealthrecorddatasetsusingaspinelinkageapproachcomparedwithallpairwiselinkages
AT walkerkate linkageofmultipleelectronichealthrecorddatasetsusingaspinelinkageapproachcomparedwithallpairwiselinkages