Cargando…

Probabilistic linkage without personal information successfully linked national clinical datasets

BACKGROUND: Probabilistic linkage can link patients from different clinical databases without the need for personal information. If accurate linkage can be achieved, it would accelerate the use of linked datasets to address important clinical and public health questions. OBJECTIVE: We developed a st...

Descripción completa

Detalles Bibliográficos
Autores principales: Blake, Helen A., Sharples, Linda D., Harron, Katie, van der Meulen, Jan H., Walker, Kate
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8443839/
https://www.ncbi.nlm.nih.gov/pubmed/33932483
http://dx.doi.org/10.1016/j.jclinepi.2021.04.015
_version_ 1784568375445291008
author Blake, Helen A.
Sharples, Linda D.
Harron, Katie
van der Meulen, Jan H.
Walker, Kate
author_facet Blake, Helen A.
Sharples, Linda D.
Harron, Katie
van der Meulen, Jan H.
Walker, Kate
author_sort Blake, Helen A.
collection PubMed
description BACKGROUND: Probabilistic linkage can link patients from different clinical databases without the need for personal information. If accurate linkage can be achieved, it would accelerate the use of linked datasets to address important clinical and public health questions. OBJECTIVE: We developed a step-by-step process for probabilistic linkage of national clinical and administrative datasets without personal information, and validated it against deterministic linkage using patient identifiers. STUDY DESIGN AND SETTING: We used electronic health records from the National Bowel Cancer Audit and Hospital Episode Statistics databases for 10,566 bowel cancer patients undergoing emergency surgery in the English National Health Service. RESULTS: Probabilistic linkage linked 81.4% of National Bowel Cancer Audit records to Hospital Episode Statistics, vs. 82.8% using deterministic linkage. No systematic differences were seen between patients that were and were not linked, and regression models for mortality and length of hospital stay according to patient and tumour characteristics were not sensitive to the linkage approach. CONCLUSION: Probabilistic linkage was successful in linking national clinical and administrative datasets for patients undergoing a major surgical procedure. It allows analysts outside highly secure data environments to undertake linkage while minimizing costs and delays, protecting data security, and maintaining linkage quality.
format Online
Article
Text
id pubmed-8443839
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-84438392021-09-22 Probabilistic linkage without personal information successfully linked national clinical datasets Blake, Helen A. Sharples, Linda D. Harron, Katie van der Meulen, Jan H. Walker, Kate J Clin Epidemiol Original Article BACKGROUND: Probabilistic linkage can link patients from different clinical databases without the need for personal information. If accurate linkage can be achieved, it would accelerate the use of linked datasets to address important clinical and public health questions. OBJECTIVE: We developed a step-by-step process for probabilistic linkage of national clinical and administrative datasets without personal information, and validated it against deterministic linkage using patient identifiers. STUDY DESIGN AND SETTING: We used electronic health records from the National Bowel Cancer Audit and Hospital Episode Statistics databases for 10,566 bowel cancer patients undergoing emergency surgery in the English National Health Service. RESULTS: Probabilistic linkage linked 81.4% of National Bowel Cancer Audit records to Hospital Episode Statistics, vs. 82.8% using deterministic linkage. No systematic differences were seen between patients that were and were not linked, and regression models for mortality and length of hospital stay according to patient and tumour characteristics were not sensitive to the linkage approach. CONCLUSION: Probabilistic linkage was successful in linking national clinical and administrative datasets for patients undergoing a major surgical procedure. It allows analysts outside highly secure data environments to undertake linkage while minimizing costs and delays, protecting data security, and maintaining linkage quality. Elsevier 2021-08 /pmc/articles/PMC8443839/ /pubmed/33932483 http://dx.doi.org/10.1016/j.jclinepi.2021.04.015 Text en © 2021 The Author(s). Published by Elsevier Inc. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Article
Blake, Helen A.
Sharples, Linda D.
Harron, Katie
van der Meulen, Jan H.
Walker, Kate
Probabilistic linkage without personal information successfully linked national clinical datasets
title Probabilistic linkage without personal information successfully linked national clinical datasets
title_full Probabilistic linkage without personal information successfully linked national clinical datasets
title_fullStr Probabilistic linkage without personal information successfully linked national clinical datasets
title_full_unstemmed Probabilistic linkage without personal information successfully linked national clinical datasets
title_short Probabilistic linkage without personal information successfully linked national clinical datasets
title_sort probabilistic linkage without personal information successfully linked national clinical datasets
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8443839/
https://www.ncbi.nlm.nih.gov/pubmed/33932483
http://dx.doi.org/10.1016/j.jclinepi.2021.04.015
work_keys_str_mv AT blakehelena probabilisticlinkagewithoutpersonalinformationsuccessfullylinkednationalclinicaldatasets
AT sharpleslindad probabilisticlinkagewithoutpersonalinformationsuccessfullylinkednationalclinicaldatasets
AT harronkatie probabilisticlinkagewithoutpersonalinformationsuccessfullylinkednationalclinicaldatasets
AT vandermeulenjanh probabilisticlinkagewithoutpersonalinformationsuccessfullylinkednationalclinicaldatasets
AT walkerkate probabilisticlinkagewithoutpersonalinformationsuccessfullylinkednationalclinicaldatasets