Cargando…

Probabilistic record linkage

Studies involving the use of probabilistic record linkage are becoming increasingly common. However, the methods underpinning probabilistic record linkage are not widely taught or understood, and therefore these studies can appear to be a ‘black box’ research tool. In this article, we aim to describ...

Descripción completa

Detalles Bibliográficos
Autores principales: Sayers, Adrian, Ben-Shlomo, Yoav, Blom, Ashley W, Steele, Fiona
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5005943/
https://www.ncbi.nlm.nih.gov/pubmed/26686842
http://dx.doi.org/10.1093/ije/dyv322
_version_ 1782450981744672768
author Sayers, Adrian
Ben-Shlomo, Yoav
Blom, Ashley W
Steele, Fiona
author_facet Sayers, Adrian
Ben-Shlomo, Yoav
Blom, Ashley W
Steele, Fiona
author_sort Sayers, Adrian
collection PubMed
description Studies involving the use of probabilistic record linkage are becoming increasingly common. However, the methods underpinning probabilistic record linkage are not widely taught or understood, and therefore these studies can appear to be a ‘black box’ research tool. In this article, we aim to describe the process of probabilistic record linkage through a simple exemplar. We first introduce the concept of deterministic linkage and contrast this with probabilistic linkage. We illustrate each step of the process using a simple exemplar and describe the data structure required to perform a probabilistic linkage. We describe the process of calculating and interpreting matched weights and how to convert matched weights into posterior probabilities of a match using Bayes theorem. We conclude this article with a brief discussion of some of the computational demands of record linkage, how you might assess the quality of your linkage algorithm, and how epidemiologists can maximize the value of their record-linked research using robust record linkage methods.
format Online
Article
Text
id pubmed-5005943
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-50059432016-09-06 Probabilistic record linkage Sayers, Adrian Ben-Shlomo, Yoav Blom, Ashley W Steele, Fiona Int J Epidemiol Education Corner Studies involving the use of probabilistic record linkage are becoming increasingly common. However, the methods underpinning probabilistic record linkage are not widely taught or understood, and therefore these studies can appear to be a ‘black box’ research tool. In this article, we aim to describe the process of probabilistic record linkage through a simple exemplar. We first introduce the concept of deterministic linkage and contrast this with probabilistic linkage. We illustrate each step of the process using a simple exemplar and describe the data structure required to perform a probabilistic linkage. We describe the process of calculating and interpreting matched weights and how to convert matched weights into posterior probabilities of a match using Bayes theorem. We conclude this article with a brief discussion of some of the computational demands of record linkage, how you might assess the quality of your linkage algorithm, and how epidemiologists can maximize the value of their record-linked research using robust record linkage methods. Oxford University Press 2016-06 2015-12-20 /pmc/articles/PMC5005943/ /pubmed/26686842 http://dx.doi.org/10.1093/ije/dyv322 Text en © The Author 2015; Published by Oxford University Press on behalf of the International Epidemiological Association http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Education Corner
Sayers, Adrian
Ben-Shlomo, Yoav
Blom, Ashley W
Steele, Fiona
Probabilistic record linkage
title Probabilistic record linkage
title_full Probabilistic record linkage
title_fullStr Probabilistic record linkage
title_full_unstemmed Probabilistic record linkage
title_short Probabilistic record linkage
title_sort probabilistic record linkage
topic Education Corner
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5005943/
https://www.ncbi.nlm.nih.gov/pubmed/26686842
http://dx.doi.org/10.1093/ije/dyv322
work_keys_str_mv AT sayersadrian probabilisticrecordlinkage
AT benshlomoyoav probabilisticrecordlinkage
AT blomashleyw probabilisticrecordlinkage
AT steelefiona probabilisticrecordlinkage