Probabilistic record linkage
Studies involving the use of probabilistic record linkage are becoming increasingly common. However, the methods underpinning probabilistic record linkage are not widely taught or understood, and therefore these studies can appear to be a ‘black box’ research tool. In this article, we aim to describ...
Main authors: | Sayers, Adrian; Ben-Shlomo, Yoav; Blom, Ashley W; Steele, Fiona |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2016 |
Subjects: | Education Corner |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5005943/ https://www.ncbi.nlm.nih.gov/pubmed/26686842 http://dx.doi.org/10.1093/ije/dyv322 |
_version_ | 1782450981744672768 |
---|---|
author | Sayers, Adrian; Ben-Shlomo, Yoav; Blom, Ashley W; Steele, Fiona |
author_facet | Sayers, Adrian; Ben-Shlomo, Yoav; Blom, Ashley W; Steele, Fiona |
author_sort | Sayers, Adrian |
collection | PubMed |
description | Studies involving the use of probabilistic record linkage are becoming increasingly common. However, the methods underpinning probabilistic record linkage are not widely taught or understood, and therefore these studies can appear to be a ‘black box’ research tool. In this article, we aim to describe the process of probabilistic record linkage through a simple exemplar. We first introduce the concept of deterministic linkage and contrast this with probabilistic linkage. We illustrate each step of the process using a simple exemplar and describe the data structure required to perform a probabilistic linkage. We describe the process of calculating and interpreting matched weights and how to convert matched weights into posterior probabilities of a match using Bayes theorem. We conclude this article with a brief discussion of some of the computational demands of record linkage, how you might assess the quality of your linkage algorithm, and how epidemiologists can maximize the value of their record-linked research using robust record linkage methods. |
format | Online Article Text |
id | pubmed-5005943 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-50059432016-09-06 Probabilistic record linkage Sayers, Adrian Ben-Shlomo, Yoav Blom, Ashley W Steele, Fiona Int J Epidemiol Education Corner Studies involving the use of probabilistic record linkage are becoming increasingly common. However, the methods underpinning probabilistic record linkage are not widely taught or understood, and therefore these studies can appear to be a ‘black box’ research tool. In this article, we aim to describe the process of probabilistic record linkage through a simple exemplar. We first introduce the concept of deterministic linkage and contrast this with probabilistic linkage. We illustrate each step of the process using a simple exemplar and describe the data structure required to perform a probabilistic linkage. We describe the process of calculating and interpreting matched weights and how to convert matched weights into posterior probabilities of a match using Bayes theorem. We conclude this article with a brief discussion of some of the computational demands of record linkage, how you might assess the quality of your linkage algorithm, and how epidemiologists can maximize the value of their record-linked research using robust record linkage methods. Oxford University Press 2016-06 2015-12-20 /pmc/articles/PMC5005943/ /pubmed/26686842 http://dx.doi.org/10.1093/ije/dyv322 Text en © The Author 2015; Published by Oxford University Press on behalf of the International Epidemiological Association http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Education Corner Sayers, Adrian Ben-Shlomo, Yoav Blom, Ashley W Steele, Fiona Probabilistic record linkage |
title | Probabilistic record linkage |
title_full | Probabilistic record linkage |
title_fullStr | Probabilistic record linkage |
title_full_unstemmed | Probabilistic record linkage |
title_short | Probabilistic record linkage |
title_sort | probabilistic record linkage |
topic | Education Corner |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5005943/ https://www.ncbi.nlm.nih.gov/pubmed/26686842 http://dx.doi.org/10.1093/ije/dyv322 |
work_keys_str_mv | AT sayersadrian probabilisticrecordlinkage AT benshlomoyoav probabilisticrecordlinkage AT blomashleyw probabilisticrecordlinkage AT steelefiona probabilisticrecordlinkage |