Cargando…

Demystifying probabilistic linkage: Common myths and misconceptions

Many of the distinctions made between probabilistic and deterministic linkage are misleading. While these two approaches to record linkage operate in different ways and can produce different outputs, the distinctions between them are more a result of how they are implemented than because of any intr...

Descripción completa

Detalles Bibliográficos
Autores principales: Doidge, James C, Harron, Katie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Swansea University 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6281162/
https://www.ncbi.nlm.nih.gov/pubmed/30533534
http://dx.doi.org/10.23889/ijpds.v3i1.410
_version_ 1783378790255165440
author Doidge, James C
Harron, Katie
author_facet Doidge, James C
Harron, Katie
author_sort Doidge, James C
collection PubMed
description Many of the distinctions made between probabilistic and deterministic linkage are misleading. While these two approaches to record linkage operate in different ways and can produce different outputs, the distinctions between them are more a result of how they are implemented than because of any intrinsic differences. In the way they are generally applied, probabilistic and deterministic procedures can be little more than alternative means to similar ends—or they can arrive at very different ends depending on choices that are made during implementation. Misconceptions about probabilistic linkage contribute to reluctance for implementing it and mistrust of its outputs. We aim to explain how the outputs of either approach can be tailored to suit the intended application, but also to highlight the ways in which probabilistic linkage is generally more flexible, more powerful and more informed by the data. This is accomplished by examining common misconceptions about probabilistic linkage and its difference from deterministic linkage, highlighting the potential impact of design choices on the outputs of either approach. We hope that better understanding of linkage designs will help to allay concerns about probabilistic linkage, and help data linkers to select and tailor procedures to produce outputs that are appropriate for their intended use.
format Online
Article
Text
id pubmed-6281162
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Swansea University
record_format MEDLINE/PubMed
spelling pubmed-62811622018-12-05 Demystifying probabilistic linkage: Common myths and misconceptions Doidge, James C Harron, Katie Int J Popul Data Sci Population Data Science Many of the distinctions made between probabilistic and deterministic linkage are misleading. While these two approaches to record linkage operate in different ways and can produce different outputs, the distinctions between them are more a result of how they are implemented than because of any intrinsic differences. In the way they are generally applied, probabilistic and deterministic procedures can be little more than alternative means to similar ends—or they can arrive at very different ends depending on choices that are made during implementation. Misconceptions about probabilistic linkage contribute to reluctance for implementing it and mistrust of its outputs. We aim to explain how the outputs of either approach can be tailored to suit the intended application, but also to highlight the ways in which probabilistic linkage is generally more flexible, more powerful and more informed by the data. This is accomplished by examining common misconceptions about probabilistic linkage and its difference from deterministic linkage, highlighting the potential impact of design choices on the outputs of either approach. We hope that better understanding of linkage designs will help to allay concerns about probabilistic linkage, and help data linkers to select and tailor procedures to produce outputs that are appropriate for their intended use. Swansea University 2018-01-10 /pmc/articles/PMC6281162/ /pubmed/30533534 http://dx.doi.org/10.23889/ijpds.v3i1.410 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
spellingShingle Population Data Science
Doidge, James C
Harron, Katie
Demystifying probabilistic linkage: Common myths and misconceptions
title Demystifying probabilistic linkage: Common myths and misconceptions
title_full Demystifying probabilistic linkage: Common myths and misconceptions
title_fullStr Demystifying probabilistic linkage: Common myths and misconceptions
title_full_unstemmed Demystifying probabilistic linkage: Common myths and misconceptions
title_short Demystifying probabilistic linkage: Common myths and misconceptions
title_sort demystifying probabilistic linkage: common myths and misconceptions
topic Population Data Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6281162/
https://www.ncbi.nlm.nih.gov/pubmed/30533534
http://dx.doi.org/10.23889/ijpds.v3i1.410
work_keys_str_mv AT doidgejamesc demystifyingprobabilisticlinkagecommonmythsandmisconceptions
AT harronkatie demystifyingprobabilisticlinkagecommonmythsandmisconceptions