Cargando…

Privacy-preserving record linkage using Bloom filters

BACKGROUND: Combining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used for the identification of matching reco...

Descripción completa

Detalles Bibliográficos
Autores principales: Schnell, Rainer, Bachteler, Tobias, Reiher, Jörg
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2753305/
https://www.ncbi.nlm.nih.gov/pubmed/19706187
http://dx.doi.org/10.1186/1472-6947-9-41
_version_ 1782172329595371520
author Schnell, Rainer
Bachteler, Tobias
Reiher, Jörg
author_facet Schnell, Rainer
Bachteler, Tobias
Reiher, Jörg
author_sort Schnell, Rainer
collection PubMed
description BACKGROUND: Combining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used for the identification of matching record pairs. In many applications, identifiers have to be encrypted due to privacy concerns. METHODS: A new protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers has been developed. The protocol is based on Bloom filters on q-grams of identifiers. RESULTS: Tests on simulated and actual databases yield linkage results comparable to non-encrypted identifiers and superior to results from phonetic encodings. CONCLUSION: We proposed a protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers. Since the protocol can be easily enhanced and has a low computational burden, the protocol might be useful for many applications requiring privacy-preserving record linkage.
format Text
id pubmed-2753305
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27533052009-09-29 Privacy-preserving record linkage using Bloom filters Schnell, Rainer Bachteler, Tobias Reiher, Jörg BMC Med Inform Decis Mak Technical Advance BACKGROUND: Combining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used for the identification of matching record pairs. In many applications, identifiers have to be encrypted due to privacy concerns. METHODS: A new protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers has been developed. The protocol is based on Bloom filters on q-grams of identifiers. RESULTS: Tests on simulated and actual databases yield linkage results comparable to non-encrypted identifiers and superior to results from phonetic encodings. CONCLUSION: We proposed a protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers. Since the protocol can be easily enhanced and has a low computational burden, the protocol might be useful for many applications requiring privacy-preserving record linkage. BioMed Central 2009-08-25 /pmc/articles/PMC2753305/ /pubmed/19706187 http://dx.doi.org/10.1186/1472-6947-9-41 Text en Copyright ©2009 Schnell et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Advance
Schnell, Rainer
Bachteler, Tobias
Reiher, Jörg
Privacy-preserving record linkage using Bloom filters
title Privacy-preserving record linkage using Bloom filters
title_full Privacy-preserving record linkage using Bloom filters
title_fullStr Privacy-preserving record linkage using Bloom filters
title_full_unstemmed Privacy-preserving record linkage using Bloom filters
title_short Privacy-preserving record linkage using Bloom filters
title_sort privacy-preserving record linkage using bloom filters
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2753305/
https://www.ncbi.nlm.nih.gov/pubmed/19706187
http://dx.doi.org/10.1186/1472-6947-9-41
work_keys_str_mv AT schnellrainer privacypreservingrecordlinkageusingbloomfilters
AT bachtelertobias privacypreservingrecordlinkageusingbloomfilters
AT reiherjorg privacypreservingrecordlinkageusingbloomfilters