Cargando…

Designing an Algorithm to Preserve Privacy for Medical Record Linkage With Error-Prone Data

BACKGROUND: Linking medical records across different medical service providers is important to the enhancement of health care quality and public health surveillance. In records linkage, protecting the patients’ privacy is a primary requirement. In real-world health care databases, records may well c...

Descripción completa

Detalles Bibliográficos
Autores principales: Pal, Doyel, Chen, Tingting, Zhong, Sheng, Khethavath, Praveen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Gunther Eysenbach 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288117/
https://www.ncbi.nlm.nih.gov/pubmed/25600786
http://dx.doi.org/10.2196/medinform.3090
_version_ 1782351912926969856
author Pal, Doyel
Chen, Tingting
Zhong, Sheng
Khethavath, Praveen
author_facet Pal, Doyel
Chen, Tingting
Zhong, Sheng
Khethavath, Praveen
author_sort Pal, Doyel
collection PubMed
description BACKGROUND: Linking medical records across different medical service providers is important to the enhancement of health care quality and public health surveillance. In records linkage, protecting the patients’ privacy is a primary requirement. In real-world health care databases, records may well contain errors due to various reasons such as typos. Linking the error-prone data and preserving data privacy at the same time are very difficult. Existing privacy preserving solutions for this problem are only restricted to textual data. OBJECTIVE: To enable different medical service providers to link their error-prone data in a private way, our aim was to provide a holistic solution by designing and developing a medical record linkage system for medical service providers. METHODS: To initiate a record linkage, one provider selects one of its collaborators in the Connection Management Module, chooses some attributes of the database to be matched, and establishes the connection with the collaborator after the negotiation. In the Data Matching Module, for error-free data, our solution offered two different choices for cryptographic schemes. For error-prone numerical data, we proposed a newly designed privacy preserving linking algorithm named the Error-Tolerant Linking Algorithm, that allows the error-prone data to be correctly matched if the distance between the two records is below a threshold. RESULTS: We designed and developed a comprehensive and user-friendly software system that provides privacy preserving record linkage functions for medical service providers, which meets the regulation of Health Insurance Portability and Accountability Act. It does not require a third party and it is secure in that neither entity can learn the records in the other’s database. Moreover, our novel Error-Tolerant Linking Algorithm implemented in this software can work well with error-prone numerical data. We theoretically proved the correctness and security of our Error-Tolerant Linking Algorithm. We have also fully implemented the software. The experimental results showed that it is reliable and efficient. The design of our software is open so that the existing textual matching methods can be easily integrated into the system. CONCLUSIONS: Designing algorithms to enable medical records linkage for error-prone numerical data and protect data privacy at the same time is difficult. Our proposed solution does not need a trusted third party and is secure in that in the linking process, neither entity can learn the records in the other’s database.
format Online
Article
Text
id pubmed-4288117
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Gunther Eysenbach
record_format MEDLINE/PubMed
spelling pubmed-42881172015-01-15 Designing an Algorithm to Preserve Privacy for Medical Record Linkage With Error-Prone Data Pal, Doyel Chen, Tingting Zhong, Sheng Khethavath, Praveen JMIR Med Inform Original Paper BACKGROUND: Linking medical records across different medical service providers is important to the enhancement of health care quality and public health surveillance. In records linkage, protecting the patients’ privacy is a primary requirement. In real-world health care databases, records may well contain errors due to various reasons such as typos. Linking the error-prone data and preserving data privacy at the same time are very difficult. Existing privacy preserving solutions for this problem are only restricted to textual data. OBJECTIVE: To enable different medical service providers to link their error-prone data in a private way, our aim was to provide a holistic solution by designing and developing a medical record linkage system for medical service providers. METHODS: To initiate a record linkage, one provider selects one of its collaborators in the Connection Management Module, chooses some attributes of the database to be matched, and establishes the connection with the collaborator after the negotiation. In the Data Matching Module, for error-free data, our solution offered two different choices for cryptographic schemes. For error-prone numerical data, we proposed a newly designed privacy preserving linking algorithm named the Error-Tolerant Linking Algorithm, that allows the error-prone data to be correctly matched if the distance between the two records is below a threshold. RESULTS: We designed and developed a comprehensive and user-friendly software system that provides privacy preserving record linkage functions for medical service providers, which meets the regulation of Health Insurance Portability and Accountability Act. It does not require a third party and it is secure in that neither entity can learn the records in the other’s database. Moreover, our novel Error-Tolerant Linking Algorithm implemented in this software can work well with error-prone numerical data. We theoretically proved the correctness and security of our Error-Tolerant Linking Algorithm. We have also fully implemented the software. The experimental results showed that it is reliable and efficient. The design of our software is open so that the existing textual matching methods can be easily integrated into the system. CONCLUSIONS: Designing algorithms to enable medical records linkage for error-prone numerical data and protect data privacy at the same time is difficult. Our proposed solution does not need a trusted third party and is secure in that in the linking process, neither entity can learn the records in the other’s database. Gunther Eysenbach 2014-01-20 /pmc/articles/PMC4288117/ /pubmed/25600786 http://dx.doi.org/10.2196/medinform.3090 Text en ©Doyel Pal, Tingting Chen, Sheng Zhong, Praveen Khethavath. Originally published in JMIR Research Protocols (http://medinform.jmir.org), 20.01.2014. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Research Protocols, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Pal, Doyel
Chen, Tingting
Zhong, Sheng
Khethavath, Praveen
Designing an Algorithm to Preserve Privacy for Medical Record Linkage With Error-Prone Data
title Designing an Algorithm to Preserve Privacy for Medical Record Linkage With Error-Prone Data
title_full Designing an Algorithm to Preserve Privacy for Medical Record Linkage With Error-Prone Data
title_fullStr Designing an Algorithm to Preserve Privacy for Medical Record Linkage With Error-Prone Data
title_full_unstemmed Designing an Algorithm to Preserve Privacy for Medical Record Linkage With Error-Prone Data
title_short Designing an Algorithm to Preserve Privacy for Medical Record Linkage With Error-Prone Data
title_sort designing an algorithm to preserve privacy for medical record linkage with error-prone data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288117/
https://www.ncbi.nlm.nih.gov/pubmed/25600786
http://dx.doi.org/10.2196/medinform.3090
work_keys_str_mv AT paldoyel designinganalgorithmtopreserveprivacyformedicalrecordlinkagewitherrorpronedata
AT chentingting designinganalgorithmtopreserveprivacyformedicalrecordlinkagewitherrorpronedata
AT zhongsheng designinganalgorithmtopreserveprivacyformedicalrecordlinkagewitherrorpronedata
AT khethavathpraveen designinganalgorithmtopreserveprivacyformedicalrecordlinkagewitherrorpronedata