Cargando…

Privacy-Preserving Record Linkage of Deidentified Records Within a Public Health Surveillance System: Evaluation Study

BACKGROUND: The Australian Collaboration for Coordinated Enhanced Sentinel Surveillance (ACCESS) was established to monitor national testing and test outcomes for blood-borne viruses (BBVs) and sexually transmissible infections (STIs) in key populations. ACCESS extracts deidentified data from sentin...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Long, Stoové, Mark, Boyle, Douglas, Callander, Denton, McManus, Hamish, Asselin, Jason, Guy, Rebecca, Donovan, Basil, Hellard, Margaret, El-Hayek, Carol
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7381059/
https://www.ncbi.nlm.nih.gov/pubmed/32579128
http://dx.doi.org/10.2196/16757
_version_ 1783562966074916864
author Nguyen, Long
Stoové, Mark
Boyle, Douglas
Callander, Denton
McManus, Hamish
Asselin, Jason
Guy, Rebecca
Donovan, Basil
Hellard, Margaret
El-Hayek, Carol
author_facet Nguyen, Long
Stoové, Mark
Boyle, Douglas
Callander, Denton
McManus, Hamish
Asselin, Jason
Guy, Rebecca
Donovan, Basil
Hellard, Margaret
El-Hayek, Carol
author_sort Nguyen, Long
collection PubMed
description BACKGROUND: The Australian Collaboration for Coordinated Enhanced Sentinel Surveillance (ACCESS) was established to monitor national testing and test outcomes for blood-borne viruses (BBVs) and sexually transmissible infections (STIs) in key populations. ACCESS extracts deidentified data from sentinel health services that include general practice, sexual health, and infectious disease clinics, as well as public and private laboratories that conduct a large volume of BBV/STI testing. An important attribute of ACCESS is the ability to accurately link individual-level records within and between the participating sites, as this enables the system to produce reliable epidemiological measures. OBJECTIVE: The aim of this study was to evaluate the use of GRHANITE software in ACCESS to extract and link deidentified data from participating clinics and laboratories. GRHANITE generates irreversible hashed linkage keys based on patient-identifying data captured in the patient electronic medical records (EMRs) at the site. The algorithms to produce the data linkage keys use probabilistic linkage principles to account for variability and completeness of the underlying patient identifiers, producing up to four linkage key types per EMR. Errors in the linkage process can arise from imperfect or missing identifiers, impacting the system’s integrity. Therefore, it is important to evaluate the quality of the linkages created and evaluate the outcome of the linkage for ongoing public health surveillance. METHODS: Although ACCESS data are deidentified, we created two gold-standard datasets where the true match status could be confirmed in order to compare against record linkage results arising from different approaches of the GRHANITE Linkage Tool. We reported sensitivity, specificity, and positive and negative predictive values where possible and estimated specificity by comparing a history of HIV and hepatitis C antibody results for linked EMRs. RESULTS: Sensitivity ranged from 96% to 100%, and specificity was 100% when applying the GRHANITE Linkage Tool to a small gold-standard dataset of 3700 clinical medical records. Medical records in this dataset contained a very high level of data completeness by having the name, date of birth, post code, and Medicare number available for use in record linkage. In a larger gold-standard dataset containing 86,538 medical records across clinics and pathology services, with a lower level of data completeness, sensitivity ranged from 94% to 95% and estimated specificity ranged from 91% to 99% in 4 of the 6 different record linkage approaches. CONCLUSIONS: This study’s findings suggest that the GRHANITE Linkage Tool can be used to link deidentified patient records accurately and can be confidently used for public health surveillance in systems such as ACCESS.
format Online
Article
Text
id pubmed-7381059
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-73810592020-08-06 Privacy-Preserving Record Linkage of Deidentified Records Within a Public Health Surveillance System: Evaluation Study Nguyen, Long Stoové, Mark Boyle, Douglas Callander, Denton McManus, Hamish Asselin, Jason Guy, Rebecca Donovan, Basil Hellard, Margaret El-Hayek, Carol J Med Internet Res Original Paper BACKGROUND: The Australian Collaboration for Coordinated Enhanced Sentinel Surveillance (ACCESS) was established to monitor national testing and test outcomes for blood-borne viruses (BBVs) and sexually transmissible infections (STIs) in key populations. ACCESS extracts deidentified data from sentinel health services that include general practice, sexual health, and infectious disease clinics, as well as public and private laboratories that conduct a large volume of BBV/STI testing. An important attribute of ACCESS is the ability to accurately link individual-level records within and between the participating sites, as this enables the system to produce reliable epidemiological measures. OBJECTIVE: The aim of this study was to evaluate the use of GRHANITE software in ACCESS to extract and link deidentified data from participating clinics and laboratories. GRHANITE generates irreversible hashed linkage keys based on patient-identifying data captured in the patient electronic medical records (EMRs) at the site. The algorithms to produce the data linkage keys use probabilistic linkage principles to account for variability and completeness of the underlying patient identifiers, producing up to four linkage key types per EMR. Errors in the linkage process can arise from imperfect or missing identifiers, impacting the system’s integrity. Therefore, it is important to evaluate the quality of the linkages created and evaluate the outcome of the linkage for ongoing public health surveillance. METHODS: Although ACCESS data are deidentified, we created two gold-standard datasets where the true match status could be confirmed in order to compare against record linkage results arising from different approaches of the GRHANITE Linkage Tool. We reported sensitivity, specificity, and positive and negative predictive values where possible and estimated specificity by comparing a history of HIV and hepatitis C antibody results for linked EMRs. RESULTS: Sensitivity ranged from 96% to 100%, and specificity was 100% when applying the GRHANITE Linkage Tool to a small gold-standard dataset of 3700 clinical medical records. Medical records in this dataset contained a very high level of data completeness by having the name, date of birth, post code, and Medicare number available for use in record linkage. In a larger gold-standard dataset containing 86,538 medical records across clinics and pathology services, with a lower level of data completeness, sensitivity ranged from 94% to 95% and estimated specificity ranged from 91% to 99% in 4 of the 6 different record linkage approaches. CONCLUSIONS: This study’s findings suggest that the GRHANITE Linkage Tool can be used to link deidentified patient records accurately and can be confidently used for public health surveillance in systems such as ACCESS. JMIR Publications 2020-06-24 /pmc/articles/PMC7381059/ /pubmed/32579128 http://dx.doi.org/10.2196/16757 Text en ©Long Nguyen, Mark Stoové, Douglas Boyle, Denton Callander, Hamish McManus, Jason Asselin, Rebecca Guy, Basil Donovan, Margaret Hellard, Carol El-Hayek. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 24.06.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Nguyen, Long
Stoové, Mark
Boyle, Douglas
Callander, Denton
McManus, Hamish
Asselin, Jason
Guy, Rebecca
Donovan, Basil
Hellard, Margaret
El-Hayek, Carol
Privacy-Preserving Record Linkage of Deidentified Records Within a Public Health Surveillance System: Evaluation Study
title Privacy-Preserving Record Linkage of Deidentified Records Within a Public Health Surveillance System: Evaluation Study
title_full Privacy-Preserving Record Linkage of Deidentified Records Within a Public Health Surveillance System: Evaluation Study
title_fullStr Privacy-Preserving Record Linkage of Deidentified Records Within a Public Health Surveillance System: Evaluation Study
title_full_unstemmed Privacy-Preserving Record Linkage of Deidentified Records Within a Public Health Surveillance System: Evaluation Study
title_short Privacy-Preserving Record Linkage of Deidentified Records Within a Public Health Surveillance System: Evaluation Study
title_sort privacy-preserving record linkage of deidentified records within a public health surveillance system: evaluation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7381059/
https://www.ncbi.nlm.nih.gov/pubmed/32579128
http://dx.doi.org/10.2196/16757
work_keys_str_mv AT nguyenlong privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT stoovemark privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT boyledouglas privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT callanderdenton privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT mcmanushamish privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT asselinjason privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT guyrebecca privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT donovanbasil privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT hellardmargaret privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy
AT elhayekcarol privacypreservingrecordlinkageofdeidentifiedrecordswithinapublichealthsurveillancesystemevaluationstudy