Cargando…

Participation in patient support forums may put rare disease patient data at risk of re-identification

BACKGROUND: Rare disease patients often struggle to find both medical advice and emotional support for their diagnosis. Consequently, many rare disease patient support forums have appeared on hospital webpages, social media sites, and on rare disease foundation sites. However, we argue that engageme...

Descripción completa

Detalles Bibliográficos
Autores principales: Gow, James, Moffatt, Colin, Blackport, Jamie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7457524/
https://www.ncbi.nlm.nih.gov/pubmed/32867839
http://dx.doi.org/10.1186/s13023-020-01497-3
_version_ 1783576011586142208
author Gow, James
Moffatt, Colin
Blackport, Jamie
author_facet Gow, James
Moffatt, Colin
Blackport, Jamie
author_sort Gow, James
collection PubMed
description BACKGROUND: Rare disease patients often struggle to find both medical advice and emotional support for their diagnosis. Consequently, many rare disease patient support forums have appeared on hospital webpages, social media sites, and on rare disease foundation sites. However, we argue that engagement in these groups may pose a healthcare data privacy threat to many participants, since it makes a series of patient indirect identifiers ‘readily available’ in combination with rare disease conditions. This information produces a risk of re-identification because it may allow a motivated attacker to use the unique combination of a patient’s identifiers and disease condition to re-identify them in anonymized data. RESULTS: To assess this risk of re-identification, patient direct and indirect identifiers were mined from patient support forums for 80 patients across eight rare diseases. This data mining consisted of scanning patient testimonials, social media sites, and public records for the collection of identifiers linked to a rare disease patient. The number of people in the United States that may share each patient’s combination of marital status, 3-digit ZIP code, age, and sex, as well as their rare disease condition, was then estimated, as such information is commonly found in health records which have undergone de-identification by HIPAA’s ‘Safe Harbor.’ The study showed that by these estimations, nearly 75% of patients could be at high risk for re-identification in healthcare datasets in which they appear, due to their unique combination of identifiers. CONCLUSIONS: The results of this study show that these rare disease patients, due to their choice to provide support for their community, are putting all their healthcare data at risk of re-identification. This paper demonstrates how simple adjustments to participation guidelines in such support forums, in combination with improved privacy measures at the organizational level, could mitigate this risk of re-identification. Additionally, this paper suggests the potential for future investigation into consideration of certain ‘risky’ International Classification of Diseases (ICD) codes as quasi-identifiers in de-identified datasets to further protect patients’ privacy, while maintaining the utility of such rare disease support groups.
format Online
Article
Text
id pubmed-7457524
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-74575242020-08-31 Participation in patient support forums may put rare disease patient data at risk of re-identification Gow, James Moffatt, Colin Blackport, Jamie Orphanet J Rare Dis Research BACKGROUND: Rare disease patients often struggle to find both medical advice and emotional support for their diagnosis. Consequently, many rare disease patient support forums have appeared on hospital webpages, social media sites, and on rare disease foundation sites. However, we argue that engagement in these groups may pose a healthcare data privacy threat to many participants, since it makes a series of patient indirect identifiers ‘readily available’ in combination with rare disease conditions. This information produces a risk of re-identification because it may allow a motivated attacker to use the unique combination of a patient’s identifiers and disease condition to re-identify them in anonymized data. RESULTS: To assess this risk of re-identification, patient direct and indirect identifiers were mined from patient support forums for 80 patients across eight rare diseases. This data mining consisted of scanning patient testimonials, social media sites, and public records for the collection of identifiers linked to a rare disease patient. The number of people in the United States that may share each patient’s combination of marital status, 3-digit ZIP code, age, and sex, as well as their rare disease condition, was then estimated, as such information is commonly found in health records which have undergone de-identification by HIPAA’s ‘Safe Harbor.’ The study showed that by these estimations, nearly 75% of patients could be at high risk for re-identification in healthcare datasets in which they appear, due to their unique combination of identifiers. CONCLUSIONS: The results of this study show that these rare disease patients, due to their choice to provide support for their community, are putting all their healthcare data at risk of re-identification. This paper demonstrates how simple adjustments to participation guidelines in such support forums, in combination with improved privacy measures at the organizational level, could mitigate this risk of re-identification. Additionally, this paper suggests the potential for future investigation into consideration of certain ‘risky’ International Classification of Diseases (ICD) codes as quasi-identifiers in de-identified datasets to further protect patients’ privacy, while maintaining the utility of such rare disease support groups. BioMed Central 2020-08-31 /pmc/articles/PMC7457524/ /pubmed/32867839 http://dx.doi.org/10.1186/s13023-020-01497-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Gow, James
Moffatt, Colin
Blackport, Jamie
Participation in patient support forums may put rare disease patient data at risk of re-identification
title Participation in patient support forums may put rare disease patient data at risk of re-identification
title_full Participation in patient support forums may put rare disease patient data at risk of re-identification
title_fullStr Participation in patient support forums may put rare disease patient data at risk of re-identification
title_full_unstemmed Participation in patient support forums may put rare disease patient data at risk of re-identification
title_short Participation in patient support forums may put rare disease patient data at risk of re-identification
title_sort participation in patient support forums may put rare disease patient data at risk of re-identification
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7457524/
https://www.ncbi.nlm.nih.gov/pubmed/32867839
http://dx.doi.org/10.1186/s13023-020-01497-3
work_keys_str_mv AT gowjames participationinpatientsupportforumsmayputrarediseasepatientdataatriskofreidentification
AT moffattcolin participationinpatientsupportforumsmayputrarediseasepatientdataatriskofreidentification
AT blackportjamie participationinpatientsupportforumsmayputrarediseasepatientdataatriskofreidentification