Participation in patient support forums may put rare disease patient data at risk of re-identification
BACKGROUND: Rare disease patients often struggle to find both medical advice and emotional support for their diagnosis. Consequently, many rare disease patient support forums have appeared on hospital webpages, social media sites, and on rare disease foundation sites. However, we argue that engageme...
Main Authors: | Gow, James; Moffatt, Colin; Blackport, Jamie |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2020 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7457524/ https://www.ncbi.nlm.nih.gov/pubmed/32867839 http://dx.doi.org/10.1186/s13023-020-01497-3 |
_version_ | 1783576011586142208 |
---|---|
author | Gow, James Moffatt, Colin Blackport, Jamie |
author_facet | Gow, James Moffatt, Colin Blackport, Jamie |
author_sort | Gow, James |
collection | PubMed |
description | BACKGROUND: Rare disease patients often struggle to find both medical advice and emotional support for their diagnosis. Consequently, many rare disease patient support forums have appeared on hospital webpages, social media sites, and on rare disease foundation sites. However, we argue that engagement in these groups may pose a healthcare data privacy threat to many participants, since it makes a series of patient indirect identifiers ‘readily available’ in combination with rare disease conditions. This information produces a risk of re-identification because it may allow a motivated attacker to use the unique combination of a patient’s identifiers and disease condition to re-identify them in anonymized data. RESULTS: To assess this risk of re-identification, patient direct and indirect identifiers were mined from patient support forums for 80 patients across eight rare diseases. This data mining consisted of scanning patient testimonials, social media sites, and public records for the collection of identifiers linked to a rare disease patient. The number of people in the United States that may share each patient’s combination of marital status, 3-digit ZIP code, age, and sex, as well as their rare disease condition, was then estimated, as such information is commonly found in health records which have undergone de-identification by HIPAA’s ‘Safe Harbor.’ The study showed that by these estimations, nearly 75% of patients could be at high risk for re-identification in healthcare datasets in which they appear, due to their unique combination of identifiers. CONCLUSIONS: The results of this study show that these rare disease patients, due to their choice to provide support for their community, are putting all their healthcare data at risk of re-identification. This paper demonstrates how simple adjustments to participation guidelines in such support forums, in combination with improved privacy measures at the organizational level, could mitigate this risk of re-identification. Additionally, this paper suggests the potential for future investigation into consideration of certain ‘risky’ International Classification of Diseases (ICD) codes as quasi-identifiers in de-identified datasets to further protect patients’ privacy, while maintaining the utility of such rare disease support groups. |
format | Online Article Text |
id | pubmed-7457524 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-74575242020-08-31 Participation in patient support forums may put rare disease patient data at risk of re-identification Gow, James Moffatt, Colin Blackport, Jamie Orphanet J Rare Dis Research BACKGROUND: Rare disease patients often struggle to find both medical advice and emotional support for their diagnosis. Consequently, many rare disease patient support forums have appeared on hospital webpages, social media sites, and on rare disease foundation sites. However, we argue that engagement in these groups may pose a healthcare data privacy threat to many participants, since it makes a series of patient indirect identifiers ‘readily available’ in combination with rare disease conditions. This information produces a risk of re-identification because it may allow a motivated attacker to use the unique combination of a patient’s identifiers and disease condition to re-identify them in anonymized data. RESULTS: To assess this risk of re-identification, patient direct and indirect identifiers were mined from patient support forums for 80 patients across eight rare diseases. This data mining consisted of scanning patient testimonials, social media sites, and public records for the collection of identifiers linked to a rare disease patient. The number of people in the United States that may share each patient’s combination of marital status, 3-digit ZIP code, age, and sex, as well as their rare disease condition, was then estimated, as such information is commonly found in health records which have undergone de-identification by HIPAA’s ‘Safe Harbor.’ The study showed that by these estimations, nearly 75% of patients could be at high risk for re-identification in healthcare datasets in which they appear, due to their unique combination of identifiers. CONCLUSIONS: The results of this study show that these rare disease patients, due to their choice to provide support for their community, are putting all their healthcare data at risk of re-identification. This paper demonstrates how simple adjustments to participation guidelines in such support forums, in combination with improved privacy measures at the organizational level, could mitigate this risk of re-identification. Additionally, this paper suggests the potential for future investigation into consideration of certain ‘risky’ International Classification of Diseases (ICD) codes as quasi-identifiers in de-identified datasets to further protect patients’ privacy, while maintaining the utility of such rare disease support groups. BioMed Central 2020-08-31 /pmc/articles/PMC7457524/ /pubmed/32867839 http://dx.doi.org/10.1186/s13023-020-01497-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Gow, James Moffatt, Colin Blackport, Jamie Participation in patient support forums may put rare disease patient data at risk of re-identification |
title | Participation in patient support forums may put rare disease patient data at risk of re-identification |
title_full | Participation in patient support forums may put rare disease patient data at risk of re-identification |
title_fullStr | Participation in patient support forums may put rare disease patient data at risk of re-identification |
title_full_unstemmed | Participation in patient support forums may put rare disease patient data at risk of re-identification |
title_short | Participation in patient support forums may put rare disease patient data at risk of re-identification |
title_sort | participation in patient support forums may put rare disease patient data at risk of re-identification |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7457524/ https://www.ncbi.nlm.nih.gov/pubmed/32867839 http://dx.doi.org/10.1186/s13023-020-01497-3 |
work_keys_str_mv | AT gowjames participationinpatientsupportforumsmayputrarediseasepatientdataatriskofreidentification AT moffattcolin participationinpatientsupportforumsmayputrarediseasepatientdataatriskofreidentification AT blackportjamie participationinpatientsupportforumsmayputrarediseasepatientdataatriskofreidentification |
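
Illustrative note: the description field says the number of people who may share a patient's combination of marital status, 3-digit ZIP code, age, and sex, together with the rare disease condition, was estimated. The record does not say how that estimate was computed, so the following Python is only a minimal sketch of one possible approach. It assumes the attributes are statistically independent; the class name, the function names, the risk threshold of one expected match, and every number in the example are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class QuasiIdentifierShares:
    """Hypothetical population shares for one patient (not data from the paper)."""
    zip3_population: int        # residents of the patient's 3-digit ZIP area
    share_same_sex: float       # fraction of residents with the same sex
    share_same_age: float       # fraction of residents with the same age
    share_same_marital: float   # fraction with the same marital status
    disease_prevalence: float   # fraction of people with the rare disease


def expected_matches(q: QuasiIdentifierShares) -> float:
    """Expected number of people sharing every attribute.

    Assumption: the attributes are independent, so the shares multiply.
    """
    return (q.zip3_population
            * q.share_same_sex
            * q.share_same_age
            * q.share_same_marital
            * q.disease_prevalence)


def is_high_risk(q: QuasiIdentifierShares, threshold: float = 1.0) -> bool:
    """Flag a patient when fewer than `threshold` people are expected to match.

    The threshold is an assumed cut-off chosen for illustration only.
    """
    return expected_matches(q) < threshold


# Made-up example: a ZIP3 area of 500,000 people and a 1-in-50,000 disease.
patient = QuasiIdentifierShares(
    zip3_population=500_000,
    share_same_sex=0.5,
    share_same_age=0.0125,
    share_same_marital=0.5,
    disease_prevalence=1 / 50_000,
)

# The same demographic combination with the disease condition ignored.
demographics_only = replace(patient, disease_prevalence=1.0)

print(expected_matches(demographics_only), is_high_risk(demographics_only))
print(expected_matches(patient), is_high_risk(patient))
```

In this made-up example the demographic attributes alone leave well over a thousand expected matches, while adding the rare disease condition drops the expected count below one. That is the mechanism the abstract describes, shown here with invented figures.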