Cargando…
An anonymization-based privacy-preserving data collection protocol for digital health data
Digital health data collection is vital for healthcare and medical research. But it contains sensitive information about patients, which makes it challenging. To collect health data without privacy breaches, it must be secured between the data owner and the collector. Existing data collection resear...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Fronstiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10020182/ https://www.ncbi.nlm.nih.gov/pubmed/36935661 http://dx.doi.org/10.3389/fpubh.2023.1125011 |
_version_ | 1784908195799498752 |
---|---|
author | Andrew, J. Eunice, R. Jennifer Karthikeyan, J. |
author_facet | Andrew, J. Eunice, R. Jennifer Karthikeyan, J. |
author_sort | Andrew, J. |
collection | PubMed |
description | Digital health data collection is vital for healthcare and medical research. But it contains sensitive information about patients, which makes it challenging. To collect health data without privacy breaches, it must be secured between the data owner and the collector. Existing data collection research studies have too stringent assumptions such as using a third-party anonymizer or a private channel amid the data owner and the collector. These studies are more susceptible to privacy attacks due to third-party involvement, which makes them less applicable for privacy-preserving healthcare data collection. This article proposes a novel privacy-preserving data collection protocol that anonymizes healthcare data without using a third-party anonymizer or a private channel for data transmission. A clustering-based k-anonymity model was adopted to efficiently prevent identity disclosure attacks, and the communication between the data owner and the collector is restricted to some elected representatives of each equivalent group of data owners. We also identified a privacy attack, known as “leader collusion”, in which the elected representatives may collaborate to violate an individual's privacy. We propose solutions for such collisions and sensitive attribute protection. A greedy heuristic method is devised to efficiently handle the data owners who join or depart the anonymization process dynamically. Furthermore, we present the potential privacy attacks on the proposed protocol and theoretical analysis. Extensive experiments are conducted in real-world datasets, and the results suggest that our solution outperforms the state-of-the-art techniques in terms of privacy protection and computational complexity. |
format | Online Article Text |
id | pubmed-10020182 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Fronstiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-100201822023-03-18 An anonymization-based privacy-preserving data collection protocol for digital health data Andrew, J. Eunice, R. Jennifer Karthikeyan, J. Front Public Health Public Health Digital health data collection is vital for healthcare and medical research. But it contains sensitive information about patients, which makes it challenging. To collect health data without privacy breaches, it must be secured between the data owner and the collector. Existing data collection research studies have too stringent assumptions such as using a third-party anonymizer or a private channel amid the data owner and the collector. These studies are more susceptible to privacy attacks due to third-party involvement, which makes them less applicable for privacy-preserving healthcare data collection. This article proposes a novel privacy-preserving data collection protocol that anonymizes healthcare data without using a third-party anonymizer or a private channel for data transmission. A clustering-based k-anonymity model was adopted to efficiently prevent identity disclosure attacks, and the communication between the data owner and the collector is restricted to some elected representatives of each equivalent group of data owners. We also identified a privacy attack, known as “leader collusion”, in which the elected representatives may collaborate to violate an individual's privacy. We propose solutions for such collisions and sensitive attribute protection. A greedy heuristic method is devised to efficiently handle the data owners who join or depart the anonymization process dynamically. Furthermore, we present the potential privacy attacks on the proposed protocol and theoretical analysis. Extensive experiments are conducted in real-world datasets, and the results suggest that our solution outperforms the state-of-the-art techniques in terms of privacy protection and computational complexity. Fronstiers Media S.A. 2023-03-03 /pmc/articles/PMC10020182/ /pubmed/36935661 http://dx.doi.org/10.3389/fpubh.2023.1125011 Text en Copyright © 2023 Andrew, Eunice and Karthikeyan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Public Health Andrew, J. Eunice, R. Jennifer Karthikeyan, J. An anonymization-based privacy-preserving data collection protocol for digital health data |
title | An anonymization-based privacy-preserving data collection protocol for digital health data |
title_full | An anonymization-based privacy-preserving data collection protocol for digital health data |
title_fullStr | An anonymization-based privacy-preserving data collection protocol for digital health data |
title_full_unstemmed | An anonymization-based privacy-preserving data collection protocol for digital health data |
title_short | An anonymization-based privacy-preserving data collection protocol for digital health data |
title_sort | anonymization-based privacy-preserving data collection protocol for digital health data |
topic | Public Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10020182/ https://www.ncbi.nlm.nih.gov/pubmed/36935661 http://dx.doi.org/10.3389/fpubh.2023.1125011 |
work_keys_str_mv | AT andrewj ananonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata AT eunicerjennifer ananonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata AT karthikeyanj ananonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata AT andrewj anonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata AT eunicerjennifer anonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata AT karthikeyanj anonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata |