Cargando…

An anonymization-based privacy-preserving data collection protocol for digital health data

Digital health data collection is vital for healthcare and medical research. But it contains sensitive information about patients, which makes it challenging. To collect health data without privacy breaches, it must be secured between the data owner and the collector. Existing data collection resear...

Descripción completa

Detalles Bibliográficos
Autores principales: Andrew, J., Eunice, R. Jennifer, Karthikeyan, J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Fronstiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10020182/
https://www.ncbi.nlm.nih.gov/pubmed/36935661
http://dx.doi.org/10.3389/fpubh.2023.1125011
_version_ 1784908195799498752
author Andrew, J.
Eunice, R. Jennifer
Karthikeyan, J.
author_facet Andrew, J.
Eunice, R. Jennifer
Karthikeyan, J.
author_sort Andrew, J.
collection PubMed
description Digital health data collection is vital for healthcare and medical research. But it contains sensitive information about patients, which makes it challenging. To collect health data without privacy breaches, it must be secured between the data owner and the collector. Existing data collection research studies have too stringent assumptions such as using a third-party anonymizer or a private channel amid the data owner and the collector. These studies are more susceptible to privacy attacks due to third-party involvement, which makes them less applicable for privacy-preserving healthcare data collection. This article proposes a novel privacy-preserving data collection protocol that anonymizes healthcare data without using a third-party anonymizer or a private channel for data transmission. A clustering-based k-anonymity model was adopted to efficiently prevent identity disclosure attacks, and the communication between the data owner and the collector is restricted to some elected representatives of each equivalent group of data owners. We also identified a privacy attack, known as “leader collusion”, in which the elected representatives may collaborate to violate an individual's privacy. We propose solutions for such collisions and sensitive attribute protection. A greedy heuristic method is devised to efficiently handle the data owners who join or depart the anonymization process dynamically. Furthermore, we present the potential privacy attacks on the proposed protocol and theoretical analysis. Extensive experiments are conducted in real-world datasets, and the results suggest that our solution outperforms the state-of-the-art techniques in terms of privacy protection and computational complexity.
format Online
Article
Text
id pubmed-10020182
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Fronstiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100201822023-03-18 An anonymization-based privacy-preserving data collection protocol for digital health data Andrew, J. Eunice, R. Jennifer Karthikeyan, J. Front Public Health Public Health Digital health data collection is vital for healthcare and medical research. But it contains sensitive information about patients, which makes it challenging. To collect health data without privacy breaches, it must be secured between the data owner and the collector. Existing data collection research studies have too stringent assumptions such as using a third-party anonymizer or a private channel amid the data owner and the collector. These studies are more susceptible to privacy attacks due to third-party involvement, which makes them less applicable for privacy-preserving healthcare data collection. This article proposes a novel privacy-preserving data collection protocol that anonymizes healthcare data without using a third-party anonymizer or a private channel for data transmission. A clustering-based k-anonymity model was adopted to efficiently prevent identity disclosure attacks, and the communication between the data owner and the collector is restricted to some elected representatives of each equivalent group of data owners. We also identified a privacy attack, known as “leader collusion”, in which the elected representatives may collaborate to violate an individual's privacy. We propose solutions for such collisions and sensitive attribute protection. A greedy heuristic method is devised to efficiently handle the data owners who join or depart the anonymization process dynamically. Furthermore, we present the potential privacy attacks on the proposed protocol and theoretical analysis. Extensive experiments are conducted in real-world datasets, and the results suggest that our solution outperforms the state-of-the-art techniques in terms of privacy protection and computational complexity. Fronstiers Media S.A. 2023-03-03 /pmc/articles/PMC10020182/ /pubmed/36935661 http://dx.doi.org/10.3389/fpubh.2023.1125011 Text en Copyright © 2023 Andrew, Eunice and Karthikeyan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Andrew, J.
Eunice, R. Jennifer
Karthikeyan, J.
An anonymization-based privacy-preserving data collection protocol for digital health data
title An anonymization-based privacy-preserving data collection protocol for digital health data
title_full An anonymization-based privacy-preserving data collection protocol for digital health data
title_fullStr An anonymization-based privacy-preserving data collection protocol for digital health data
title_full_unstemmed An anonymization-based privacy-preserving data collection protocol for digital health data
title_short An anonymization-based privacy-preserving data collection protocol for digital health data
title_sort anonymization-based privacy-preserving data collection protocol for digital health data
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10020182/
https://www.ncbi.nlm.nih.gov/pubmed/36935661
http://dx.doi.org/10.3389/fpubh.2023.1125011
work_keys_str_mv AT andrewj ananonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata
AT eunicerjennifer ananonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata
AT karthikeyanj ananonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata
AT andrewj anonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata
AT eunicerjennifer anonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata
AT karthikeyanj anonymizationbasedprivacypreservingdatacollectionprotocolfordigitalhealthdata