Cargando…

Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values

BACKGROUND: Various methods based on k-anonymity have been proposed for publishing medical data while preserving privacy. However, the k-anonymity property assumes that adversaries possess fixed background knowledge. Although differential privacy overcomes this limitation, it is specialized for aggr...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Hyukki, Chung, Yon Dohn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7346516/
https://www.ncbi.nlm.nih.gov/pubmed/32641043
http://dx.doi.org/10.1186/s12911-020-01171-5
_version_ 1783556423462944768
author Lee, Hyukki
Chung, Yon Dohn
author_facet Lee, Hyukki
Chung, Yon Dohn
author_sort Lee, Hyukki
collection PubMed
description BACKGROUND: Various methods based on k-anonymity have been proposed for publishing medical data while preserving privacy. However, the k-anonymity property assumes that adversaries possess fixed background knowledge. Although differential privacy overcomes this limitation, it is specialized for aggregated results. Thus, it is difficult to obtain high-quality microdata. To address this issue, we propose a differentially private medical microdata release method featuring high utility. METHODS: We propose a method of anonymizing medical data under differential privacy. To improve data utility, especially by preserving informative attribute values, the proposed method adopts three data perturbation approaches: (1) generalization, (2) suppression, and (3) insertion. The proposed method produces an anonymized dataset that is nearly optimal with regard to utility, while preserving privacy. RESULTS: The proposed method achieves lower information loss than existing methods. Based on a real-world case study, we prove that the results of data analyses using the original dataset and those obtained using a dataset anonymized via the proposed method are considerably similar. CONCLUSIONS: We propose a novel differentially private anonymization method that preserves informative values for the release of medical data. Through experiments, we show that the utility of medical data that has been anonymized via the proposed method is significantly better than that of existing methods.
format Online
Article
Text
id pubmed-7346516
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73465162020-07-14 Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values Lee, Hyukki Chung, Yon Dohn BMC Med Inform Decis Mak Research Article BACKGROUND: Various methods based on k-anonymity have been proposed for publishing medical data while preserving privacy. However, the k-anonymity property assumes that adversaries possess fixed background knowledge. Although differential privacy overcomes this limitation, it is specialized for aggregated results. Thus, it is difficult to obtain high-quality microdata. To address this issue, we propose a differentially private medical microdata release method featuring high utility. METHODS: We propose a method of anonymizing medical data under differential privacy. To improve data utility, especially by preserving informative attribute values, the proposed method adopts three data perturbation approaches: (1) generalization, (2) suppression, and (3) insertion. The proposed method produces an anonymized dataset that is nearly optimal with regard to utility, while preserving privacy. RESULTS: The proposed method achieves lower information loss than existing methods. Based on a real-world case study, we prove that the results of data analyses using the original dataset and those obtained using a dataset anonymized via the proposed method are considerably similar. CONCLUSIONS: We propose a novel differentially private anonymization method that preserves informative values for the release of medical data. Through experiments, we show that the utility of medical data that has been anonymized via the proposed method is significantly better than that of existing methods. BioMed Central 2020-07-08 /pmc/articles/PMC7346516/ /pubmed/32641043 http://dx.doi.org/10.1186/s12911-020-01171-5 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Lee, Hyukki
Chung, Yon Dohn
Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values
title Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values
title_full Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values
title_fullStr Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values
title_full_unstemmed Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values
title_short Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values
title_sort differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7346516/
https://www.ncbi.nlm.nih.gov/pubmed/32641043
http://dx.doi.org/10.1186/s12911-020-01171-5
work_keys_str_mv AT leehyukki differentiallyprivatereleaseofmedicalmicrodataanefficientandpracticalapproachforpreservinginformativeattributevalues
AT chungyondohn differentiallyprivatereleaseofmedicalmicrodataanefficientandpracticalapproachforpreservinginformativeattributevalues