Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data
Personally identifiable information (PII) affects individual privacy because PII combinations may yield unique identifications in published data. User PII such as age, race, gender, and zip code contain private information that may assist an adversary in determining the user to whom such information...
Main authors: | Majeed, Abdul; Ullah, Farman; Lee, Sungchang |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2017 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5469664/ https://www.ncbi.nlm.nih.gov/pubmed/28481298 http://dx.doi.org/10.3390/s17051059 |
_version_ | 1783243621916475392 |
---|---|
author | Majeed, Abdul Ullah, Farman Lee, Sungchang |
author_facet | Majeed, Abdul Ullah, Farman Lee, Sungchang |
author_sort | Majeed, Abdul |
collection | PubMed |
description | Personally identifiable information (PII) affects individual privacy because PII combinations may yield unique identifications in published data. User PII such as age, race, gender, and zip code contain private information that may assist an adversary in determining the user to whom such information relates. Each item of user PII reveals identity differently, and some types of PII are highly identity vulnerable. More vulnerable types of PII enable unique identification more easily, and their presence in published data increases privacy risks. Existing privacy models treat all types of PII equally from an identity revelation point of view, and they mainly focus on hiding user PII in a crowd of other users. Ignoring the identity vulnerability of each type of PII during anonymization is not an effective method of protecting user privacy in a fine-grained manner. This paper proposes a new anonymization scheme that considers the identity vulnerability of PII to effectively protect user privacy. Data generalization is performed adaptively based on the identity vulnerability of PII as well as diversity to anonymize data. This adaptive generalization effectively enables anonymous data, which protects user identity and private information disclosures while maximizing the utility of data for performing analyses and building classification models. Additionally, the proposed scheme has low computational overheads. The simulation results show the effectiveness of the scheme and verify the aforementioned claims. |
format | Online Article Text |
id | pubmed-5469664 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-5469664 2017-06-16 Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data Majeed, Abdul Ullah, Farman Lee, Sungchang Sensors (Basel) Article Personally identifiable information (PII) affects individual privacy because PII combinations may yield unique identifications in published data. User PII such as age, race, gender, and zip code contain private information that may assist an adversary in determining the user to whom such information relates. Each item of user PII reveals identity differently, and some types of PII are highly identity vulnerable. More vulnerable types of PII enable unique identification more easily, and their presence in published data increases privacy risks. Existing privacy models treat all types of PII equally from an identity revelation point of view, and they mainly focus on hiding user PII in a crowd of other users. Ignoring the identity vulnerability of each type of PII during anonymization is not an effective method of protecting user privacy in a fine-grained manner. This paper proposes a new anonymization scheme that considers the identity vulnerability of PII to effectively protect user privacy. Data generalization is performed adaptively based on the identity vulnerability of PII as well as diversity to anonymize data. This adaptive generalization effectively enables anonymous data, which protects user identity and private information disclosures while maximizing the utility of data for performing analyses and building classification models. Additionally, the proposed scheme has low computational overheads. The simulation results show the effectiveness of the scheme and verify the aforementioned claims. MDPI 2017-05-08 /pmc/articles/PMC5469664/ /pubmed/28481298 http://dx.doi.org/10.3390/s17051059 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
title | Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5469664/ https://www.ncbi.nlm.nih.gov/pubmed/28481298 http://dx.doi.org/10.3390/s17051059 |