Cargando…

Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data

Personally identifiable information (PII) affects individual privacy because PII combinations may yield unique identifications in published data. User PII such as age, race, gender, and zip code contain private information that may assist an adversary in determining the user to whom such information...

Descripción completa

Detalles Bibliográficos
Autores principales: Majeed, Abdul, Ullah, Farman, Lee, Sungchang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5469664/
https://www.ncbi.nlm.nih.gov/pubmed/28481298
http://dx.doi.org/10.3390/s17051059
_version_ 1783243621916475392
author Majeed, Abdul
Ullah, Farman
Lee, Sungchang
author_facet Majeed, Abdul
Ullah, Farman
Lee, Sungchang
author_sort Majeed, Abdul
collection PubMed
description Personally identifiable information (PII) affects individual privacy because PII combinations may yield unique identifications in published data. User PII such as age, race, gender, and zip code contain private information that may assist an adversary in determining the user to whom such information relates. Each item of user PII reveals identity differently, and some types of PII are highly identity vulnerable. More vulnerable types of PII enable unique identification more easily, and their presence in published data increases privacy risks. Existing privacy models treat all types of PII equally from an identity revelation point of view, and they mainly focus on hiding user PII in a crowd of other users. Ignoring the identity vulnerability of each type of PII during anonymization is not an effective method of protecting user privacy in a fine-grained manner. This paper proposes a new anonymization scheme that considers the identity vulnerability of PII to effectively protect user privacy. Data generalization is performed adaptively based on the identity vulnerability of PII as well as diversity to anonymize data. This adaptive generalization effectively enables anonymous data, which protects user identity and private information disclosures while maximizing the utility of data for performing analyses and building classification models. Additionally, the proposed scheme has low computational overheads. The simulation results show the effectiveness of the scheme and verify the aforementioned claims.
format Online
Article
Text
id pubmed-5469664
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-54696642017-06-16 Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data Majeed, Abdul Ullah, Farman Lee, Sungchang Sensors (Basel) Article Personally identifiable information (PII) affects individual privacy because PII combinations may yield unique identifications in published data. User PII such as age, race, gender, and zip code contain private information that may assist an adversary in determining the user to whom such information relates. Each item of user PII reveals identity differently, and some types of PII are highly identity vulnerable. More vulnerable types of PII enable unique identification more easily, and their presence in published data increases privacy risks. Existing privacy models treat all types of PII equally from an identity revelation point of view, and they mainly focus on hiding user PII in a crowd of other users. Ignoring the identity vulnerability of each type of PII during anonymization is not an effective method of protecting user privacy in a fine-grained manner. This paper proposes a new anonymization scheme that considers the identity vulnerability of PII to effectively protect user privacy. Data generalization is performed adaptively based on the identity vulnerability of PII as well as diversity to anonymize data. This adaptive generalization effectively enables anonymous data, which protects user identity and private information disclosures while maximizing the utility of data for performing analyses and building classification models. Additionally, the proposed scheme has low computational overheads. The simulation results show the effectiveness of the scheme and verify the aforementioned claims. MDPI 2017-05-08 /pmc/articles/PMC5469664/ /pubmed/28481298 http://dx.doi.org/10.3390/s17051059 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Majeed, Abdul
Ullah, Farman
Lee, Sungchang
Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data
title Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data
title_full Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data
title_fullStr Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data
title_full_unstemmed Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data
title_short Vulnerability- and Diversity-Aware Anonymization of Personally Identifiable Information for Improving User Privacy and Utility of Publishing Data
title_sort vulnerability- and diversity-aware anonymization of personally identifiable information for improving user privacy and utility of publishing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5469664/
https://www.ncbi.nlm.nih.gov/pubmed/28481298
http://dx.doi.org/10.3390/s17051059
work_keys_str_mv AT majeedabdul vulnerabilityanddiversityawareanonymizationofpersonallyidentifiableinformationforimprovinguserprivacyandutilityofpublishingdata
AT ullahfarman vulnerabilityanddiversityawareanonymizationofpersonallyidentifiableinformationforimprovinguserprivacyandutilityofpublishingdata
AT leesungchang vulnerabilityanddiversityawareanonymizationofpersonallyidentifiableinformationforimprovinguserprivacyandutilityofpublishingdata