Cargando…

An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques †

The topic of big data has attracted increasing interest in recent years. The emergence of big data leads to new difficulties in terms of protection models used for data privacy, which is of necessity for sharing and processing data. Protecting individuals’ sensitive information while maintaining the...

Descripción completa

Detalles Bibliográficos
Autores principales: Eyupoglu, Can, Aydin, Muhammed Ali, Zaim, Abdul Halim, Sertbas, Ahmet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512893/
https://www.ncbi.nlm.nih.gov/pubmed/33265463
http://dx.doi.org/10.3390/e20050373
_version_ 1783586262312026112
author Eyupoglu, Can
Aydin, Muhammed Ali
Zaim, Abdul Halim
Sertbas, Ahmet
author_facet Eyupoglu, Can
Aydin, Muhammed Ali
Zaim, Abdul Halim
Sertbas, Ahmet
author_sort Eyupoglu, Can
collection PubMed
description The topic of big data has attracted increasing interest in recent years. The emergence of big data leads to new difficulties in terms of protection models used for data privacy, which is of necessity for sharing and processing data. Protecting individuals’ sensitive information while maintaining the usability of the data set published is the most important challenge in privacy preserving. In this regard, data anonymization methods are utilized in order to protect data against identity disclosure and linking attacks. In this study, a novel data anonymization algorithm based on chaos and perturbation has been proposed for privacy and utility preserving in big data. The performance of the proposed algorithm is evaluated in terms of Kullback–Leibler divergence, probabilistic anonymity, classification accuracy, F-measure and execution time. The experimental results have shown that the proposed algorithm is efficient and performs better in terms of Kullback–Leibler divergence, classification accuracy and F-measure compared to most of the existing algorithms using the same data set. Resulting from applying chaos to perturb data, such successful algorithm is promising to be used in privacy preserving data mining and data publishing.
format Online
Article
Text
id pubmed-7512893
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75128932020-11-09 An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques † Eyupoglu, Can Aydin, Muhammed Ali Zaim, Abdul Halim Sertbas, Ahmet Entropy (Basel) Article The topic of big data has attracted increasing interest in recent years. The emergence of big data leads to new difficulties in terms of protection models used for data privacy, which is of necessity for sharing and processing data. Protecting individuals’ sensitive information while maintaining the usability of the data set published is the most important challenge in privacy preserving. In this regard, data anonymization methods are utilized in order to protect data against identity disclosure and linking attacks. In this study, a novel data anonymization algorithm based on chaos and perturbation has been proposed for privacy and utility preserving in big data. The performance of the proposed algorithm is evaluated in terms of Kullback–Leibler divergence, probabilistic anonymity, classification accuracy, F-measure and execution time. The experimental results have shown that the proposed algorithm is efficient and performs better in terms of Kullback–Leibler divergence, classification accuracy and F-measure compared to most of the existing algorithms using the same data set. Resulting from applying chaos to perturb data, such successful algorithm is promising to be used in privacy preserving data mining and data publishing. MDPI 2018-05-17 /pmc/articles/PMC7512893/ /pubmed/33265463 http://dx.doi.org/10.3390/e20050373 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Eyupoglu, Can
Aydin, Muhammed Ali
Zaim, Abdul Halim
Sertbas, Ahmet
An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques †
title An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques †
title_full An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques †
title_fullStr An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques †
title_full_unstemmed An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques †
title_short An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques †
title_sort efficient big data anonymization algorithm based on chaos and perturbation techniques †
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512893/
https://www.ncbi.nlm.nih.gov/pubmed/33265463
http://dx.doi.org/10.3390/e20050373
work_keys_str_mv AT eyupoglucan anefficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques
AT aydinmuhammedali anefficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques
AT zaimabdulhalim anefficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques
AT sertbasahmet anefficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques
AT eyupoglucan efficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques
AT aydinmuhammedali efficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques
AT zaimabdulhalim efficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques
AT sertbasahmet efficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques