Cargando…
An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques †
The topic of big data has attracted increasing interest in recent years. The emergence of big data leads to new difficulties in terms of protection models used for data privacy, which is of necessity for sharing and processing data. Protecting individuals’ sensitive information while maintaining the...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512893/ https://www.ncbi.nlm.nih.gov/pubmed/33265463 http://dx.doi.org/10.3390/e20050373 |
_version_ | 1783586262312026112 |
---|---|
author | Eyupoglu, Can Aydin, Muhammed Ali Zaim, Abdul Halim Sertbas, Ahmet |
author_facet | Eyupoglu, Can Aydin, Muhammed Ali Zaim, Abdul Halim Sertbas, Ahmet |
author_sort | Eyupoglu, Can |
collection | PubMed |
description | The topic of big data has attracted increasing interest in recent years. The emergence of big data leads to new difficulties in terms of protection models used for data privacy, which is of necessity for sharing and processing data. Protecting individuals’ sensitive information while maintaining the usability of the data set published is the most important challenge in privacy preserving. In this regard, data anonymization methods are utilized in order to protect data against identity disclosure and linking attacks. In this study, a novel data anonymization algorithm based on chaos and perturbation has been proposed for privacy and utility preserving in big data. The performance of the proposed algorithm is evaluated in terms of Kullback–Leibler divergence, probabilistic anonymity, classification accuracy, F-measure and execution time. The experimental results have shown that the proposed algorithm is efficient and performs better in terms of Kullback–Leibler divergence, classification accuracy and F-measure compared to most of the existing algorithms using the same data set. Resulting from applying chaos to perturb data, such successful algorithm is promising to be used in privacy preserving data mining and data publishing. |
format | Online Article Text |
id | pubmed-7512893 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75128932020-11-09 An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques † Eyupoglu, Can Aydin, Muhammed Ali Zaim, Abdul Halim Sertbas, Ahmet Entropy (Basel) Article The topic of big data has attracted increasing interest in recent years. The emergence of big data leads to new difficulties in terms of protection models used for data privacy, which is of necessity for sharing and processing data. Protecting individuals’ sensitive information while maintaining the usability of the data set published is the most important challenge in privacy preserving. In this regard, data anonymization methods are utilized in order to protect data against identity disclosure and linking attacks. In this study, a novel data anonymization algorithm based on chaos and perturbation has been proposed for privacy and utility preserving in big data. The performance of the proposed algorithm is evaluated in terms of Kullback–Leibler divergence, probabilistic anonymity, classification accuracy, F-measure and execution time. The experimental results have shown that the proposed algorithm is efficient and performs better in terms of Kullback–Leibler divergence, classification accuracy and F-measure compared to most of the existing algorithms using the same data set. Resulting from applying chaos to perturb data, such successful algorithm is promising to be used in privacy preserving data mining and data publishing. MDPI 2018-05-17 /pmc/articles/PMC7512893/ /pubmed/33265463 http://dx.doi.org/10.3390/e20050373 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Eyupoglu, Can Aydin, Muhammed Ali Zaim, Abdul Halim Sertbas, Ahmet An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques † |
title | An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques † |
title_full | An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques † |
title_fullStr | An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques † |
title_full_unstemmed | An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques † |
title_short | An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques † |
title_sort | efficient big data anonymization algorithm based on chaos and perturbation techniques † |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512893/ https://www.ncbi.nlm.nih.gov/pubmed/33265463 http://dx.doi.org/10.3390/e20050373 |
work_keys_str_mv | AT eyupoglucan anefficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques AT aydinmuhammedali anefficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques AT zaimabdulhalim anefficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques AT sertbasahmet anefficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques AT eyupoglucan efficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques AT aydinmuhammedali efficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques AT zaimabdulhalim efficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques AT sertbasahmet efficientbigdataanonymizationalgorithmbasedonchaosandperturbationtechniques |