Cargando…

Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining

Data mining is traditionally adopted to retrieve and analyze knowledge from large amounts of data. Private or confidential data may be sanitized or suppressed before it is shared or published in public. Privacy preserving data mining (PPDM) has thus become an important issue in recent years. The mos...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Chun-Wei, Hong, Tzung-Pei, Hsu, Hung-Chuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005146/
https://www.ncbi.nlm.nih.gov/pubmed/24982932
http://dx.doi.org/10.1155/2014/235837
_version_ 1782314066820202496
author Lin, Chun-Wei
Hong, Tzung-Pei
Hsu, Hung-Chuan
author_facet Lin, Chun-Wei
Hong, Tzung-Pei
Hsu, Hung-Chuan
author_sort Lin, Chun-Wei
collection PubMed
description Data mining is traditionally adopted to retrieve and analyze knowledge from large amounts of data. Private or confidential data may be sanitized or suppressed before it is shared or published in public. Privacy preserving data mining (PPDM) has thus become an important issue in recent years. The most general way of PPDM is to sanitize the database to hide the sensitive information. In this paper, a novel hiding-missing-artificial utility (HMAU) algorithm is proposed to hide sensitive itemsets through transaction deletion. The transaction with the maximal ratio of sensitive to nonsensitive one is thus selected to be entirely deleted. Three side effects of hiding failures, missing itemsets, and artificial itemsets are considered to evaluate whether the transactions are required to be deleted for hiding sensitive itemsets. Three weights are also assigned as the importance to three factors, which can be set according to the requirement of users. Experiments are then conducted to show the performance of the proposed algorithm in execution time, number of deleted transactions, and number of side effects.
format Online
Article
Text
id pubmed-4005146
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-40051462014-06-30 Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining Lin, Chun-Wei Hong, Tzung-Pei Hsu, Hung-Chuan ScientificWorldJournal Research Article Data mining is traditionally adopted to retrieve and analyze knowledge from large amounts of data. Private or confidential data may be sanitized or suppressed before it is shared or published in public. Privacy preserving data mining (PPDM) has thus become an important issue in recent years. The most general way of PPDM is to sanitize the database to hide the sensitive information. In this paper, a novel hiding-missing-artificial utility (HMAU) algorithm is proposed to hide sensitive itemsets through transaction deletion. The transaction with the maximal ratio of sensitive to nonsensitive one is thus selected to be entirely deleted. Three side effects of hiding failures, missing itemsets, and artificial itemsets are considered to evaluate whether the transactions are required to be deleted for hiding sensitive itemsets. Three weights are also assigned as the importance to three factors, which can be set according to the requirement of users. Experiments are then conducted to show the performance of the proposed algorithm in execution time, number of deleted transactions, and number of side effects. Hindawi Publishing Corporation 2014 2014-04-10 /pmc/articles/PMC4005146/ /pubmed/24982932 http://dx.doi.org/10.1155/2014/235837 Text en Copyright © 2014 Chun-Wei Lin et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lin, Chun-Wei
Hong, Tzung-Pei
Hsu, Hung-Chuan
Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining
title Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining
title_full Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining
title_fullStr Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining
title_full_unstemmed Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining
title_short Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining
title_sort reducing side effects of hiding sensitive itemsets in privacy preserving data mining
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005146/
https://www.ncbi.nlm.nih.gov/pubmed/24982932
http://dx.doi.org/10.1155/2014/235837
work_keys_str_mv AT linchunwei reducingsideeffectsofhidingsensitiveitemsetsinprivacypreservingdatamining
AT hongtzungpei reducingsideeffectsofhidingsensitiveitemsetsinprivacypreservingdatamining
AT hsuhungchuan reducingsideeffectsofhidingsensitiveitemsetsinprivacypreservingdatamining