Cargando…
Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining
Data mining is traditionally adopted to retrieve and analyze knowledge from large amounts of data. Private or confidential data may be sanitized or suppressed before it is shared or published in public. Privacy preserving data mining (PPDM) has thus become an important issue in recent years. The mos...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005146/ https://www.ncbi.nlm.nih.gov/pubmed/24982932 http://dx.doi.org/10.1155/2014/235837 |
_version_ | 1782314066820202496 |
---|---|
author | Lin, Chun-Wei Hong, Tzung-Pei Hsu, Hung-Chuan |
author_facet | Lin, Chun-Wei Hong, Tzung-Pei Hsu, Hung-Chuan |
author_sort | Lin, Chun-Wei |
collection | PubMed |
description | Data mining is traditionally adopted to retrieve and analyze knowledge from large amounts of data. Private or confidential data may be sanitized or suppressed before it is shared or published in public. Privacy preserving data mining (PPDM) has thus become an important issue in recent years. The most general way of PPDM is to sanitize the database to hide the sensitive information. In this paper, a novel hiding-missing-artificial utility (HMAU) algorithm is proposed to hide sensitive itemsets through transaction deletion. The transaction with the maximal ratio of sensitive to nonsensitive one is thus selected to be entirely deleted. Three side effects of hiding failures, missing itemsets, and artificial itemsets are considered to evaluate whether the transactions are required to be deleted for hiding sensitive itemsets. Three weights are also assigned as the importance to three factors, which can be set according to the requirement of users. Experiments are then conducted to show the performance of the proposed algorithm in execution time, number of deleted transactions, and number of side effects. |
format | Online Article Text |
id | pubmed-4005146 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-40051462014-06-30 Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining Lin, Chun-Wei Hong, Tzung-Pei Hsu, Hung-Chuan ScientificWorldJournal Research Article Data mining is traditionally adopted to retrieve and analyze knowledge from large amounts of data. Private or confidential data may be sanitized or suppressed before it is shared or published in public. Privacy preserving data mining (PPDM) has thus become an important issue in recent years. The most general way of PPDM is to sanitize the database to hide the sensitive information. In this paper, a novel hiding-missing-artificial utility (HMAU) algorithm is proposed to hide sensitive itemsets through transaction deletion. The transaction with the maximal ratio of sensitive to nonsensitive one is thus selected to be entirely deleted. Three side effects of hiding failures, missing itemsets, and artificial itemsets are considered to evaluate whether the transactions are required to be deleted for hiding sensitive itemsets. Three weights are also assigned as the importance to three factors, which can be set according to the requirement of users. Experiments are then conducted to show the performance of the proposed algorithm in execution time, number of deleted transactions, and number of side effects. Hindawi Publishing Corporation 2014 2014-04-10 /pmc/articles/PMC4005146/ /pubmed/24982932 http://dx.doi.org/10.1155/2014/235837 Text en Copyright © 2014 Chun-Wei Lin et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Lin, Chun-Wei Hong, Tzung-Pei Hsu, Hung-Chuan Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining |
title | Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining |
title_full | Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining |
title_fullStr | Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining |
title_full_unstemmed | Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining |
title_short | Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining |
title_sort | reducing side effects of hiding sensitive itemsets in privacy preserving data mining |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005146/ https://www.ncbi.nlm.nih.gov/pubmed/24982932 http://dx.doi.org/10.1155/2014/235837 |
work_keys_str_mv | AT linchunwei reducingsideeffectsofhidingsensitiveitemsetsinprivacypreservingdatamining AT hongtzungpei reducingsideeffectsofhidingsensitiveitemsetsinprivacypreservingdatamining AT hsuhungchuan reducingsideeffectsofhidingsensitiveitemsetsinprivacypreservingdatamining |