Cargando…
Algorithms to anonymize structured medical and healthcare data: A systematic review
Introduction: With many anonymization algorithms developed for structured medical health data (SMHD) in the last decade, our systematic review provides a comprehensive bird’s eye view of algorithms for SMHD anonymization. Methods: This systematic review was conducted according to the recommendations...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815524/ https://www.ncbi.nlm.nih.gov/pubmed/36619476 http://dx.doi.org/10.3389/fbinf.2022.984807 |
_version_ | 1784864337107615744 |
---|---|
author | Sepas, Ali Bangash, Ali Haider Alraoui, Omar El Emam, Khaled El-Hussuna, Alaa |
author_facet | Sepas, Ali Bangash, Ali Haider Alraoui, Omar El Emam, Khaled El-Hussuna, Alaa |
author_sort | Sepas, Ali |
collection | PubMed |
description | Introduction: With many anonymization algorithms developed for structured medical health data (SMHD) in the last decade, our systematic review provides a comprehensive bird’s eye view of algorithms for SMHD anonymization. Methods: This systematic review was conducted according to the recommendations in the Cochrane Handbook for Reviews of Interventions and reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA). Eligible articles from the PubMed, ACM digital library, Medline, IEEE, Embase, Web of Science Collection, Scopus, ProQuest Dissertation, and Theses Global databases were identified through systematic searches. The following parameters were extracted from the eligible studies: author, year of publication, sample size, and relevant algorithms and/or software applied to anonymize SMHD, along with the summary of outcomes. Results: Among 1,804 initial hits, the present study considered 63 records including research articles, reviews, and books. Seventy five evaluated the anonymization of demographic data, 18 assessed diagnosis codes, and 3 assessed genomic data. One of the most common approaches was k-anonymity, which was utilized mainly for demographic data, often in combination with another algorithm; e.g., l-diversity. No approaches have yet been developed for protection against membership disclosure attacks on diagnosis codes. Conclusion: This study reviewed and categorized different anonymization approaches for MHD according to the anonymized data types (demographics, diagnosis codes, and genomic data). Further research is needed to develop more efficient algorithms for the anonymization of diagnosis codes and genomic data. The risk of reidentification can be minimized with adequate application of the addressed anonymization approaches. Systematic Review Registration: [http://www.crd.york.ac.uk/prospero], identifier [CRD42021228200]. |
format | Online Article Text |
id | pubmed-9815524 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-98155242023-01-06 Algorithms to anonymize structured medical and healthcare data: A systematic review Sepas, Ali Bangash, Ali Haider Alraoui, Omar El Emam, Khaled El-Hussuna, Alaa Front Bioinform Bioinformatics Introduction: With many anonymization algorithms developed for structured medical health data (SMHD) in the last decade, our systematic review provides a comprehensive bird’s eye view of algorithms for SMHD anonymization. Methods: This systematic review was conducted according to the recommendations in the Cochrane Handbook for Reviews of Interventions and reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA). Eligible articles from the PubMed, ACM digital library, Medline, IEEE, Embase, Web of Science Collection, Scopus, ProQuest Dissertation, and Theses Global databases were identified through systematic searches. The following parameters were extracted from the eligible studies: author, year of publication, sample size, and relevant algorithms and/or software applied to anonymize SMHD, along with the summary of outcomes. Results: Among 1,804 initial hits, the present study considered 63 records including research articles, reviews, and books. Seventy five evaluated the anonymization of demographic data, 18 assessed diagnosis codes, and 3 assessed genomic data. One of the most common approaches was k-anonymity, which was utilized mainly for demographic data, often in combination with another algorithm; e.g., l-diversity. No approaches have yet been developed for protection against membership disclosure attacks on diagnosis codes. Conclusion: This study reviewed and categorized different anonymization approaches for MHD according to the anonymized data types (demographics, diagnosis codes, and genomic data). Further research is needed to develop more efficient algorithms for the anonymization of diagnosis codes and genomic data. The risk of reidentification can be minimized with adequate application of the addressed anonymization approaches. Systematic Review Registration: [http://www.crd.york.ac.uk/prospero], identifier [CRD42021228200]. Frontiers Media S.A. 2022-12-22 /pmc/articles/PMC9815524/ /pubmed/36619476 http://dx.doi.org/10.3389/fbinf.2022.984807 Text en Copyright © 2022 Sepas, Bangash, Alraoui, El Emam and El-Hussuna. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Sepas, Ali Bangash, Ali Haider Alraoui, Omar El Emam, Khaled El-Hussuna, Alaa Algorithms to anonymize structured medical and healthcare data: A systematic review |
title | Algorithms to anonymize structured medical and healthcare data: A systematic review |
title_full | Algorithms to anonymize structured medical and healthcare data: A systematic review |
title_fullStr | Algorithms to anonymize structured medical and healthcare data: A systematic review |
title_full_unstemmed | Algorithms to anonymize structured medical and healthcare data: A systematic review |
title_short | Algorithms to anonymize structured medical and healthcare data: A systematic review |
title_sort | algorithms to anonymize structured medical and healthcare data: a systematic review |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815524/ https://www.ncbi.nlm.nih.gov/pubmed/36619476 http://dx.doi.org/10.3389/fbinf.2022.984807 |
work_keys_str_mv | AT sepasali algorithmstoanonymizestructuredmedicalandhealthcaredataasystematicreview AT bangashalihaider algorithmstoanonymizestructuredmedicalandhealthcaredataasystematicreview AT alraouiomar algorithmstoanonymizestructuredmedicalandhealthcaredataasystematicreview AT elemamkhaled algorithmstoanonymizestructuredmedicalandhealthcaredataasystematicreview AT elhussunaalaa algorithmstoanonymizestructuredmedicalandhealthcaredataasystematicreview |