Cargando…
A generic solution for web-based management of pseudonymized data
BACKGROUND: Collaborative collection and sharing of data have become a core element of biomedical research. Typical applications are multi-site registries which collect sensitive person-related data prospectively, often together with biospecimens. To secure these sensitive data, national and interna...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4665916/ https://www.ncbi.nlm.nih.gov/pubmed/26621059 http://dx.doi.org/10.1186/s12911-015-0222-y |
_version_ | 1782403637478162432 |
---|---|
author | Lautenschläger, Ronald Kohlmayer, Florian Prasser, Fabian Kuhn, Klaus A. |
author_facet | Lautenschläger, Ronald Kohlmayer, Florian Prasser, Fabian Kuhn, Klaus A. |
author_sort | Lautenschläger, Ronald |
collection | PubMed |
description | BACKGROUND: Collaborative collection and sharing of data have become a core element of biomedical research. Typical applications are multi-site registries which collect sensitive person-related data prospectively, often together with biospecimens. To secure these sensitive data, national and international data protection laws and regulations demand the separation of identifying data from biomedical data and to introduce pseudonyms. Neither the formulation in laws and regulations nor existing pseudonymization concepts, however, are precise enough to directly provide an implementation guideline. We therefore describe core requirements as well as implementation options for registries and study databases with sensitive biomedical data. METHODS: We first analyze existing concepts and compile a set of fundamental requirements for pseudonymized data management. Then we derive a system architecture that fulfills these requirements. Next, we provide a comprehensive overview and a comparison of different technical options for an implementation. Finally, we develop a generic software solution for managing pseudonymized data and show its feasibility by describing how we have used it to realize two research networks. RESULTS: We have found that pseudonymization models are highly heterogeneous, already on a conceptual level. We have compiled a set of requirements from different pseudonymization schemes. We propose an architecture and present an overview of technical options. Based on a selection of technical elements, we suggest a generic solution. It supports the multi-site collection and management of biomedical data. Security measures are multi-tier pseudonymity and physical separation of data over independent backend servers. Integrated views are provided by a web-based user interface. Our approach has been successfully used to implement a national and an international rare disease network. CONCLUSIONS: We were able to identify a set of core requirements out of several pseudonymization models. Considering various implementation options, we realized a generic solution which was implemented and deployed in research networks. Still, further conceptual work on pseudonymity is needed. Specifically, it remains unclear how exactly data is to be separated into distributed subsets. Moreover, a thorough risk and threat analysis is needed. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-015-0222-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4665916 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46659162015-12-02 A generic solution for web-based management of pseudonymized data Lautenschläger, Ronald Kohlmayer, Florian Prasser, Fabian Kuhn, Klaus A. BMC Med Inform Decis Mak Technical Advance BACKGROUND: Collaborative collection and sharing of data have become a core element of biomedical research. Typical applications are multi-site registries which collect sensitive person-related data prospectively, often together with biospecimens. To secure these sensitive data, national and international data protection laws and regulations demand the separation of identifying data from biomedical data and to introduce pseudonyms. Neither the formulation in laws and regulations nor existing pseudonymization concepts, however, are precise enough to directly provide an implementation guideline. We therefore describe core requirements as well as implementation options for registries and study databases with sensitive biomedical data. METHODS: We first analyze existing concepts and compile a set of fundamental requirements for pseudonymized data management. Then we derive a system architecture that fulfills these requirements. Next, we provide a comprehensive overview and a comparison of different technical options for an implementation. Finally, we develop a generic software solution for managing pseudonymized data and show its feasibility by describing how we have used it to realize two research networks. RESULTS: We have found that pseudonymization models are highly heterogeneous, already on a conceptual level. We have compiled a set of requirements from different pseudonymization schemes. We propose an architecture and present an overview of technical options. Based on a selection of technical elements, we suggest a generic solution. It supports the multi-site collection and management of biomedical data. Security measures are multi-tier pseudonymity and physical separation of data over independent backend servers. Integrated views are provided by a web-based user interface. Our approach has been successfully used to implement a national and an international rare disease network. CONCLUSIONS: We were able to identify a set of core requirements out of several pseudonymization models. Considering various implementation options, we realized a generic solution which was implemented and deployed in research networks. Still, further conceptual work on pseudonymity is needed. Specifically, it remains unclear how exactly data is to be separated into distributed subsets. Moreover, a thorough risk and threat analysis is needed. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-015-0222-y) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-30 /pmc/articles/PMC4665916/ /pubmed/26621059 http://dx.doi.org/10.1186/s12911-015-0222-y Text en © Lautenschläger et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Technical Advance Lautenschläger, Ronald Kohlmayer, Florian Prasser, Fabian Kuhn, Klaus A. A generic solution for web-based management of pseudonymized data |
title | A generic solution for web-based management of pseudonymized data |
title_full | A generic solution for web-based management of pseudonymized data |
title_fullStr | A generic solution for web-based management of pseudonymized data |
title_full_unstemmed | A generic solution for web-based management of pseudonymized data |
title_short | A generic solution for web-based management of pseudonymized data |
title_sort | generic solution for web-based management of pseudonymized data |
topic | Technical Advance |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4665916/ https://www.ncbi.nlm.nih.gov/pubmed/26621059 http://dx.doi.org/10.1186/s12911-015-0222-y |
work_keys_str_mv | AT lautenschlagerronald agenericsolutionforwebbasedmanagementofpseudonymizeddata AT kohlmayerflorian agenericsolutionforwebbasedmanagementofpseudonymizeddata AT prasserfabian agenericsolutionforwebbasedmanagementofpseudonymizeddata AT kuhnklausa agenericsolutionforwebbasedmanagementofpseudonymizeddata AT lautenschlagerronald genericsolutionforwebbasedmanagementofpseudonymizeddata AT kohlmayerflorian genericsolutionforwebbasedmanagementofpseudonymizeddata AT prasserfabian genericsolutionforwebbasedmanagementofpseudonymizeddata AT kuhnklausa genericsolutionforwebbasedmanagementofpseudonymizeddata |