Cargando…

A generic solution for web-based management of pseudonymized data

BACKGROUND: Collaborative collection and sharing of data have become a core element of biomedical research. Typical applications are multi-site registries which collect sensitive person-related data prospectively, often together with biospecimens. To secure these sensitive data, national and interna...

Descripción completa

Detalles Bibliográficos
Autores principales: Lautenschläger, Ronald, Kohlmayer, Florian, Prasser, Fabian, Kuhn, Klaus A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4665916/
https://www.ncbi.nlm.nih.gov/pubmed/26621059
http://dx.doi.org/10.1186/s12911-015-0222-y
_version_ 1782403637478162432
author Lautenschläger, Ronald
Kohlmayer, Florian
Prasser, Fabian
Kuhn, Klaus A.
author_facet Lautenschläger, Ronald
Kohlmayer, Florian
Prasser, Fabian
Kuhn, Klaus A.
author_sort Lautenschläger, Ronald
collection PubMed
description BACKGROUND: Collaborative collection and sharing of data have become a core element of biomedical research. Typical applications are multi-site registries which collect sensitive person-related data prospectively, often together with biospecimens. To secure these sensitive data, national and international data protection laws and regulations demand the separation of identifying data from biomedical data and to introduce pseudonyms. Neither the formulation in laws and regulations nor existing pseudonymization concepts, however, are precise enough to directly provide an implementation guideline. We therefore describe core requirements as well as implementation options for registries and study databases with sensitive biomedical data. METHODS: We first analyze existing concepts and compile a set of fundamental requirements for pseudonymized data management. Then we derive a system architecture that fulfills these requirements. Next, we provide a comprehensive overview and a comparison of different technical options for an implementation. Finally, we develop a generic software solution for managing pseudonymized data and show its feasibility by describing how we have used it to realize two research networks. RESULTS: We have found that pseudonymization models are highly heterogeneous, already on a conceptual level. We have compiled a set of requirements from different pseudonymization schemes. We propose an architecture and present an overview of technical options. Based on a selection of technical elements, we suggest a generic solution. It supports the multi-site collection and management of biomedical data. Security measures are multi-tier pseudonymity and physical separation of data over independent backend servers. Integrated views are provided by a web-based user interface. Our approach has been successfully used to implement a national and an international rare disease network. CONCLUSIONS: We were able to identify a set of core requirements out of several pseudonymization models. Considering various implementation options, we realized a generic solution which was implemented and deployed in research networks. Still, further conceptual work on pseudonymity is needed. Specifically, it remains unclear how exactly data is to be separated into distributed subsets. Moreover, a thorough risk and threat analysis is needed. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-015-0222-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4665916
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46659162015-12-02 A generic solution for web-based management of pseudonymized data Lautenschläger, Ronald Kohlmayer, Florian Prasser, Fabian Kuhn, Klaus A. BMC Med Inform Decis Mak Technical Advance BACKGROUND: Collaborative collection and sharing of data have become a core element of biomedical research. Typical applications are multi-site registries which collect sensitive person-related data prospectively, often together with biospecimens. To secure these sensitive data, national and international data protection laws and regulations demand the separation of identifying data from biomedical data and to introduce pseudonyms. Neither the formulation in laws and regulations nor existing pseudonymization concepts, however, are precise enough to directly provide an implementation guideline. We therefore describe core requirements as well as implementation options for registries and study databases with sensitive biomedical data. METHODS: We first analyze existing concepts and compile a set of fundamental requirements for pseudonymized data management. Then we derive a system architecture that fulfills these requirements. Next, we provide a comprehensive overview and a comparison of different technical options for an implementation. Finally, we develop a generic software solution for managing pseudonymized data and show its feasibility by describing how we have used it to realize two research networks. RESULTS: We have found that pseudonymization models are highly heterogeneous, already on a conceptual level. We have compiled a set of requirements from different pseudonymization schemes. We propose an architecture and present an overview of technical options. Based on a selection of technical elements, we suggest a generic solution. It supports the multi-site collection and management of biomedical data. Security measures are multi-tier pseudonymity and physical separation of data over independent backend servers. Integrated views are provided by a web-based user interface. Our approach has been successfully used to implement a national and an international rare disease network. CONCLUSIONS: We were able to identify a set of core requirements out of several pseudonymization models. Considering various implementation options, we realized a generic solution which was implemented and deployed in research networks. Still, further conceptual work on pseudonymity is needed. Specifically, it remains unclear how exactly data is to be separated into distributed subsets. Moreover, a thorough risk and threat analysis is needed. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-015-0222-y) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-30 /pmc/articles/PMC4665916/ /pubmed/26621059 http://dx.doi.org/10.1186/s12911-015-0222-y Text en © Lautenschläger et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Advance
Lautenschläger, Ronald
Kohlmayer, Florian
Prasser, Fabian
Kuhn, Klaus A.
A generic solution for web-based management of pseudonymized data
title A generic solution for web-based management of pseudonymized data
title_full A generic solution for web-based management of pseudonymized data
title_fullStr A generic solution for web-based management of pseudonymized data
title_full_unstemmed A generic solution for web-based management of pseudonymized data
title_short A generic solution for web-based management of pseudonymized data
title_sort generic solution for web-based management of pseudonymized data
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4665916/
https://www.ncbi.nlm.nih.gov/pubmed/26621059
http://dx.doi.org/10.1186/s12911-015-0222-y
work_keys_str_mv AT lautenschlagerronald agenericsolutionforwebbasedmanagementofpseudonymizeddata
AT kohlmayerflorian agenericsolutionforwebbasedmanagementofpseudonymizeddata
AT prasserfabian agenericsolutionforwebbasedmanagementofpseudonymizeddata
AT kuhnklausa agenericsolutionforwebbasedmanagementofpseudonymizeddata
AT lautenschlagerronald genericsolutionforwebbasedmanagementofpseudonymizeddata
AT kohlmayerflorian genericsolutionforwebbasedmanagementofpseudonymizeddata
AT prasserfabian genericsolutionforwebbasedmanagementofpseudonymizeddata
AT kuhnklausa genericsolutionforwebbasedmanagementofpseudonymizeddata