Cargando…

Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations

A popular model for global scientific repositories is the data commons, which pools or connects many datasets alongside supporting infrastructure. A data commons must establish legally interoperability between datasets to ensure researchers can aggregate and reuse them. This is usually achieved by e...

Descripción completa

Detalles Bibliográficos
Autor principal: Thorogood, Adrian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7454728/
https://www.ncbi.nlm.nih.gov/pubmed/33005429
http://dx.doi.org/10.1093/jlb/lsaa065
_version_ 1783575522660319232
author Thorogood, Adrian
author_facet Thorogood, Adrian
author_sort Thorogood, Adrian
collection PubMed
description A popular model for global scientific repositories is the data commons, which pools or connects many datasets alongside supporting infrastructure. A data commons must establish legally interoperability between datasets to ensure researchers can aggregate and reuse them. This is usually achieved by establishing a shared governance structure. Unfortunately, governance often takes years to negotiate and involves a trade-off between data inclusion and data availability. It can also be difficult for repositories to modify governance structures in response to changing scientific priorities, data sharing practices, or legal frameworks. This problem has been laid bare by the sudden shock of the COVID-19 pandemic. This paper proposes a rapid and flexible strategy for scientific repositories to achieve legal interoperability: the policy-aware data lake. This strategy draws on technical concepts of modularity, metadata, and data lakes. Datasets are treated as independent modules, which can be subject to distinctive legal requirements. Each module must, however, be described using standard legal metadata. This allows legally compatible datasets to be rapidly combined and made available on a just-in-time basis to certain researchers for certain purposes. Global scientific repositories increasingly need such flexibility to manage scientific, organizational, and legal complexity, and to improve their responsiveness to global pandemics.
format Online
Article
Text
id pubmed-7454728
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-74547282020-08-31 Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations Thorogood, Adrian J Law Biosci Original Article A popular model for global scientific repositories is the data commons, which pools or connects many datasets alongside supporting infrastructure. A data commons must establish legally interoperability between datasets to ensure researchers can aggregate and reuse them. This is usually achieved by establishing a shared governance structure. Unfortunately, governance often takes years to negotiate and involves a trade-off between data inclusion and data availability. It can also be difficult for repositories to modify governance structures in response to changing scientific priorities, data sharing practices, or legal frameworks. This problem has been laid bare by the sudden shock of the COVID-19 pandemic. This paper proposes a rapid and flexible strategy for scientific repositories to achieve legal interoperability: the policy-aware data lake. This strategy draws on technical concepts of modularity, metadata, and data lakes. Datasets are treated as independent modules, which can be subject to distinctive legal requirements. Each module must, however, be described using standard legal metadata. This allows legally compatible datasets to be rapidly combined and made available on a just-in-time basis to certain researchers for certain purposes. Global scientific repositories increasingly need such flexibility to manage scientific, organizational, and legal complexity, and to improve their responsiveness to global pandemics. Oxford University Press 2020-08-19 /pmc/articles/PMC7454728/ /pubmed/33005429 http://dx.doi.org/10.1093/jlb/lsaa065 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Duke University School of Law, Harvard Law School, Oxford University Press, and Stanford Law School. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Article
Thorogood, Adrian
Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
title Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
title_full Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
title_fullStr Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
title_full_unstemmed Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
title_short Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
title_sort policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7454728/
https://www.ncbi.nlm.nih.gov/pubmed/33005429
http://dx.doi.org/10.1093/jlb/lsaa065
work_keys_str_mv AT thorogoodadrian policyawaredatalakesaflexibleapproachtoachievelegalinteroperabilityforglobalresearchcollaborations