Cargando…
Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
A popular model for global scientific repositories is the data commons, which pools or connects many datasets alongside supporting infrastructure. A data commons must establish legally interoperability between datasets to ensure researchers can aggregate and reuse them. This is usually achieved by e...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7454728/ https://www.ncbi.nlm.nih.gov/pubmed/33005429 http://dx.doi.org/10.1093/jlb/lsaa065 |
_version_ | 1783575522660319232 |
---|---|
author | Thorogood, Adrian |
author_facet | Thorogood, Adrian |
author_sort | Thorogood, Adrian |
collection | PubMed |
description | A popular model for global scientific repositories is the data commons, which pools or connects many datasets alongside supporting infrastructure. A data commons must establish legally interoperability between datasets to ensure researchers can aggregate and reuse them. This is usually achieved by establishing a shared governance structure. Unfortunately, governance often takes years to negotiate and involves a trade-off between data inclusion and data availability. It can also be difficult for repositories to modify governance structures in response to changing scientific priorities, data sharing practices, or legal frameworks. This problem has been laid bare by the sudden shock of the COVID-19 pandemic. This paper proposes a rapid and flexible strategy for scientific repositories to achieve legal interoperability: the policy-aware data lake. This strategy draws on technical concepts of modularity, metadata, and data lakes. Datasets are treated as independent modules, which can be subject to distinctive legal requirements. Each module must, however, be described using standard legal metadata. This allows legally compatible datasets to be rapidly combined and made available on a just-in-time basis to certain researchers for certain purposes. Global scientific repositories increasingly need such flexibility to manage scientific, organizational, and legal complexity, and to improve their responsiveness to global pandemics. |
format | Online Article Text |
id | pubmed-7454728 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-74547282020-08-31 Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations Thorogood, Adrian J Law Biosci Original Article A popular model for global scientific repositories is the data commons, which pools or connects many datasets alongside supporting infrastructure. A data commons must establish legally interoperability between datasets to ensure researchers can aggregate and reuse them. This is usually achieved by establishing a shared governance structure. Unfortunately, governance often takes years to negotiate and involves a trade-off between data inclusion and data availability. It can also be difficult for repositories to modify governance structures in response to changing scientific priorities, data sharing practices, or legal frameworks. This problem has been laid bare by the sudden shock of the COVID-19 pandemic. This paper proposes a rapid and flexible strategy for scientific repositories to achieve legal interoperability: the policy-aware data lake. This strategy draws on technical concepts of modularity, metadata, and data lakes. Datasets are treated as independent modules, which can be subject to distinctive legal requirements. Each module must, however, be described using standard legal metadata. This allows legally compatible datasets to be rapidly combined and made available on a just-in-time basis to certain researchers for certain purposes. Global scientific repositories increasingly need such flexibility to manage scientific, organizational, and legal complexity, and to improve their responsiveness to global pandemics. Oxford University Press 2020-08-19 /pmc/articles/PMC7454728/ /pubmed/33005429 http://dx.doi.org/10.1093/jlb/lsaa065 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Duke University School of Law, Harvard Law School, Oxford University Press, and Stanford Law School. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Article Thorogood, Adrian Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations |
title | Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations |
title_full | Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations |
title_fullStr | Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations |
title_full_unstemmed | Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations |
title_short | Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations |
title_sort | policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7454728/ https://www.ncbi.nlm.nih.gov/pubmed/33005429 http://dx.doi.org/10.1093/jlb/lsaa065 |
work_keys_str_mv | AT thorogoodadrian policyawaredatalakesaflexibleapproachtoachievelegalinteroperabilityforglobalresearchcollaborations |