Cargando…
Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies
Objectives : Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Georg Thieme Verlag KG
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9719789/ https://www.ncbi.nlm.nih.gov/pubmed/36463884 http://dx.doi.org/10.1055/s-0042-1742522 |
_version_ | 1784843403638341632 |
---|---|
author | Swertz, Morris van Enckevort, Esther Oliveira, José Luis Fortier, Isabel Bergeron, Julie Thurin, Nicolas H. Hyde, Eleanor Kellmann, Alexander Pahoueshnja, Romin Sturkenboom, Miriam Cunnington, Marianne Nybo Andersen, Anne-Marie Marcon, Yannick Gonçalves, Gonçalo Gini, Rosa |
author_facet | Swertz, Morris van Enckevort, Esther Oliveira, José Luis Fortier, Isabel Bergeron, Julie Thurin, Nicolas H. Hyde, Eleanor Kellmann, Alexander Pahoueshnja, Romin Sturkenboom, Miriam Cunnington, Marianne Nybo Andersen, Anne-Marie Marcon, Yannick Gonçalves, Gonçalo Gini, Rosa |
author_sort | Swertz, Morris |
collection | PubMed |
description | Objectives : Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible, Interoperable and Reusable. However, their valuable diversity also created heterogeneity, which poses challenges to optimally exploit their richness. Methods : In this opinion review, we analyse catalogues for human subject research ranging from cohort studies to surveillance, administrative and healthcare records. Results : We observe that while these catalogues are heterogeneous, have various scopes, and use different terminologies, still the underlying concepts seem potentially harmonizable. We propose a unified framework to enable catalogue data sharing, with catalogues of multi-center cohorts nested as a special case in catalogues of real-world data sources. Moreover, we list recommendations to create an integrated community of metadata catalogues and an open catalogue ecosystem to sustain these efforts and maximise impact. Conclusions : We propose to embrace the autonomy of motivated catalogue teams and invest in their collaboration via minimal standardisation efforts such as clear data licensing, persistent identifiers for linking same records between catalogues, minimal metadata ‘common data elements’ using shared ontologies, symmetric architectures for data sharing (push/pull) with clear provenance tracks to process updates and acknowledge original contributors. And most importantly, we encourage the creation of environments for collaboration and resource sharing between catalogue developers, building on international networks such as OpenAIRE and research data alliance, as well as domain specific ESFRIs such as BBMRI and ELIXIR. |
format | Online Article Text |
id | pubmed-9719789 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Georg Thieme Verlag KG |
record_format | MEDLINE/PubMed |
spelling | pubmed-97197892022-12-05 Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies Swertz, Morris van Enckevort, Esther Oliveira, José Luis Fortier, Isabel Bergeron, Julie Thurin, Nicolas H. Hyde, Eleanor Kellmann, Alexander Pahoueshnja, Romin Sturkenboom, Miriam Cunnington, Marianne Nybo Andersen, Anne-Marie Marcon, Yannick Gonçalves, Gonçalo Gini, Rosa Yearb Med Inform Objectives : Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible, Interoperable and Reusable. However, their valuable diversity also created heterogeneity, which poses challenges to optimally exploit their richness. Methods : In this opinion review, we analyse catalogues for human subject research ranging from cohort studies to surveillance, administrative and healthcare records. Results : We observe that while these catalogues are heterogeneous, have various scopes, and use different terminologies, still the underlying concepts seem potentially harmonizable. We propose a unified framework to enable catalogue data sharing, with catalogues of multi-center cohorts nested as a special case in catalogues of real-world data sources. Moreover, we list recommendations to create an integrated community of metadata catalogues and an open catalogue ecosystem to sustain these efforts and maximise impact. Conclusions : We propose to embrace the autonomy of motivated catalogue teams and invest in their collaboration via minimal standardisation efforts such as clear data licensing, persistent identifiers for linking same records between catalogues, minimal metadata ‘common data elements’ using shared ontologies, symmetric architectures for data sharing (push/pull) with clear provenance tracks to process updates and acknowledge original contributors. And most importantly, we encourage the creation of environments for collaboration and resource sharing between catalogue developers, building on international networks such as OpenAIRE and research data alliance, as well as domain specific ESFRIs such as BBMRI and ELIXIR. Georg Thieme Verlag KG 2022-12-04 /pmc/articles/PMC9719789/ /pubmed/36463884 http://dx.doi.org/10.1055/s-0042-1742522 Text en IMIA and Thieme. This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. ( https://creativecommons.org/licenses/by-nc-nd/4.0/ ) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited. |
spellingShingle | Swertz, Morris van Enckevort, Esther Oliveira, José Luis Fortier, Isabel Bergeron, Julie Thurin, Nicolas H. Hyde, Eleanor Kellmann, Alexander Pahoueshnja, Romin Sturkenboom, Miriam Cunnington, Marianne Nybo Andersen, Anne-Marie Marcon, Yannick Gonçalves, Gonçalo Gini, Rosa Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies |
title | Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies |
title_full | Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies |
title_fullStr | Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies |
title_full_unstemmed | Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies |
title_short | Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies |
title_sort | towards an interoperable ecosystem of research cohort and real-world data catalogues enabling multi-center studies |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9719789/ https://www.ncbi.nlm.nih.gov/pubmed/36463884 http://dx.doi.org/10.1055/s-0042-1742522 |
work_keys_str_mv | AT swertzmorris towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT vanenckevortesther towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT oliveirajoseluis towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT fortierisabel towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT bergeronjulie towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT thurinnicolash towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT hydeeleanor towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT kellmannalexander towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT pahoueshnjaromin towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT sturkenboommiriam towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT cunningtonmarianne towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT nyboandersenannemarie towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT marconyannick towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT goncalvesgoncalo towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies AT ginirosa towardsaninteroperableecosystemofresearchcohortandrealworlddatacataloguesenablingmulticenterstudies |