Harmonization-Information Trade-Offs for Sharing Individual Participant Data in Biomedicine

Biomedical practice is evidence-based. Peer-reviewed papers are the primary medium to present evidence and data-supported results to drive clinical practice. However, it could be argued that scientific literature does not contain data, but rather narratives about and summaries of data. Meta-analyses...

Bibliographic Details
Main Authors: Torres-Espín, Abel; Ferguson, Adam R
Format: Online Article Text
Language: English
Published: 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9681014/
https://www.ncbi.nlm.nih.gov/pubmed/36420049
http://dx.doi.org/10.1162/99608f92.a9717b34
author Torres-Espín, Abel
Ferguson, Adam R
collection PubMed
description Biomedical practice is evidence-based. Peer-reviewed papers are the primary medium to present evidence and data-supported results to drive clinical practice. However, it could be argued that scientific literature does not contain data, but rather narratives about and summaries of data. Meta-analyses of published literature may produce biased conclusions due to the lack of transparency in data collection, publication bias, and inaccessibility to the data underlying a publication (‘dark data’). Co-analysis of pooled data at the level of individual research participants can offer higher levels of evidence, but this requires that researchers share raw individual participant data (IPD). FAIR (findable, accessible, interoperable, and reusable) data governance principles aim to guide data lifecycle management by providing a framework for actionable data sharing. Here we discuss the implications of FAIR for data harmonization, an essential step for pooling data for IPD analysis. We describe the harmonization-information trade-off, which states that the level of granularity in harmonizing data determines the amount of information lost. Finally, we discuss a framework for managing the trade-off and the levels of harmonization. In the coming era of funder mandates for data sharing, research communities that effectively manage data harmonization will be empowered to harness big data and advanced analytics such as machine learning and artificial intelligence tools, leading to stunning new discoveries that augment our understanding of diseases and their treatments. By elevating scientific data to the status of a first-class citizen of the scientific enterprise, there is strong potential for biomedicine to transition from a narrative publication product orientation to a modern data-driven enterprise where data itself is viewed as a primary work product of biomedical research.
format Online
Article
Text
id pubmed-9681014
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-9681014 2022-11-22 Harmonization-Information Trade-Offs for Sharing Individual Participant Data in Biomedicine Torres-Espín, Abel Ferguson, Adam R Harv Data Sci Rev Article 2022 2022-07-28 /pmc/articles/PMC9681014/ /pubmed/36420049 http://dx.doi.org/10.1162/99608f92.a9717b34 Text en https://creativecommons.org/licenses/by/4.0/ This article is licensed under a Creative Commons Attribution (CC BY 4.0) International license, except where otherwise indicated with respect to particular material included in the article.
title Harmonization-Information Trade-Offs for Sharing Individual Participant Data in Biomedicine
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9681014/
https://www.ncbi.nlm.nih.gov/pubmed/36420049
http://dx.doi.org/10.1162/99608f92.a9717b34