Cargando…

Framework for federated causal inference based on real-world observational data sources (BY-COVID)

BACKGROUND: Causal inference techniques help researchers and policy-makers to evaluate public health interventions. Approaching causal inference by re-using routinely collected observational data across different regions is challenging and guidance is currently lacking. With the aim of filling this...

Descripción completa

Detalles Bibliográficos
Autores principales: Meurisse, M, Estupiñán-Romero, F, Van Goethem, N, González-Galindo, J, Bernal-Delgado, E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10597019/
http://dx.doi.org/10.1093/eurpub/ckad160.459
_version_ 1785125243029815296
author Meurisse, M
Estupiñán-Romero, F
Van Goethem, N
González-Galindo, J
Bernal-Delgado, E
author_facet Meurisse, M
Estupiñán-Romero, F
Van Goethem, N
González-Galindo, J
Bernal-Delgado, E
author_sort Meurisse, M
collection PubMed
description BACKGROUND: Causal inference techniques help researchers and policy-makers to evaluate public health interventions. Approaching causal inference by re-using routinely collected observational data across different regions is challenging and guidance is currently lacking. With the aim of filling this gap and allowing a rapid response in the case of a next pandemic, a methodological framework to estimate causal effects in a federated research infrastructure is showcased in the baseline use case of the European BY-COVID project. METHODS: A framework, implementing existing methodologies and expertise, is proposed, enabling federated research based on routinely collected sensitive individual-level data. The framework includes step-by-step guidelines, from defining a research question, to establishing a causal model, identifying and specifying data requirements in a data model, and developing an interoperable and reproducible analytical pipeline for distributed deployment. Using open-source software, a complete workflow implementing federated causal inference was prototyped to assess the real-world effectiveness of SARS-CoV-2 primary vaccination in preventing infection in populations spanning different countries, integrating a data quality assessment, imputation of missing values, matching of exposed to unexposed based on identified confounders and a survival analysis within the matched population. RESULTS: The proposed methodological framework was successfully demonstrated within the BY-COVID use case, illustrating its feasibility and value. Different Findable, Accessible, Interoperable and Reusable (FAIR) research objects were produced, such as a study protocol, a data management plan, data model and interoperable analytical pipeline. CONCLUSIONS: The framework provides a systematic approach to address policy-relevant causal research questions in a privacy-preserving way. The methodology and derived research objects can be re-used and contribute to pandemic preparedness. KEY MESSAGES: • Estimating causal effects using observational data in a privacy-preserving way was demonstrated to be achievable and can help policy-makers to evaluate public health interventions. • The methodology, constituted of Findable, Accessible, Interoperable and Reusable (FAIR) research objects, is generalizable to other research questions and contributes to pandemic preparedness.
format Online
Article
Text
id pubmed-10597019
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105970192023-10-25 Framework for federated causal inference based on real-world observational data sources (BY-COVID) Meurisse, M Estupiñán-Romero, F Van Goethem, N González-Galindo, J Bernal-Delgado, E Eur J Public Health Parallel Programme BACKGROUND: Causal inference techniques help researchers and policy-makers to evaluate public health interventions. Approaching causal inference by re-using routinely collected observational data across different regions is challenging and guidance is currently lacking. With the aim of filling this gap and allowing a rapid response in the case of a next pandemic, a methodological framework to estimate causal effects in a federated research infrastructure is showcased in the baseline use case of the European BY-COVID project. METHODS: A framework, implementing existing methodologies and expertise, is proposed, enabling federated research based on routinely collected sensitive individual-level data. The framework includes step-by-step guidelines, from defining a research question, to establishing a causal model, identifying and specifying data requirements in a data model, and developing an interoperable and reproducible analytical pipeline for distributed deployment. Using open-source software, a complete workflow implementing federated causal inference was prototyped to assess the real-world effectiveness of SARS-CoV-2 primary vaccination in preventing infection in populations spanning different countries, integrating a data quality assessment, imputation of missing values, matching of exposed to unexposed based on identified confounders and a survival analysis within the matched population. RESULTS: The proposed methodological framework was successfully demonstrated within the BY-COVID use case, illustrating its feasibility and value. Different Findable, Accessible, Interoperable and Reusable (FAIR) research objects were produced, such as a study protocol, a data management plan, data model and interoperable analytical pipeline. CONCLUSIONS: The framework provides a systematic approach to address policy-relevant causal research questions in a privacy-preserving way. The methodology and derived research objects can be re-used and contribute to pandemic preparedness. KEY MESSAGES: • Estimating causal effects using observational data in a privacy-preserving way was demonstrated to be achievable and can help policy-makers to evaluate public health interventions. • The methodology, constituted of Findable, Accessible, Interoperable and Reusable (FAIR) research objects, is generalizable to other research questions and contributes to pandemic preparedness. Oxford University Press 2023-10-24 /pmc/articles/PMC10597019/ http://dx.doi.org/10.1093/eurpub/ckad160.459 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the European Public Health Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Parallel Programme
Meurisse, M
Estupiñán-Romero, F
Van Goethem, N
González-Galindo, J
Bernal-Delgado, E
Framework for federated causal inference based on real-world observational data sources (BY-COVID)
title Framework for federated causal inference based on real-world observational data sources (BY-COVID)
title_full Framework for federated causal inference based on real-world observational data sources (BY-COVID)
title_fullStr Framework for federated causal inference based on real-world observational data sources (BY-COVID)
title_full_unstemmed Framework for federated causal inference based on real-world observational data sources (BY-COVID)
title_short Framework for federated causal inference based on real-world observational data sources (BY-COVID)
title_sort framework for federated causal inference based on real-world observational data sources (by-covid)
topic Parallel Programme
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10597019/
http://dx.doi.org/10.1093/eurpub/ckad160.459
work_keys_str_mv AT meurissem frameworkforfederatedcausalinferencebasedonrealworldobservationaldatasourcesbycovid
AT estupinanromerof frameworkforfederatedcausalinferencebasedonrealworldobservationaldatasourcesbycovid
AT vangoethemn frameworkforfederatedcausalinferencebasedonrealworldobservationaldatasourcesbycovid
AT gonzalezgalindoj frameworkforfederatedcausalinferencebasedonrealworldobservationaldatasourcesbycovid
AT bernaldelgadoe frameworkforfederatedcausalinferencebasedonrealworldobservationaldatasourcesbycovid