Cargando…

Optimal adjustment sets for causal query estimation in partially observed biomolecular networks

Causal query estimation in biomolecular networks commonly selects a ‘valid adjustment set’, i.e. a subset of network variables that eliminates the bias of the estimator. A same query may have multiple valid adjustment sets, each with a different variance. When networks are partially observed, curren...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohammad-Taheri, Sara, Tewari, Vartika, Kapre, Rohan, Rahiminasab, Ehsan, Sachs, Karen, Tapley Hoyt, Charles, Zucker, Jeremy, Vitek, Olga
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311316/
https://www.ncbi.nlm.nih.gov/pubmed/37387179
http://dx.doi.org/10.1093/bioinformatics/btad270
_version_ 1785066717531078656
author Mohammad-Taheri, Sara
Tewari, Vartika
Kapre, Rohan
Rahiminasab, Ehsan
Sachs, Karen
Tapley Hoyt, Charles
Zucker, Jeremy
Vitek, Olga
author_facet Mohammad-Taheri, Sara
Tewari, Vartika
Kapre, Rohan
Rahiminasab, Ehsan
Sachs, Karen
Tapley Hoyt, Charles
Zucker, Jeremy
Vitek, Olga
author_sort Mohammad-Taheri, Sara
collection PubMed
description Causal query estimation in biomolecular networks commonly selects a ‘valid adjustment set’, i.e. a subset of network variables that eliminates the bias of the estimator. A same query may have multiple valid adjustment sets, each with a different variance. When networks are partially observed, current methods use graph-based criteria to find an adjustment set that minimizes asymptotic variance. Unfortunately, many models that share the same graph topology, and therefore same functional dependencies, may differ in the processes that generate the observational data. In these cases, the topology-based criteria fail to distinguish the variances of the adjustment sets. This deficiency can lead to sub-optimal adjustment sets, and to miss-characterization of the effect of the intervention. We propose an approach for deriving ‘optimal adjustment sets’ that takes into account the nature of the data, bias and finite-sample variance of the estimator, and cost. It empirically learns the data generating processes from historical experimental data, and characterizes the properties of the estimators by simulation. We demonstrate the utility of the proposed approach in four biomolecular Case studies with different topologies and different data generation processes. The implementation and reproducible Case studies are at https://github.com/srtaheri/OptimalAdjustmentSet.
format Online
Article
Text
id pubmed-10311316
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103113162023-07-01 Optimal adjustment sets for causal query estimation in partially observed biomolecular networks Mohammad-Taheri, Sara Tewari, Vartika Kapre, Rohan Rahiminasab, Ehsan Sachs, Karen Tapley Hoyt, Charles Zucker, Jeremy Vitek, Olga Bioinformatics Systems Biology and Networks Causal query estimation in biomolecular networks commonly selects a ‘valid adjustment set’, i.e. a subset of network variables that eliminates the bias of the estimator. A same query may have multiple valid adjustment sets, each with a different variance. When networks are partially observed, current methods use graph-based criteria to find an adjustment set that minimizes asymptotic variance. Unfortunately, many models that share the same graph topology, and therefore same functional dependencies, may differ in the processes that generate the observational data. In these cases, the topology-based criteria fail to distinguish the variances of the adjustment sets. This deficiency can lead to sub-optimal adjustment sets, and to miss-characterization of the effect of the intervention. We propose an approach for deriving ‘optimal adjustment sets’ that takes into account the nature of the data, bias and finite-sample variance of the estimator, and cost. It empirically learns the data generating processes from historical experimental data, and characterizes the properties of the estimators by simulation. We demonstrate the utility of the proposed approach in four biomolecular Case studies with different topologies and different data generation processes. The implementation and reproducible Case studies are at https://github.com/srtaheri/OptimalAdjustmentSet. Oxford University Press 2023-06-30 /pmc/articles/PMC10311316/ /pubmed/37387179 http://dx.doi.org/10.1093/bioinformatics/btad270 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Systems Biology and Networks
Mohammad-Taheri, Sara
Tewari, Vartika
Kapre, Rohan
Rahiminasab, Ehsan
Sachs, Karen
Tapley Hoyt, Charles
Zucker, Jeremy
Vitek, Olga
Optimal adjustment sets for causal query estimation in partially observed biomolecular networks
title Optimal adjustment sets for causal query estimation in partially observed biomolecular networks
title_full Optimal adjustment sets for causal query estimation in partially observed biomolecular networks
title_fullStr Optimal adjustment sets for causal query estimation in partially observed biomolecular networks
title_full_unstemmed Optimal adjustment sets for causal query estimation in partially observed biomolecular networks
title_short Optimal adjustment sets for causal query estimation in partially observed biomolecular networks
title_sort optimal adjustment sets for causal query estimation in partially observed biomolecular networks
topic Systems Biology and Networks
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311316/
https://www.ncbi.nlm.nih.gov/pubmed/37387179
http://dx.doi.org/10.1093/bioinformatics/btad270
work_keys_str_mv AT mohammadtaherisara optimaladjustmentsetsforcausalqueryestimationinpartiallyobservedbiomolecularnetworks
AT tewarivartika optimaladjustmentsetsforcausalqueryestimationinpartiallyobservedbiomolecularnetworks
AT kaprerohan optimaladjustmentsetsforcausalqueryestimationinpartiallyobservedbiomolecularnetworks
AT rahiminasabehsan optimaladjustmentsetsforcausalqueryestimationinpartiallyobservedbiomolecularnetworks
AT sachskaren optimaladjustmentsetsforcausalqueryestimationinpartiallyobservedbiomolecularnetworks
AT tapleyhoytcharles optimaladjustmentsetsforcausalqueryestimationinpartiallyobservedbiomolecularnetworks
AT zuckerjeremy optimaladjustmentsetsforcausalqueryestimationinpartiallyobservedbiomolecularnetworks
AT vitekolga optimaladjustmentsetsforcausalqueryestimationinpartiallyobservedbiomolecularnetworks