Cargando…

Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery

Causal discovery of genome-scale networks is important for identifying pathways from genes to observable traits –e.g. differences in cell function, disease, drug resistance and others. Causal learners based on graphical models rely on interventional samples to orient edges in the network. However, t...

Descripción completa

Detalles Bibliográficos
Autores principales: Shah, Ashka, Ramanathan, Arvind, Hayot-Sasson, Valerie, Stevens, Rick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cornell University 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104184/
https://www.ncbi.nlm.nih.gov/pubmed/37064526
_version_ 1785025986103869440
author Shah, Ashka
Ramanathan, Arvind
Hayot-Sasson, Valerie
Stevens, Rick
author_facet Shah, Ashka
Ramanathan, Arvind
Hayot-Sasson, Valerie
Stevens, Rick
author_sort Shah, Ashka
collection PubMed
description Causal discovery of genome-scale networks is important for identifying pathways from genes to observable traits –e.g. differences in cell function, disease, drug resistance and others. Causal learners based on graphical models rely on interventional samples to orient edges in the network. However, these models have not been shown to scale up the size of the genome, which are on the order of 10(3)-10(4) genes. We introduce a new learner, SP-GIES, that jointly learns from interventional and observational datasets and achieves almost 4x speedup against an existing learner for 1,000 node networks. SP-GIES achieves an AUC-PR score of 0.91 on 1,000 node networks, and scales up to 2,000 node networks – this is 4x larger than existing works. We also show how SP-GIES improves downstream optimal experimental design strategies for selecting interventional experiments to perform on the system. This is an important step forward in realizing causal discovery at scale via autonomous experimental design.
format Online
Article
Text
id pubmed-10104184
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cornell University
record_format MEDLINE/PubMed
spelling pubmed-101041842023-04-15 Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery Shah, Ashka Ramanathan, Arvind Hayot-Sasson, Valerie Stevens, Rick ArXiv Article Causal discovery of genome-scale networks is important for identifying pathways from genes to observable traits –e.g. differences in cell function, disease, drug resistance and others. Causal learners based on graphical models rely on interventional samples to orient edges in the network. However, these models have not been shown to scale up the size of the genome, which are on the order of 10(3)-10(4) genes. We introduce a new learner, SP-GIES, that jointly learns from interventional and observational datasets and achieves almost 4x speedup against an existing learner for 1,000 node networks. SP-GIES achieves an AUC-PR score of 0.91 on 1,000 node networks, and scales up to 2,000 node networks – this is 4x larger than existing works. We also show how SP-GIES improves downstream optimal experimental design strategies for selecting interventional experiments to perform on the system. This is an important step forward in realizing causal discovery at scale via autonomous experimental design. Cornell University 2023-04-06 /pmc/articles/PMC10104184/ /pubmed/37064526 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Shah, Ashka
Ramanathan, Arvind
Hayot-Sasson, Valerie
Stevens, Rick
Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery
title Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery
title_full Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery
title_fullStr Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery
title_full_unstemmed Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery
title_short Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery
title_sort causal discovery and optimal experimental design for genome-scale biological network recovery
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104184/
https://www.ncbi.nlm.nih.gov/pubmed/37064526
work_keys_str_mv AT shahashka causaldiscoveryandoptimalexperimentaldesignforgenomescalebiologicalnetworkrecovery
AT ramanathanarvind causaldiscoveryandoptimalexperimentaldesignforgenomescalebiologicalnetworkrecovery
AT hayotsassonvalerie causaldiscoveryandoptimalexperimentaldesignforgenomescalebiologicalnetworkrecovery
AT stevensrick causaldiscoveryandoptimalexperimentaldesignforgenomescalebiologicalnetworkrecovery