Cargando…
Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure
In this article we use high-throughput epigenomics, transcriptomics, and proteomics data to construct fine-graded models of the “protein-coding units” gathering all transcript isoforms and chromatin accessibility peaks associated with more than 4000 genes in humans. Each protein-coding unit has the...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8958332/ https://www.ncbi.nlm.nih.gov/pubmed/35355520 http://dx.doi.org/10.1016/j.isci.2022.104048 |
_version_ | 1784676921915736064 |
---|---|
author | Zenere, Alberto Rundquist, Olof Gustafsson, Mika Altafini, Claudio |
author_facet | Zenere, Alberto Rundquist, Olof Gustafsson, Mika Altafini, Claudio |
author_sort | Zenere, Alberto |
collection | PubMed |
description | In this article we use high-throughput epigenomics, transcriptomics, and proteomics data to construct fine-graded models of the “protein-coding units” gathering all transcript isoforms and chromatin accessibility peaks associated with more than 4000 genes in humans. Each protein-coding unit has the structure of a directed acyclic graph (DAG) and can be represented as a Bayesian network. The factorization of the joint probability distribution induced by the DAGs imposes a number of conditional independence relationships among the variables forming a protein-coding unit, corresponding to the missing edges in the DAGs. We show that a large fraction of these conditional independencies are indeed verified by the data. Factors driving this verification appear to be the structural and functional annotation of the transcript isoforms, as well as a notion of structural balance (or frustration-free) of the corresponding sample correlation graph, which naturally leads to reduction of correlation (and hence to independence) upon conditioning. |
format | Online Article Text |
id | pubmed-8958332 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-89583322022-03-29 Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure Zenere, Alberto Rundquist, Olof Gustafsson, Mika Altafini, Claudio iScience Article In this article we use high-throughput epigenomics, transcriptomics, and proteomics data to construct fine-graded models of the “protein-coding units” gathering all transcript isoforms and chromatin accessibility peaks associated with more than 4000 genes in humans. Each protein-coding unit has the structure of a directed acyclic graph (DAG) and can be represented as a Bayesian network. The factorization of the joint probability distribution induced by the DAGs imposes a number of conditional independence relationships among the variables forming a protein-coding unit, corresponding to the missing edges in the DAGs. We show that a large fraction of these conditional independencies are indeed verified by the data. Factors driving this verification appear to be the structural and functional annotation of the transcript isoforms, as well as a notion of structural balance (or frustration-free) of the corresponding sample correlation graph, which naturally leads to reduction of correlation (and hence to independence) upon conditioning. Elsevier 2022-03-11 /pmc/articles/PMC8958332/ /pubmed/35355520 http://dx.doi.org/10.1016/j.isci.2022.104048 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zenere, Alberto Rundquist, Olof Gustafsson, Mika Altafini, Claudio Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure |
title | Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure |
title_full | Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure |
title_fullStr | Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure |
title_full_unstemmed | Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure |
title_short | Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure |
title_sort | multi-omics protein-coding units as massively parallel bayesian networks: empirical validation of causality structure |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8958332/ https://www.ncbi.nlm.nih.gov/pubmed/35355520 http://dx.doi.org/10.1016/j.isci.2022.104048 |
work_keys_str_mv | AT zenerealberto multiomicsproteincodingunitsasmassivelyparallelbayesiannetworksempiricalvalidationofcausalitystructure AT rundquistolof multiomicsproteincodingunitsasmassivelyparallelbayesiannetworksempiricalvalidationofcausalitystructure AT gustafssonmika multiomicsproteincodingunitsasmassivelyparallelbayesiannetworksempiricalvalidationofcausalitystructure AT altafiniclaudio multiomicsproteincodingunitsasmassivelyparallelbayesiannetworksempiricalvalidationofcausalitystructure |