Cargando…

Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure

In this article we use high-throughput epigenomics, transcriptomics, and proteomics data to construct fine-graded models of the “protein-coding units” gathering all transcript isoforms and chromatin accessibility peaks associated with more than 4000 genes in humans. Each protein-coding unit has the...

Descripción completa

Detalles Bibliográficos
Autores principales: Zenere, Alberto, Rundquist, Olof, Gustafsson, Mika, Altafini, Claudio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8958332/
https://www.ncbi.nlm.nih.gov/pubmed/35355520
http://dx.doi.org/10.1016/j.isci.2022.104048
_version_ 1784676921915736064
author Zenere, Alberto
Rundquist, Olof
Gustafsson, Mika
Altafini, Claudio
author_facet Zenere, Alberto
Rundquist, Olof
Gustafsson, Mika
Altafini, Claudio
author_sort Zenere, Alberto
collection PubMed
description In this article we use high-throughput epigenomics, transcriptomics, and proteomics data to construct fine-graded models of the “protein-coding units” gathering all transcript isoforms and chromatin accessibility peaks associated with more than 4000 genes in humans. Each protein-coding unit has the structure of a directed acyclic graph (DAG) and can be represented as a Bayesian network. The factorization of the joint probability distribution induced by the DAGs imposes a number of conditional independence relationships among the variables forming a protein-coding unit, corresponding to the missing edges in the DAGs. We show that a large fraction of these conditional independencies are indeed verified by the data. Factors driving this verification appear to be the structural and functional annotation of the transcript isoforms, as well as a notion of structural balance (or frustration-free) of the corresponding sample correlation graph, which naturally leads to reduction of correlation (and hence to independence) upon conditioning.
format Online
Article
Text
id pubmed-8958332
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-89583322022-03-29 Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure Zenere, Alberto Rundquist, Olof Gustafsson, Mika Altafini, Claudio iScience Article In this article we use high-throughput epigenomics, transcriptomics, and proteomics data to construct fine-graded models of the “protein-coding units” gathering all transcript isoforms and chromatin accessibility peaks associated with more than 4000 genes in humans. Each protein-coding unit has the structure of a directed acyclic graph (DAG) and can be represented as a Bayesian network. The factorization of the joint probability distribution induced by the DAGs imposes a number of conditional independence relationships among the variables forming a protein-coding unit, corresponding to the missing edges in the DAGs. We show that a large fraction of these conditional independencies are indeed verified by the data. Factors driving this verification appear to be the structural and functional annotation of the transcript isoforms, as well as a notion of structural balance (or frustration-free) of the corresponding sample correlation graph, which naturally leads to reduction of correlation (and hence to independence) upon conditioning. Elsevier 2022-03-11 /pmc/articles/PMC8958332/ /pubmed/35355520 http://dx.doi.org/10.1016/j.isci.2022.104048 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zenere, Alberto
Rundquist, Olof
Gustafsson, Mika
Altafini, Claudio
Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure
title Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure
title_full Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure
title_fullStr Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure
title_full_unstemmed Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure
title_short Multi-omics protein-coding units as massively parallel Bayesian networks: Empirical validation of causality structure
title_sort multi-omics protein-coding units as massively parallel bayesian networks: empirical validation of causality structure
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8958332/
https://www.ncbi.nlm.nih.gov/pubmed/35355520
http://dx.doi.org/10.1016/j.isci.2022.104048
work_keys_str_mv AT zenerealberto multiomicsproteincodingunitsasmassivelyparallelbayesiannetworksempiricalvalidationofcausalitystructure
AT rundquistolof multiomicsproteincodingunitsasmassivelyparallelbayesiannetworksempiricalvalidationofcausalitystructure
AT gustafssonmika multiomicsproteincodingunitsasmassivelyparallelbayesiannetworksempiricalvalidationofcausalitystructure
AT altafiniclaudio multiomicsproteincodingunitsasmassivelyparallelbayesiannetworksempiricalvalidationofcausalitystructure