Cargando…

Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks

BACKGROUND: Systematic fusion of multiple data sources for Gene Regulatory Networks (GRN) inference remains a key challenge in systems biology. We incorporate information from protein-protein interaction networks (PPIN) into the process of GRN inference from gene expression (GE) data. However, exist...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Wenting, Rajapakse, Jagath C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6449891/
https://www.ncbi.nlm.nih.gov/pubmed/30953534
http://dx.doi.org/10.1186/s12918-019-0695-x
_version_ 1783408943173730304
author Liu, Wenting
Rajapakse, Jagath C.
author_facet Liu, Wenting
Rajapakse, Jagath C.
author_sort Liu, Wenting
collection PubMed
description BACKGROUND: Systematic fusion of multiple data sources for Gene Regulatory Networks (GRN) inference remains a key challenge in systems biology. We incorporate information from protein-protein interaction networks (PPIN) into the process of GRN inference from gene expression (GE) data. However, existing PPIN remain sparse and transitive protein interactions can help predict missing protein interactions. We therefore propose a systematic probabilistic framework on fusing GE data and transitive protein interaction data to coherently build GRN. RESULTS: We use a Gaussian Mixture Model (GMM) to soft-cluster GE data, allowing overlapping cluster memberships. Next, a heuristic method is proposed to extend sparse PPIN by incorporating transitive linkages. We then propose a novel way to score extended protein interactions by combining topological properties of PPIN and correlations of GE. Following this, GE data and extended PPIN are fused using a Gaussian Hidden Markov Model (GHMM) in order to identify gene regulatory pathways and refine interaction scores that are then used to constrain the GRN structure. We employ a Bayesian Gaussian Mixture (BGM) model to refine the GRN derived from GE data by using the structural priors derived from GHMM. Experiments on real yeast regulatory networks demonstrate both the feasibility of the extended PPIN in predicting transitive protein interactions and its effectiveness on improving the coverage and accuracy the proposed method of fusing PPIN and GE to build GRN. CONCLUSION: The GE and PPIN fusion model outperforms both the state-of-the-art single data source models (CLR, GENIE3, TIGRESS) as well as existing fusion models under various constraints.
format Online
Article
Text
id pubmed-6449891
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64498912019-04-15 Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks Liu, Wenting Rajapakse, Jagath C. BMC Syst Biol Research BACKGROUND: Systematic fusion of multiple data sources for Gene Regulatory Networks (GRN) inference remains a key challenge in systems biology. We incorporate information from protein-protein interaction networks (PPIN) into the process of GRN inference from gene expression (GE) data. However, existing PPIN remain sparse and transitive protein interactions can help predict missing protein interactions. We therefore propose a systematic probabilistic framework on fusing GE data and transitive protein interaction data to coherently build GRN. RESULTS: We use a Gaussian Mixture Model (GMM) to soft-cluster GE data, allowing overlapping cluster memberships. Next, a heuristic method is proposed to extend sparse PPIN by incorporating transitive linkages. We then propose a novel way to score extended protein interactions by combining topological properties of PPIN and correlations of GE. Following this, GE data and extended PPIN are fused using a Gaussian Hidden Markov Model (GHMM) in order to identify gene regulatory pathways and refine interaction scores that are then used to constrain the GRN structure. We employ a Bayesian Gaussian Mixture (BGM) model to refine the GRN derived from GE data by using the structural priors derived from GHMM. Experiments on real yeast regulatory networks demonstrate both the feasibility of the extended PPIN in predicting transitive protein interactions and its effectiveness on improving the coverage and accuracy the proposed method of fusing PPIN and GE to build GRN. CONCLUSION: The GE and PPIN fusion model outperforms both the state-of-the-art single data source models (CLR, GENIE3, TIGRESS) as well as existing fusion models under various constraints. BioMed Central 2019-04-05 /pmc/articles/PMC6449891/ /pubmed/30953534 http://dx.doi.org/10.1186/s12918-019-0695-x Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Liu, Wenting
Rajapakse, Jagath C.
Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks
title Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks
title_full Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks
title_fullStr Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks
title_full_unstemmed Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks
title_short Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks
title_sort fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6449891/
https://www.ncbi.nlm.nih.gov/pubmed/30953534
http://dx.doi.org/10.1186/s12918-019-0695-x
work_keys_str_mv AT liuwenting fusinggeneexpressionsandtransitiveproteinproteininteractionsforinferenceofgeneregulatorynetworks
AT rajapaksejagathc fusinggeneexpressionsandtransitiveproteinproteininteractionsforinferenceofgeneregulatorynetworks