Cargando…

Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction

BACKGROUND: Intuitively, proteins in the same protein complexes should highly interact with each other but rarely interact with the other proteins in protein-protein interaction (PPI) networks. Surprisingly, many existing computational algorithms do not directly detect protein complexes based on bot...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yijie, Qian, Xiaoning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5475323/
https://www.ncbi.nlm.nih.gov/pubmed/28361714
http://dx.doi.org/10.1186/s12918-017-0405-5
_version_ 1783244541931814912
author Wang, Yijie
Qian, Xiaoning
author_facet Wang, Yijie
Qian, Xiaoning
author_sort Wang, Yijie
collection PubMed
description BACKGROUND: Intuitively, proteins in the same protein complexes should highly interact with each other but rarely interact with the other proteins in protein-protein interaction (PPI) networks. Surprisingly, many existing computational algorithms do not directly detect protein complexes based on both of these topological properties. Most of them, depending on mathematical definitions of either “modularity” or “conductance”, have their own limitations: Modularity has the inherent resolution problem ignoring small protein complexes; and conductance characterizes the separability of complexes but fails to capture the interaction density within complexes. RESULTS: In this paper, we propose a two-step algorithm FLCD (Finding Low-Conductance sets with Dense interactions) to predict overlapping protein complexes with the desired topological structure, which is densely connected inside and well separated from the rest of the networks. First, FLCD detects well-separated subnetworks based on approximating a potential low-conductance set through a personalized PageRank vector from a protein and then solving a mixed integer programming (MIP) problem to find the minimum-conductance set within the identified low-conductance set. At the second step, the densely connected parts in those subnetworks are discovered as the protein complexes by solving another MIP problem that aims to find the dense subnetwork in the minimum-conductance set. CONCLUSION: Experiments on four large-scale yeast PPI networks from different public databases demonstrate that the complexes predicted by FLCD have better correspondence with the yeast protein complex gold standards than other three state-of-the-art algorithms (ClusterONE, LinkComm, and SR-MCL). Additionally, results of FLCD show higher biological relevance with respect to Gene Ontology (GO) terms by GO enrichment analysis.
format Online
Article
Text
id pubmed-5475323
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54753232017-06-22 Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction Wang, Yijie Qian, Xiaoning BMC Syst Biol Research BACKGROUND: Intuitively, proteins in the same protein complexes should highly interact with each other but rarely interact with the other proteins in protein-protein interaction (PPI) networks. Surprisingly, many existing computational algorithms do not directly detect protein complexes based on both of these topological properties. Most of them, depending on mathematical definitions of either “modularity” or “conductance”, have their own limitations: Modularity has the inherent resolution problem ignoring small protein complexes; and conductance characterizes the separability of complexes but fails to capture the interaction density within complexes. RESULTS: In this paper, we propose a two-step algorithm FLCD (Finding Low-Conductance sets with Dense interactions) to predict overlapping protein complexes with the desired topological structure, which is densely connected inside and well separated from the rest of the networks. First, FLCD detects well-separated subnetworks based on approximating a potential low-conductance set through a personalized PageRank vector from a protein and then solving a mixed integer programming (MIP) problem to find the minimum-conductance set within the identified low-conductance set. At the second step, the densely connected parts in those subnetworks are discovered as the protein complexes by solving another MIP problem that aims to find the dense subnetwork in the minimum-conductance set. CONCLUSION: Experiments on four large-scale yeast PPI networks from different public databases demonstrate that the complexes predicted by FLCD have better correspondence with the yeast protein complex gold standards than other three state-of-the-art algorithms (ClusterONE, LinkComm, and SR-MCL). Additionally, results of FLCD show higher biological relevance with respect to Gene Ontology (GO) terms by GO enrichment analysis. BioMed Central 2017-03-14 /pmc/articles/PMC5475323/ /pubmed/28361714 http://dx.doi.org/10.1186/s12918-017-0405-5 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Wang, Yijie
Qian, Xiaoning
Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction
title Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction
title_full Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction
title_fullStr Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction
title_full_unstemmed Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction
title_short Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction
title_sort finding low-conductance sets with dense interactions (flcd) for better protein complex prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5475323/
https://www.ncbi.nlm.nih.gov/pubmed/28361714
http://dx.doi.org/10.1186/s12918-017-0405-5
work_keys_str_mv AT wangyijie findinglowconductancesetswithdenseinteractionsflcdforbetterproteincomplexprediction
AT qianxiaoning findinglowconductancesetswithdenseinteractionsflcdforbetterproteincomplexprediction