Cargando…
Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction
BACKGROUND: Intuitively, proteins in the same protein complexes should highly interact with each other but rarely interact with the other proteins in protein-protein interaction (PPI) networks. Surprisingly, many existing computational algorithms do not directly detect protein complexes based on bot...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5475323/ https://www.ncbi.nlm.nih.gov/pubmed/28361714 http://dx.doi.org/10.1186/s12918-017-0405-5 |
_version_ | 1783244541931814912 |
---|---|
author | Wang, Yijie Qian, Xiaoning |
author_facet | Wang, Yijie Qian, Xiaoning |
author_sort | Wang, Yijie |
collection | PubMed |
description | BACKGROUND: Intuitively, proteins in the same protein complexes should highly interact with each other but rarely interact with the other proteins in protein-protein interaction (PPI) networks. Surprisingly, many existing computational algorithms do not directly detect protein complexes based on both of these topological properties. Most of them, depending on mathematical definitions of either “modularity” or “conductance”, have their own limitations: Modularity has the inherent resolution problem ignoring small protein complexes; and conductance characterizes the separability of complexes but fails to capture the interaction density within complexes. RESULTS: In this paper, we propose a two-step algorithm FLCD (Finding Low-Conductance sets with Dense interactions) to predict overlapping protein complexes with the desired topological structure, which is densely connected inside and well separated from the rest of the networks. First, FLCD detects well-separated subnetworks based on approximating a potential low-conductance set through a personalized PageRank vector from a protein and then solving a mixed integer programming (MIP) problem to find the minimum-conductance set within the identified low-conductance set. At the second step, the densely connected parts in those subnetworks are discovered as the protein complexes by solving another MIP problem that aims to find the dense subnetwork in the minimum-conductance set. CONCLUSION: Experiments on four large-scale yeast PPI networks from different public databases demonstrate that the complexes predicted by FLCD have better correspondence with the yeast protein complex gold standards than other three state-of-the-art algorithms (ClusterONE, LinkComm, and SR-MCL). Additionally, results of FLCD show higher biological relevance with respect to Gene Ontology (GO) terms by GO enrichment analysis. |
format | Online Article Text |
id | pubmed-5475323 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54753232017-06-22 Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction Wang, Yijie Qian, Xiaoning BMC Syst Biol Research BACKGROUND: Intuitively, proteins in the same protein complexes should highly interact with each other but rarely interact with the other proteins in protein-protein interaction (PPI) networks. Surprisingly, many existing computational algorithms do not directly detect protein complexes based on both of these topological properties. Most of them, depending on mathematical definitions of either “modularity” or “conductance”, have their own limitations: Modularity has the inherent resolution problem ignoring small protein complexes; and conductance characterizes the separability of complexes but fails to capture the interaction density within complexes. RESULTS: In this paper, we propose a two-step algorithm FLCD (Finding Low-Conductance sets with Dense interactions) to predict overlapping protein complexes with the desired topological structure, which is densely connected inside and well separated from the rest of the networks. First, FLCD detects well-separated subnetworks based on approximating a potential low-conductance set through a personalized PageRank vector from a protein and then solving a mixed integer programming (MIP) problem to find the minimum-conductance set within the identified low-conductance set. At the second step, the densely connected parts in those subnetworks are discovered as the protein complexes by solving another MIP problem that aims to find the dense subnetwork in the minimum-conductance set. CONCLUSION: Experiments on four large-scale yeast PPI networks from different public databases demonstrate that the complexes predicted by FLCD have better correspondence with the yeast protein complex gold standards than other three state-of-the-art algorithms (ClusterONE, LinkComm, and SR-MCL). Additionally, results of FLCD show higher biological relevance with respect to Gene Ontology (GO) terms by GO enrichment analysis. BioMed Central 2017-03-14 /pmc/articles/PMC5475323/ /pubmed/28361714 http://dx.doi.org/10.1186/s12918-017-0405-5 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Wang, Yijie Qian, Xiaoning Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction |
title | Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction |
title_full | Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction |
title_fullStr | Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction |
title_full_unstemmed | Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction |
title_short | Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction |
title_sort | finding low-conductance sets with dense interactions (flcd) for better protein complex prediction |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5475323/ https://www.ncbi.nlm.nih.gov/pubmed/28361714 http://dx.doi.org/10.1186/s12918-017-0405-5 |
work_keys_str_mv | AT wangyijie findinglowconductancesetswithdenseinteractionsflcdforbetterproteincomplexprediction AT qianxiaoning findinglowconductancesetswithdenseinteractionsflcdforbetterproteincomplexprediction |