Cargando…

An effective approach to detecting both small and large complexes from protein-protein interaction networks

BACKGROUND: Predicting protein complexes from protein-protein interaction (PPI) networks has been studied for decade. Various methods have been proposed to address some challenging issues of this problem, including overlapping clusters, high false positive/negative rates of PPI data and diverse comp...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Bin, Wang, Yang, Wang, Zewei, Zhou, Jiaogen, Zhou, Shuigeng, Guan, Jihong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5657047/
https://www.ncbi.nlm.nih.gov/pubmed/29072136
http://dx.doi.org/10.1186/s12859-017-1820-8
Descripción
Sumario:BACKGROUND: Predicting protein complexes from protein-protein interaction (PPI) networks has been studied for decade. Various methods have been proposed to address some challenging issues of this problem, including overlapping clusters, high false positive/negative rates of PPI data and diverse complex structures. It is well known that most current methods can detect effectively only complexes of size ≥3, which account for only about half of the total existing complexes. Recently, a method was proposed specifically for finding small complexes (size = 2 and 3) from PPI networks. However, up to now there is no effective approach that can predict both small (size ≤ 3) and large (size >3) complexes from PPI networks. RESULTS: In this paper, we propose a novel method, called CPredictor2.0, that can detect both small and large complexes under a unified framework. Concretely, we first group proteins of similar functions. Then, the Markov clustering algorithm is employed to discover clusters in each group. Finally, we merge all discovered clusters that overlap with each other to a certain degree, and the merged clusters as well as the remaining clusters constitute the set of detected complexes. Extensive experiments have shown that the new method can more effectively predict both small and large complexes, in comparison with the state-of-the-art methods. CONCLUSIONS: The proposed method, CPredictor2.0, can be applied to accurately predict both small and large protein complexes.