Cargando…

Predicting overlapping protein complexes based on core-attachment and a local modularity structure

BACKGROUND: In recent decades, detecting protein complexes (PCs) from protein-protein interaction networks (PPINs) has been an active area of research. There are a large number of excellent graph clustering methods that work very well for identifying PCs. However, most of existing methods usually ov...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Rongquan, Liu, Guixia, Wang, Caixia, Su, Lingtao, Sun, Liyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106838/
https://www.ncbi.nlm.nih.gov/pubmed/30134824
http://dx.doi.org/10.1186/s12859-018-2309-9
_version_ 1783349854745919488
author Wang, Rongquan
Liu, Guixia
Wang, Caixia
Su, Lingtao
Sun, Liyan
author_facet Wang, Rongquan
Liu, Guixia
Wang, Caixia
Su, Lingtao
Sun, Liyan
author_sort Wang, Rongquan
collection PubMed
description BACKGROUND: In recent decades, detecting protein complexes (PCs) from protein-protein interaction networks (PPINs) has been an active area of research. There are a large number of excellent graph clustering methods that work very well for identifying PCs. However, most of existing methods usually overlook the inherent core-attachment organization of PCs. Therefore, these methods have three major limitations we should concern. Firstly, many methods have ignored the importance of selecting seed, especially without considering the impact of overlapping nodes as seed nodes. Thus, there may be false predictions. Secondly, PCs are generally supposed to be dense subgraphs. However, the subgraphs with high local modularity structure usually correspond to PCs. Thirdly, a number of available methods lack handling noise mechanism, and miss some peripheral proteins. In summary, all these challenging issues are very important for predicting more biological overlapping PCs. RESULTS: In this paper, to overcome these weaknesses, we propose a clustering method by core-attachment and local modularity structure, named CALM, to detect overlapping PCs from weighted PPINs with noises. Firstly, we identify overlapping nodes and seed nodes. Secondly, for a node, we calculate the support function between a node and a cluster. In CALM, a cluster which initially consists of only a seed node, is extended by adding its direct neighboring nodes recursively according to the support function, until this cluster forms a locally optimal modularity subgraph. Thirdly, we repeat this process for the remaining seed nodes. Finally, merging and removing procedures are carried out to obtain final predicted clusters. The experimental results show that CALM outperforms other classical methods, and achieves ideal overall performance. Furthermore, CALM can match more complexes with a higher accuracy and provide a better one-to-one mapping with reference complexes in all test datasets. Additionally, CALM is robust against the high rate of noise PPIN. CONCLUSIONS: By considering core-attachment and local modularity structure, CALM could detect PCs much more effectively than some representative methods. In short, CALM could potentially identify previous undiscovered overlapping PCs with various density and high modularity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2309-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6106838
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61068382018-08-29 Predicting overlapping protein complexes based on core-attachment and a local modularity structure Wang, Rongquan Liu, Guixia Wang, Caixia Su, Lingtao Sun, Liyan BMC Bioinformatics Methodology Article BACKGROUND: In recent decades, detecting protein complexes (PCs) from protein-protein interaction networks (PPINs) has been an active area of research. There are a large number of excellent graph clustering methods that work very well for identifying PCs. However, most of existing methods usually overlook the inherent core-attachment organization of PCs. Therefore, these methods have three major limitations we should concern. Firstly, many methods have ignored the importance of selecting seed, especially without considering the impact of overlapping nodes as seed nodes. Thus, there may be false predictions. Secondly, PCs are generally supposed to be dense subgraphs. However, the subgraphs with high local modularity structure usually correspond to PCs. Thirdly, a number of available methods lack handling noise mechanism, and miss some peripheral proteins. In summary, all these challenging issues are very important for predicting more biological overlapping PCs. RESULTS: In this paper, to overcome these weaknesses, we propose a clustering method by core-attachment and local modularity structure, named CALM, to detect overlapping PCs from weighted PPINs with noises. Firstly, we identify overlapping nodes and seed nodes. Secondly, for a node, we calculate the support function between a node and a cluster. In CALM, a cluster which initially consists of only a seed node, is extended by adding its direct neighboring nodes recursively according to the support function, until this cluster forms a locally optimal modularity subgraph. Thirdly, we repeat this process for the remaining seed nodes. Finally, merging and removing procedures are carried out to obtain final predicted clusters. The experimental results show that CALM outperforms other classical methods, and achieves ideal overall performance. Furthermore, CALM can match more complexes with a higher accuracy and provide a better one-to-one mapping with reference complexes in all test datasets. Additionally, CALM is robust against the high rate of noise PPIN. CONCLUSIONS: By considering core-attachment and local modularity structure, CALM could detect PCs much more effectively than some representative methods. In short, CALM could potentially identify previous undiscovered overlapping PCs with various density and high modularity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2309-9) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-22 /pmc/articles/PMC6106838/ /pubmed/30134824 http://dx.doi.org/10.1186/s12859-018-2309-9 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wang, Rongquan
Liu, Guixia
Wang, Caixia
Su, Lingtao
Sun, Liyan
Predicting overlapping protein complexes based on core-attachment and a local modularity structure
title Predicting overlapping protein complexes based on core-attachment and a local modularity structure
title_full Predicting overlapping protein complexes based on core-attachment and a local modularity structure
title_fullStr Predicting overlapping protein complexes based on core-attachment and a local modularity structure
title_full_unstemmed Predicting overlapping protein complexes based on core-attachment and a local modularity structure
title_short Predicting overlapping protein complexes based on core-attachment and a local modularity structure
title_sort predicting overlapping protein complexes based on core-attachment and a local modularity structure
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106838/
https://www.ncbi.nlm.nih.gov/pubmed/30134824
http://dx.doi.org/10.1186/s12859-018-2309-9
work_keys_str_mv AT wangrongquan predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure
AT liuguixia predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure
AT wangcaixia predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure
AT sulingtao predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure
AT sunliyan predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure