Cargando…
Predicting overlapping protein complexes based on core-attachment and a local modularity structure
BACKGROUND: In recent decades, detecting protein complexes (PCs) from protein-protein interaction networks (PPINs) has been an active area of research. There are a large number of excellent graph clustering methods that work very well for identifying PCs. However, most of existing methods usually ov...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106838/ https://www.ncbi.nlm.nih.gov/pubmed/30134824 http://dx.doi.org/10.1186/s12859-018-2309-9 |
_version_ | 1783349854745919488 |
---|---|
author | Wang, Rongquan Liu, Guixia Wang, Caixia Su, Lingtao Sun, Liyan |
author_facet | Wang, Rongquan Liu, Guixia Wang, Caixia Su, Lingtao Sun, Liyan |
author_sort | Wang, Rongquan |
collection | PubMed |
description | BACKGROUND: In recent decades, detecting protein complexes (PCs) from protein-protein interaction networks (PPINs) has been an active area of research. There are a large number of excellent graph clustering methods that work very well for identifying PCs. However, most of existing methods usually overlook the inherent core-attachment organization of PCs. Therefore, these methods have three major limitations we should concern. Firstly, many methods have ignored the importance of selecting seed, especially without considering the impact of overlapping nodes as seed nodes. Thus, there may be false predictions. Secondly, PCs are generally supposed to be dense subgraphs. However, the subgraphs with high local modularity structure usually correspond to PCs. Thirdly, a number of available methods lack handling noise mechanism, and miss some peripheral proteins. In summary, all these challenging issues are very important for predicting more biological overlapping PCs. RESULTS: In this paper, to overcome these weaknesses, we propose a clustering method by core-attachment and local modularity structure, named CALM, to detect overlapping PCs from weighted PPINs with noises. Firstly, we identify overlapping nodes and seed nodes. Secondly, for a node, we calculate the support function between a node and a cluster. In CALM, a cluster which initially consists of only a seed node, is extended by adding its direct neighboring nodes recursively according to the support function, until this cluster forms a locally optimal modularity subgraph. Thirdly, we repeat this process for the remaining seed nodes. Finally, merging and removing procedures are carried out to obtain final predicted clusters. The experimental results show that CALM outperforms other classical methods, and achieves ideal overall performance. Furthermore, CALM can match more complexes with a higher accuracy and provide a better one-to-one mapping with reference complexes in all test datasets. Additionally, CALM is robust against the high rate of noise PPIN. CONCLUSIONS: By considering core-attachment and local modularity structure, CALM could detect PCs much more effectively than some representative methods. In short, CALM could potentially identify previous undiscovered overlapping PCs with various density and high modularity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2309-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6106838 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61068382018-08-29 Predicting overlapping protein complexes based on core-attachment and a local modularity structure Wang, Rongquan Liu, Guixia Wang, Caixia Su, Lingtao Sun, Liyan BMC Bioinformatics Methodology Article BACKGROUND: In recent decades, detecting protein complexes (PCs) from protein-protein interaction networks (PPINs) has been an active area of research. There are a large number of excellent graph clustering methods that work very well for identifying PCs. However, most of existing methods usually overlook the inherent core-attachment organization of PCs. Therefore, these methods have three major limitations we should concern. Firstly, many methods have ignored the importance of selecting seed, especially without considering the impact of overlapping nodes as seed nodes. Thus, there may be false predictions. Secondly, PCs are generally supposed to be dense subgraphs. However, the subgraphs with high local modularity structure usually correspond to PCs. Thirdly, a number of available methods lack handling noise mechanism, and miss some peripheral proteins. In summary, all these challenging issues are very important for predicting more biological overlapping PCs. RESULTS: In this paper, to overcome these weaknesses, we propose a clustering method by core-attachment and local modularity structure, named CALM, to detect overlapping PCs from weighted PPINs with noises. Firstly, we identify overlapping nodes and seed nodes. Secondly, for a node, we calculate the support function between a node and a cluster. In CALM, a cluster which initially consists of only a seed node, is extended by adding its direct neighboring nodes recursively according to the support function, until this cluster forms a locally optimal modularity subgraph. Thirdly, we repeat this process for the remaining seed nodes. Finally, merging and removing procedures are carried out to obtain final predicted clusters. The experimental results show that CALM outperforms other classical methods, and achieves ideal overall performance. Furthermore, CALM can match more complexes with a higher accuracy and provide a better one-to-one mapping with reference complexes in all test datasets. Additionally, CALM is robust against the high rate of noise PPIN. CONCLUSIONS: By considering core-attachment and local modularity structure, CALM could detect PCs much more effectively than some representative methods. In short, CALM could potentially identify previous undiscovered overlapping PCs with various density and high modularity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2309-9) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-22 /pmc/articles/PMC6106838/ /pubmed/30134824 http://dx.doi.org/10.1186/s12859-018-2309-9 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Wang, Rongquan Liu, Guixia Wang, Caixia Su, Lingtao Sun, Liyan Predicting overlapping protein complexes based on core-attachment and a local modularity structure |
title | Predicting overlapping protein complexes based on core-attachment and a local modularity structure |
title_full | Predicting overlapping protein complexes based on core-attachment and a local modularity structure |
title_fullStr | Predicting overlapping protein complexes based on core-attachment and a local modularity structure |
title_full_unstemmed | Predicting overlapping protein complexes based on core-attachment and a local modularity structure |
title_short | Predicting overlapping protein complexes based on core-attachment and a local modularity structure |
title_sort | predicting overlapping protein complexes based on core-attachment and a local modularity structure |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106838/ https://www.ncbi.nlm.nih.gov/pubmed/30134824 http://dx.doi.org/10.1186/s12859-018-2309-9 |
work_keys_str_mv | AT wangrongquan predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure AT liuguixia predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure AT wangcaixia predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure AT sulingtao predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure AT sunliyan predictingoverlappingproteincomplexesbasedoncoreattachmentandalocalmodularitystructure |