Cargando…
A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks
Most proteins perform their biological functions while interacting as complexes. The detection of protein complexes is an important task not only for understanding the relationship between functions and structures of biological network, but also for predicting the function of unknown proteins. We pr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6150027/ https://www.ncbi.nlm.nih.gov/pubmed/29292776 http://dx.doi.org/10.3390/molecules22122179 |
_version_ | 1783356919313858560 |
---|---|
author | Wang, Jie Zheng, Wenping Qian, Yuhua Liang, Jiye |
author_facet | Wang, Jie Zheng, Wenping Qian, Yuhua Liang, Jiye |
author_sort | Wang, Jie |
collection | PubMed |
description | Most proteins perform their biological functions while interacting as complexes. The detection of protein complexes is an important task not only for understanding the relationship between functions and structures of biological network, but also for predicting the function of unknown proteins. We present a new nodal metric by integrating its local topological information. The metric reflects its representability in a larger local neighborhood to a cluster of a protein interaction (PPI) network. Based on the metric, we propose a seed-expansion graph clustering algorithm (SEGC) for protein complexes detection in PPI networks. A roulette wheel strategy is used in the selection of the seed to enhance the diversity of clustering. For a candidate node u, we define its closeness to a cluster C, denoted as NC(u, C), by combing the density of a cluster C and the connection between a node u and C. In SEGC, a cluster which initially consists of only a seed node, is extended by adding nodes recursively from its neighbors according to the closeness, until all neighbors fail the process of expansion. We compare the F-measure and accuracy of the proposed SEGC algorithm with other algorithms on Saccharomyces cerevisiae protein interaction networks. The experimental results show that SEGC outperforms other algorithms under full coverage. |
format | Online Article Text |
id | pubmed-6150027 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-61500272018-11-13 A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks Wang, Jie Zheng, Wenping Qian, Yuhua Liang, Jiye Molecules Article Most proteins perform their biological functions while interacting as complexes. The detection of protein complexes is an important task not only for understanding the relationship between functions and structures of biological network, but also for predicting the function of unknown proteins. We present a new nodal metric by integrating its local topological information. The metric reflects its representability in a larger local neighborhood to a cluster of a protein interaction (PPI) network. Based on the metric, we propose a seed-expansion graph clustering algorithm (SEGC) for protein complexes detection in PPI networks. A roulette wheel strategy is used in the selection of the seed to enhance the diversity of clustering. For a candidate node u, we define its closeness to a cluster C, denoted as NC(u, C), by combing the density of a cluster C and the connection between a node u and C. In SEGC, a cluster which initially consists of only a seed node, is extended by adding nodes recursively from its neighbors according to the closeness, until all neighbors fail the process of expansion. We compare the F-measure and accuracy of the proposed SEGC algorithm with other algorithms on Saccharomyces cerevisiae protein interaction networks. The experimental results show that SEGC outperforms other algorithms under full coverage. MDPI 2017-12-08 /pmc/articles/PMC6150027/ /pubmed/29292776 http://dx.doi.org/10.3390/molecules22122179 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Jie Zheng, Wenping Qian, Yuhua Liang, Jiye A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks |
title | A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks |
title_full | A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks |
title_fullStr | A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks |
title_full_unstemmed | A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks |
title_short | A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks |
title_sort | seed expansion graph clustering method for protein complexes detection in protein interaction networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6150027/ https://www.ncbi.nlm.nih.gov/pubmed/29292776 http://dx.doi.org/10.3390/molecules22122179 |
work_keys_str_mv | AT wangjie aseedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks AT zhengwenping aseedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks AT qianyuhua aseedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks AT liangjiye aseedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks AT wangjie seedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks AT zhengwenping seedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks AT qianyuhua seedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks AT liangjiye seedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks |