Cargando…

A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks

Most proteins perform their biological functions while interacting as complexes. The detection of protein complexes is an important task not only for understanding the relationship between functions and structures of biological network, but also for predicting the function of unknown proteins. We pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jie, Zheng, Wenping, Qian, Yuhua, Liang, Jiye
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6150027/
https://www.ncbi.nlm.nih.gov/pubmed/29292776
http://dx.doi.org/10.3390/molecules22122179
_version_ 1783356919313858560
author Wang, Jie
Zheng, Wenping
Qian, Yuhua
Liang, Jiye
author_facet Wang, Jie
Zheng, Wenping
Qian, Yuhua
Liang, Jiye
author_sort Wang, Jie
collection PubMed
description Most proteins perform their biological functions while interacting as complexes. The detection of protein complexes is an important task not only for understanding the relationship between functions and structures of biological network, but also for predicting the function of unknown proteins. We present a new nodal metric by integrating its local topological information. The metric reflects its representability in a larger local neighborhood to a cluster of a protein interaction (PPI) network. Based on the metric, we propose a seed-expansion graph clustering algorithm (SEGC) for protein complexes detection in PPI networks. A roulette wheel strategy is used in the selection of the seed to enhance the diversity of clustering. For a candidate node u, we define its closeness to a cluster C, denoted as NC(u, C), by combing the density of a cluster C and the connection between a node u and C. In SEGC, a cluster which initially consists of only a seed node, is extended by adding nodes recursively from its neighbors according to the closeness, until all neighbors fail the process of expansion. We compare the F-measure and accuracy of the proposed SEGC algorithm with other algorithms on Saccharomyces cerevisiae protein interaction networks. The experimental results show that SEGC outperforms other algorithms under full coverage.
format Online
Article
Text
id pubmed-6150027
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-61500272018-11-13 A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks Wang, Jie Zheng, Wenping Qian, Yuhua Liang, Jiye Molecules Article Most proteins perform their biological functions while interacting as complexes. The detection of protein complexes is an important task not only for understanding the relationship between functions and structures of biological network, but also for predicting the function of unknown proteins. We present a new nodal metric by integrating its local topological information. The metric reflects its representability in a larger local neighborhood to a cluster of a protein interaction (PPI) network. Based on the metric, we propose a seed-expansion graph clustering algorithm (SEGC) for protein complexes detection in PPI networks. A roulette wheel strategy is used in the selection of the seed to enhance the diversity of clustering. For a candidate node u, we define its closeness to a cluster C, denoted as NC(u, C), by combing the density of a cluster C and the connection between a node u and C. In SEGC, a cluster which initially consists of only a seed node, is extended by adding nodes recursively from its neighbors according to the closeness, until all neighbors fail the process of expansion. We compare the F-measure and accuracy of the proposed SEGC algorithm with other algorithms on Saccharomyces cerevisiae protein interaction networks. The experimental results show that SEGC outperforms other algorithms under full coverage. MDPI 2017-12-08 /pmc/articles/PMC6150027/ /pubmed/29292776 http://dx.doi.org/10.3390/molecules22122179 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Jie
Zheng, Wenping
Qian, Yuhua
Liang, Jiye
A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks
title A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks
title_full A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks
title_fullStr A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks
title_full_unstemmed A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks
title_short A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks
title_sort seed expansion graph clustering method for protein complexes detection in protein interaction networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6150027/
https://www.ncbi.nlm.nih.gov/pubmed/29292776
http://dx.doi.org/10.3390/molecules22122179
work_keys_str_mv AT wangjie aseedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks
AT zhengwenping aseedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks
AT qianyuhua aseedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks
AT liangjiye aseedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks
AT wangjie seedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks
AT zhengwenping seedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks
AT qianyuhua seedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks
AT liangjiye seedexpansiongraphclusteringmethodforproteincomplexesdetectioninproteininteractionnetworks