Cargando…

Detecting Overlapping Protein Complexes by Rough-Fuzzy Clustering in Protein-Protein Interaction Networks

In this paper, we present a novel rough-fuzzy clustering (RFC) method to detect overlapping protein complexes in protein-protein interaction (PPI) networks. RFC focuses on fuzzy relation model rather than graph model by integrating fuzzy sets and rough sets, employs the upper and lower approximation...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Hao, Gao, Lin, Dong, Jihua, Yang, Xiaofei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3958373/
https://www.ncbi.nlm.nih.gov/pubmed/24642838
http://dx.doi.org/10.1371/journal.pone.0091856
_version_ 1782307862743089152
author Wu, Hao
Gao, Lin
Dong, Jihua
Yang, Xiaofei
author_facet Wu, Hao
Gao, Lin
Dong, Jihua
Yang, Xiaofei
author_sort Wu, Hao
collection PubMed
description In this paper, we present a novel rough-fuzzy clustering (RFC) method to detect overlapping protein complexes in protein-protein interaction (PPI) networks. RFC focuses on fuzzy relation model rather than graph model by integrating fuzzy sets and rough sets, employs the upper and lower approximations of rough sets to deal with overlapping complexes, and calculates the number of complexes automatically. Fuzzy relation between proteins is established and then transformed into fuzzy equivalence relation. Non-overlapping complexes correspond to equivalence classes satisfying certain equivalence relation. To obtain overlapping complexes, we calculate the similarity between one protein and each complex, and then determine whether the protein belongs to one or multiple complexes by computing the ratio of each similarity to maximum similarity. To validate RFC quantitatively, we test it in Gavin, Collins, Krogan and BioGRID datasets. Experiment results show that there is a good correspondence to reference complexes in MIPS and SGD databases. Then we compare RFC with several previous methods, including ClusterONE, CMC, MCL, GCE, OSLOM and CFinder. Results show the precision, sensitivity and separation are 32.4%, 42.9% and 81.9% higher than mean of the five methods in four weighted networks, and are 0.5%, 11.2% and 66.1% higher than mean of the six methods in five unweighted networks. Our method RFC works well for protein complexes detection and provides a new insight of network division, and it can also be applied to identify overlapping community structure in social networks and LFR benchmark networks.
format Online
Article
Text
id pubmed-3958373
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39583732014-03-24 Detecting Overlapping Protein Complexes by Rough-Fuzzy Clustering in Protein-Protein Interaction Networks Wu, Hao Gao, Lin Dong, Jihua Yang, Xiaofei PLoS One Research Article In this paper, we present a novel rough-fuzzy clustering (RFC) method to detect overlapping protein complexes in protein-protein interaction (PPI) networks. RFC focuses on fuzzy relation model rather than graph model by integrating fuzzy sets and rough sets, employs the upper and lower approximations of rough sets to deal with overlapping complexes, and calculates the number of complexes automatically. Fuzzy relation between proteins is established and then transformed into fuzzy equivalence relation. Non-overlapping complexes correspond to equivalence classes satisfying certain equivalence relation. To obtain overlapping complexes, we calculate the similarity between one protein and each complex, and then determine whether the protein belongs to one or multiple complexes by computing the ratio of each similarity to maximum similarity. To validate RFC quantitatively, we test it in Gavin, Collins, Krogan and BioGRID datasets. Experiment results show that there is a good correspondence to reference complexes in MIPS and SGD databases. Then we compare RFC with several previous methods, including ClusterONE, CMC, MCL, GCE, OSLOM and CFinder. Results show the precision, sensitivity and separation are 32.4%, 42.9% and 81.9% higher than mean of the five methods in four weighted networks, and are 0.5%, 11.2% and 66.1% higher than mean of the six methods in five unweighted networks. Our method RFC works well for protein complexes detection and provides a new insight of network division, and it can also be applied to identify overlapping community structure in social networks and LFR benchmark networks. Public Library of Science 2014-03-18 /pmc/articles/PMC3958373/ /pubmed/24642838 http://dx.doi.org/10.1371/journal.pone.0091856 Text en © 2014 Wu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wu, Hao
Gao, Lin
Dong, Jihua
Yang, Xiaofei
Detecting Overlapping Protein Complexes by Rough-Fuzzy Clustering in Protein-Protein Interaction Networks
title Detecting Overlapping Protein Complexes by Rough-Fuzzy Clustering in Protein-Protein Interaction Networks
title_full Detecting Overlapping Protein Complexes by Rough-Fuzzy Clustering in Protein-Protein Interaction Networks
title_fullStr Detecting Overlapping Protein Complexes by Rough-Fuzzy Clustering in Protein-Protein Interaction Networks
title_full_unstemmed Detecting Overlapping Protein Complexes by Rough-Fuzzy Clustering in Protein-Protein Interaction Networks
title_short Detecting Overlapping Protein Complexes by Rough-Fuzzy Clustering in Protein-Protein Interaction Networks
title_sort detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3958373/
https://www.ncbi.nlm.nih.gov/pubmed/24642838
http://dx.doi.org/10.1371/journal.pone.0091856
work_keys_str_mv AT wuhao detectingoverlappingproteincomplexesbyroughfuzzyclusteringinproteinproteininteractionnetworks
AT gaolin detectingoverlappingproteincomplexesbyroughfuzzyclusteringinproteinproteininteractionnetworks
AT dongjihua detectingoverlappingproteincomplexesbyroughfuzzyclusteringinproteinproteininteractionnetworks
AT yangxiaofei detectingoverlappingproteincomplexesbyroughfuzzyclusteringinproteinproteininteractionnetworks