Cargando…

Modifying the DPClus algorithm for identifying protein complexes based on new topological structures

BACKGROUND: Identification of protein complexes is crucial for understanding principles of cellular organization and functions. As the size of protein-protein interaction set increases, a general trend is to represent the interactions as a network and to develop effective algorithms to detect signif...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Min, Chen, Jian-er, Wang, Jian-xin, Hu, Bin, Chen, Gang
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2570695/
https://www.ncbi.nlm.nih.gov/pubmed/18816408
http://dx.doi.org/10.1186/1471-2105-9-398
_version_ 1782160170044882944
author Li, Min
Chen, Jian-er
Wang, Jian-xin
Hu, Bin
Chen, Gang
author_facet Li, Min
Chen, Jian-er
Wang, Jian-xin
Hu, Bin
Chen, Gang
author_sort Li, Min
collection PubMed
description BACKGROUND: Identification of protein complexes is crucial for understanding principles of cellular organization and functions. As the size of protein-protein interaction set increases, a general trend is to represent the interactions as a network and to develop effective algorithms to detect significant complexes in such networks. RESULTS: Based on the study of known complexes in protein networks, this paper proposes a new topological structure for protein complexes, which is a combination of subgraph diameter (or average vertex distance) and subgraph density. Following the approach of that of the previously proposed clustering algorithm DPClus which expands clusters starting from seeded vertices, we present a clustering algorithm IPCA based on the new topological structure for identifying complexes in large protein interaction networks. The algorithm IPCA is applied to the protein interaction network of Sacchromyces cerevisiae and identifies many well known complexes. Experimental results show that the algorithm IPCA recalls more known complexes than previously proposed clustering algorithms, including DPClus, CFinder, LCMA, MCODE, RNSC and STM. CONCLUSION: The proposed algorithm based on the new topological structure makes it possible to identify dense subgraphs in protein interaction networks, many of which correspond to known protein complexes. The algorithm is robust to the known high rate of false positives and false negatives in data from high-throughout interaction techniques. The program is available at .
format Text
id pubmed-2570695
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25706952008-10-22 Modifying the DPClus algorithm for identifying protein complexes based on new topological structures Li, Min Chen, Jian-er Wang, Jian-xin Hu, Bin Chen, Gang BMC Bioinformatics Methodology Article BACKGROUND: Identification of protein complexes is crucial for understanding principles of cellular organization and functions. As the size of protein-protein interaction set increases, a general trend is to represent the interactions as a network and to develop effective algorithms to detect significant complexes in such networks. RESULTS: Based on the study of known complexes in protein networks, this paper proposes a new topological structure for protein complexes, which is a combination of subgraph diameter (or average vertex distance) and subgraph density. Following the approach of that of the previously proposed clustering algorithm DPClus which expands clusters starting from seeded vertices, we present a clustering algorithm IPCA based on the new topological structure for identifying complexes in large protein interaction networks. The algorithm IPCA is applied to the protein interaction network of Sacchromyces cerevisiae and identifies many well known complexes. Experimental results show that the algorithm IPCA recalls more known complexes than previously proposed clustering algorithms, including DPClus, CFinder, LCMA, MCODE, RNSC and STM. CONCLUSION: The proposed algorithm based on the new topological structure makes it possible to identify dense subgraphs in protein interaction networks, many of which correspond to known protein complexes. The algorithm is robust to the known high rate of false positives and false negatives in data from high-throughout interaction techniques. The program is available at . BioMed Central 2008-09-25 /pmc/articles/PMC2570695/ /pubmed/18816408 http://dx.doi.org/10.1186/1471-2105-9-398 Text en Copyright © 2008 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Li, Min
Chen, Jian-er
Wang, Jian-xin
Hu, Bin
Chen, Gang
Modifying the DPClus algorithm for identifying protein complexes based on new topological structures
title Modifying the DPClus algorithm for identifying protein complexes based on new topological structures
title_full Modifying the DPClus algorithm for identifying protein complexes based on new topological structures
title_fullStr Modifying the DPClus algorithm for identifying protein complexes based on new topological structures
title_full_unstemmed Modifying the DPClus algorithm for identifying protein complexes based on new topological structures
title_short Modifying the DPClus algorithm for identifying protein complexes based on new topological structures
title_sort modifying the dpclus algorithm for identifying protein complexes based on new topological structures
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2570695/
https://www.ncbi.nlm.nih.gov/pubmed/18816408
http://dx.doi.org/10.1186/1471-2105-9-398
work_keys_str_mv AT limin modifyingthedpclusalgorithmforidentifyingproteincomplexesbasedonnewtopologicalstructures
AT chenjianer modifyingthedpclusalgorithmforidentifyingproteincomplexesbasedonnewtopologicalstructures
AT wangjianxin modifyingthedpclusalgorithmforidentifyingproteincomplexesbasedonnewtopologicalstructures
AT hubin modifyingthedpclusalgorithmforidentifyingproteincomplexesbasedonnewtopologicalstructures
AT chengang modifyingthedpclusalgorithmforidentifyingproteincomplexesbasedonnewtopologicalstructures