Cargando…

Identifying protein complexes based on density and modularity in protein-protein interaction network

BACKGROUND: Identifying protein complexes is crucial to understanding principles of cellular organization and functional mechanisms. As many evidences have indicated that the subgraphs with high density or with high modularity in PPI network usually correspond to protein complexes, protein complexes...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Jun, Wang, Jianxin, Li, Min, Wang, Lusheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854919/
https://www.ncbi.nlm.nih.gov/pubmed/24565048
http://dx.doi.org/10.1186/1752-0509-7-S4-S12
_version_ 1782294876215312384
author Ren, Jun
Wang, Jianxin
Li, Min
Wang, Lusheng
author_facet Ren, Jun
Wang, Jianxin
Li, Min
Wang, Lusheng
author_sort Ren, Jun
collection PubMed
description BACKGROUND: Identifying protein complexes is crucial to understanding principles of cellular organization and functional mechanisms. As many evidences have indicated that the subgraphs with high density or with high modularity in PPI network usually correspond to protein complexes, protein complexes detection methods based on PPI network focused on subgraph's density or its modularity in PPI network. However, dense subgraphs may have low modularity and subgraph with high modularity may have low density, which results that protein complexes may be subgraphs with low modularity or with low density in the PPI network. As the density-based methods are difficult to mine protein complexes with low density, and the modularity-based methods are difficult to mine protein complexes with low modularity, both two methods have limitation for identifying protein complexes with various density and modularity. RESULTS: To identify protein complexes with various density and modularity, including those have low density but high modularity and those have low modularity but high density, we define a novel subgraph's fitness, f(ρ), as f(ρ)= (density)(ρ)*(modularity)(1-ρ), and propose a novel algorithm, named LF_PIN, to identify protein complexes by expanding seed edges to subgraphs with the local maximum fitness value. Experimental results of LF-PIN in S.cerevisiae show that compared with the results of fitness equal to density (ρ = 1) or equal to modularity (ρ = 0), the LF-PIN identifies known protein complexes more effectively when the fitness value is decided by both density and modularity (0<ρ<1). Compared with the results of seven competing protein complex detection methods (CMC, Core-Attachment, CPM, DPClus, HC-PIN, MCL, and NFC) in S.cerevisiae and E.coli, LF-PIN outperforms other seven methods in terms of matching with known complexes and functional enrichment. Moreover, LF-PIN has better performance in identifying protein complexes with low density or with low modularity. CONCLUSIONS: By considering both the density and the modularity, LF-PIN outperforms other protein complexes detection methods that only consider density or modularity, especially in identifying known protein complexes with low density or low modularity.
format Online
Article
Text
id pubmed-3854919
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38549192013-12-16 Identifying protein complexes based on density and modularity in protein-protein interaction network Ren, Jun Wang, Jianxin Li, Min Wang, Lusheng BMC Syst Biol Research BACKGROUND: Identifying protein complexes is crucial to understanding principles of cellular organization and functional mechanisms. As many evidences have indicated that the subgraphs with high density or with high modularity in PPI network usually correspond to protein complexes, protein complexes detection methods based on PPI network focused on subgraph's density or its modularity in PPI network. However, dense subgraphs may have low modularity and subgraph with high modularity may have low density, which results that protein complexes may be subgraphs with low modularity or with low density in the PPI network. As the density-based methods are difficult to mine protein complexes with low density, and the modularity-based methods are difficult to mine protein complexes with low modularity, both two methods have limitation for identifying protein complexes with various density and modularity. RESULTS: To identify protein complexes with various density and modularity, including those have low density but high modularity and those have low modularity but high density, we define a novel subgraph's fitness, f(ρ), as f(ρ)= (density)(ρ)*(modularity)(1-ρ), and propose a novel algorithm, named LF_PIN, to identify protein complexes by expanding seed edges to subgraphs with the local maximum fitness value. Experimental results of LF-PIN in S.cerevisiae show that compared with the results of fitness equal to density (ρ = 1) or equal to modularity (ρ = 0), the LF-PIN identifies known protein complexes more effectively when the fitness value is decided by both density and modularity (0<ρ<1). Compared with the results of seven competing protein complex detection methods (CMC, Core-Attachment, CPM, DPClus, HC-PIN, MCL, and NFC) in S.cerevisiae and E.coli, LF-PIN outperforms other seven methods in terms of matching with known complexes and functional enrichment. Moreover, LF-PIN has better performance in identifying protein complexes with low density or with low modularity. CONCLUSIONS: By considering both the density and the modularity, LF-PIN outperforms other protein complexes detection methods that only consider density or modularity, especially in identifying known protein complexes with low density or low modularity. BioMed Central 2013-10-23 /pmc/articles/PMC3854919/ /pubmed/24565048 http://dx.doi.org/10.1186/1752-0509-7-S4-S12 Text en Copyright © 2013 Ren et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Ren, Jun
Wang, Jianxin
Li, Min
Wang, Lusheng
Identifying protein complexes based on density and modularity in protein-protein interaction network
title Identifying protein complexes based on density and modularity in protein-protein interaction network
title_full Identifying protein complexes based on density and modularity in protein-protein interaction network
title_fullStr Identifying protein complexes based on density and modularity in protein-protein interaction network
title_full_unstemmed Identifying protein complexes based on density and modularity in protein-protein interaction network
title_short Identifying protein complexes based on density and modularity in protein-protein interaction network
title_sort identifying protein complexes based on density and modularity in protein-protein interaction network
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854919/
https://www.ncbi.nlm.nih.gov/pubmed/24565048
http://dx.doi.org/10.1186/1752-0509-7-S4-S12
work_keys_str_mv AT renjun identifyingproteincomplexesbasedondensityandmodularityinproteinproteininteractionnetwork
AT wangjianxin identifyingproteincomplexesbasedondensityandmodularityinproteinproteininteractionnetwork
AT limin identifyingproteincomplexesbasedondensityandmodularityinproteinproteininteractionnetwork
AT wanglusheng identifyingproteincomplexesbasedondensityandmodularityinproteinproteininteractionnetwork