Cargando…

Efficient and accurate greedy search methods for mining functional modules in protein interaction networks

BACKGROUND: Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally dete...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Jieyue, Li, Chaojun, Ye, Baoliu, Zhong, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3314584/
https://www.ncbi.nlm.nih.gov/pubmed/22759424
http://dx.doi.org/10.1186/1471-2105-13-S10-S19
_version_ 1782228106058137600
author He, Jieyue
Li, Chaojun
Ye, Baoliu
Zhong, Wei
author_facet He, Jieyue
Li, Chaojun
Ye, Baoliu
Zhong, Wei
author_sort He, Jieyue
collection PubMed
description BACKGROUND: Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain Core/attachment structures. METHODS: In this paper, a Greedy Search Method based on Core-Attachment structure (GSM-CA) is proposed. The GSM-CA method detects densely connected regions in large protein-protein interaction networks based on the edge weight and two criteria for determining core nodes and attachment nodes. The GSM-CA method improves the prediction accuracy compared to other similar module detection approaches, however it is computationally expensive. Many module detection approaches are based on the traditional hierarchical methods, which is also computationally inefficient because the hierarchical tree structure produced by these approaches cannot provide adequate information to identify whether a network belongs to a module structure or not. In order to speed up the computational process, the Greedy Search Method based on Fast Clustering (GSM-FC) is proposed in this work. The edge weight based GSM-FC method uses a greedy procedure to traverse all edges just once to separate the network into the suitable set of modules. RESULTS: The proposed methods are applied to the protein interaction network of S. cerevisiae. Experimental results indicate that many significant functional modules are detected, most of which match the known complexes. Results also demonstrate that the GSM-FC algorithm is faster and more accurate as compared to other competing algorithms. CONCLUSIONS: Based on the new edge weight definition, the proposed algorithm takes advantages of the greedy search procedure to separate the network into the suitable set of modules. Experimental analysis shows that the identified modules are statistically significant. The algorithm can reduce the computational time significantly while keeping high prediction accuracy.
format Online
Article
Text
id pubmed-3314584
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33145842012-04-02 Efficient and accurate greedy search methods for mining functional modules in protein interaction networks He, Jieyue Li, Chaojun Ye, Baoliu Zhong, Wei BMC Bioinformatics Proceedings BACKGROUND: Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain Core/attachment structures. METHODS: In this paper, a Greedy Search Method based on Core-Attachment structure (GSM-CA) is proposed. The GSM-CA method detects densely connected regions in large protein-protein interaction networks based on the edge weight and two criteria for determining core nodes and attachment nodes. The GSM-CA method improves the prediction accuracy compared to other similar module detection approaches, however it is computationally expensive. Many module detection approaches are based on the traditional hierarchical methods, which is also computationally inefficient because the hierarchical tree structure produced by these approaches cannot provide adequate information to identify whether a network belongs to a module structure or not. In order to speed up the computational process, the Greedy Search Method based on Fast Clustering (GSM-FC) is proposed in this work. The edge weight based GSM-FC method uses a greedy procedure to traverse all edges just once to separate the network into the suitable set of modules. RESULTS: The proposed methods are applied to the protein interaction network of S. cerevisiae. Experimental results indicate that many significant functional modules are detected, most of which match the known complexes. Results also demonstrate that the GSM-FC algorithm is faster and more accurate as compared to other competing algorithms. CONCLUSIONS: Based on the new edge weight definition, the proposed algorithm takes advantages of the greedy search procedure to separate the network into the suitable set of modules. Experimental analysis shows that the identified modules are statistically significant. The algorithm can reduce the computational time significantly while keeping high prediction accuracy. BioMed Central 2012-06-25 /pmc/articles/PMC3314584/ /pubmed/22759424 http://dx.doi.org/10.1186/1471-2105-13-S10-S19 Text en Copyright ©2012 He et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
He, Jieyue
Li, Chaojun
Ye, Baoliu
Zhong, Wei
Efficient and accurate greedy search methods for mining functional modules in protein interaction networks
title Efficient and accurate greedy search methods for mining functional modules in protein interaction networks
title_full Efficient and accurate greedy search methods for mining functional modules in protein interaction networks
title_fullStr Efficient and accurate greedy search methods for mining functional modules in protein interaction networks
title_full_unstemmed Efficient and accurate greedy search methods for mining functional modules in protein interaction networks
title_short Efficient and accurate greedy search methods for mining functional modules in protein interaction networks
title_sort efficient and accurate greedy search methods for mining functional modules in protein interaction networks
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3314584/
https://www.ncbi.nlm.nih.gov/pubmed/22759424
http://dx.doi.org/10.1186/1471-2105-13-S10-S19
work_keys_str_mv AT hejieyue efficientandaccurategreedysearchmethodsforminingfunctionalmodulesinproteininteractionnetworks
AT lichaojun efficientandaccurategreedysearchmethodsforminingfunctionalmodulesinproteininteractionnetworks
AT yebaoliu efficientandaccurategreedysearchmethodsforminingfunctionalmodulesinproteininteractionnetworks
AT zhongwei efficientandaccurategreedysearchmethodsforminingfunctionalmodulesinproteininteractionnetworks