Cargando…

RRW: repeated random walks on genome-scale protein networks for local cluster discovery

BACKGROUND: We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW) for discovering functional modules, e.g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of net...

Descripción completa

Detalles Bibliográficos
Autores principales: Macropol, Kathy, Can, Tolga, Singh, Ambuj K
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2748087/
https://www.ncbi.nlm.nih.gov/pubmed/19740439
http://dx.doi.org/10.1186/1471-2105-10-283
_version_ 1782172131069526016
author Macropol, Kathy
Can, Tolga
Singh, Ambuj K
author_facet Macropol, Kathy
Can, Tolga
Singh, Ambuj K
author_sort Macropol, Kathy
collection PubMed
description BACKGROUND: We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW) for discovering functional modules, e.g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of network topology, edge weights, and long range interactions between proteins. RESULTS: We apply the proposed technique on a functional network of yeast genes and accurately identify statistically significant clusters of proteins. We validate the biological significance of the results using known complexes in the MIPS complex catalogue database and well-characterized biological processes. We find that 90% of the created clusters have the majority of their catalogued proteins belonging to the same MIPS complex, and about 80% have the majority of their proteins involved in the same biological process. We compare our method to various other clustering techniques, such as the Markov Clustering Algorithm (MCL), and find a significant improvement in the RRW clusters' precision and accuracy values. CONCLUSION: RRW, which is a technique that exploits the topology of the network, is more precise and robust in finding local clusters. In addition, it has the added flexibility of being able to find multi-functional proteins by allowing overlapping clusters.
format Text
id pubmed-2748087
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27480872009-09-22 RRW: repeated random walks on genome-scale protein networks for local cluster discovery Macropol, Kathy Can, Tolga Singh, Ambuj K BMC Bioinformatics Research Article BACKGROUND: We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW) for discovering functional modules, e.g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of network topology, edge weights, and long range interactions between proteins. RESULTS: We apply the proposed technique on a functional network of yeast genes and accurately identify statistically significant clusters of proteins. We validate the biological significance of the results using known complexes in the MIPS complex catalogue database and well-characterized biological processes. We find that 90% of the created clusters have the majority of their catalogued proteins belonging to the same MIPS complex, and about 80% have the majority of their proteins involved in the same biological process. We compare our method to various other clustering techniques, such as the Markov Clustering Algorithm (MCL), and find a significant improvement in the RRW clusters' precision and accuracy values. CONCLUSION: RRW, which is a technique that exploits the topology of the network, is more precise and robust in finding local clusters. In addition, it has the added flexibility of being able to find multi-functional proteins by allowing overlapping clusters. BioMed Central 2009-09-09 /pmc/articles/PMC2748087/ /pubmed/19740439 http://dx.doi.org/10.1186/1471-2105-10-283 Text en Copyright © 2009 Macropol et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Macropol, Kathy
Can, Tolga
Singh, Ambuj K
RRW: repeated random walks on genome-scale protein networks for local cluster discovery
title RRW: repeated random walks on genome-scale protein networks for local cluster discovery
title_full RRW: repeated random walks on genome-scale protein networks for local cluster discovery
title_fullStr RRW: repeated random walks on genome-scale protein networks for local cluster discovery
title_full_unstemmed RRW: repeated random walks on genome-scale protein networks for local cluster discovery
title_short RRW: repeated random walks on genome-scale protein networks for local cluster discovery
title_sort rrw: repeated random walks on genome-scale protein networks for local cluster discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2748087/
https://www.ncbi.nlm.nih.gov/pubmed/19740439
http://dx.doi.org/10.1186/1471-2105-10-283
work_keys_str_mv AT macropolkathy rrwrepeatedrandomwalksongenomescaleproteinnetworksforlocalclusterdiscovery
AT cantolga rrwrepeatedrandomwalksongenomescaleproteinnetworksforlocalclusterdiscovery
AT singhambujk rrwrepeatedrandomwalksongenomescaleproteinnetworksforlocalclusterdiscovery