Cargando…

SPICi: a fast clustering algorithm for large biological networks

Motivation: Clustering algorithms play an important role in the analysis of biological networks, and can be used to uncover functional modules and obtain hints about cellular organization. While most available clustering algorithms work well on biological networks of moderate size, such as the yeast...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jiang, Peng, Singh, Mona
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2010
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2853685/ https://www.ncbi.nlm.nih.gov/pubmed/20185405 http://dx.doi.org/10.1093/bioinformatics/btq078

_version_	1782180052166770688
author	Jiang, Peng Singh, Mona
author_facet	Jiang, Peng Singh, Mona
author_sort	Jiang, Peng
collection	PubMed
description	Motivation: Clustering algorithms play an important role in the analysis of biological networks, and can be used to uncover functional modules and obtain hints about cellular organization. While most available clustering algorithms work well on biological networks of moderate size, such as the yeast protein physical interaction network, they either fail or are too slow in practice for larger networks, such as functional networks for higher eukaryotes. Since an increasing number of larger biological networks are being determined, the limitations of current clustering approaches curtail the types of biological network analyses that can be performed. Results: We present a fast local network clustering algorithm SPICi. SPICi runs in time O(V log V+E) and space O(E), where V and E are the number of vertices and edges in the network, respectively. We evaluate SPICi's performance on several existing protein interaction networks of varying size, and compare SPICi to nine previous approaches for clustering biological networks. We show that SPICi is typically several orders of magnitude faster than previous approaches and is the only one that can successfully cluster all test networks within very short time. We demonstrate that SPICi has state-of-the-art performance with respect to the quality of the clusters it uncovers, as judged by its ability to recapitulate protein complexes and functional modules. Finally, we demonstrate the power of our fast network clustering algorithm by applying SPICi across hundreds of large context-specific human networks, and identifying modules specific for single conditions. Availability: Source code is available under the GNU Public License at http://compbio.cs.princeton.edu/spici Contact: mona@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format	Text
id	pubmed-2853685
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-28536852010-04-14 SPICi: a fast clustering algorithm for large biological networks Jiang, Peng Singh, Mona Bioinformatics Original Papers Motivation: Clustering algorithms play an important role in the analysis of biological networks, and can be used to uncover functional modules and obtain hints about cellular organization. While most available clustering algorithms work well on biological networks of moderate size, such as the yeast protein physical interaction network, they either fail or are too slow in practice for larger networks, such as functional networks for higher eukaryotes. Since an increasing number of larger biological networks are being determined, the limitations of current clustering approaches curtail the types of biological network analyses that can be performed. Results: We present a fast local network clustering algorithm SPICi. SPICi runs in time O(V log V+E) and space O(E), where V and E are the number of vertices and edges in the network, respectively. We evaluate SPICi's performance on several existing protein interaction networks of varying size, and compare SPICi to nine previous approaches for clustering biological networks. We show that SPICi is typically several orders of magnitude faster than previous approaches and is the only one that can successfully cluster all test networks within very short time. We demonstrate that SPICi has state-of-the-art performance with respect to the quality of the clusters it uncovers, as judged by its ability to recapitulate protein complexes and functional modules. Finally, we demonstrate the power of our fast network clustering algorithm by applying SPICi across hundreds of large context-specific human networks, and identifying modules specific for single conditions. Availability: Source code is available under the GNU Public License at http://compbio.cs.princeton.edu/spici Contact: mona@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-04-15 2010-02-24 /pmc/articles/PMC2853685/ /pubmed/20185405 http://dx.doi.org/10.1093/bioinformatics/btq078 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers Jiang, Peng Singh, Mona SPICi: a fast clustering algorithm for large biological networks
title	SPICi: a fast clustering algorithm for large biological networks
title_full	SPICi: a fast clustering algorithm for large biological networks
title_fullStr	SPICi: a fast clustering algorithm for large biological networks
title_full_unstemmed	SPICi: a fast clustering algorithm for large biological networks
title_short	SPICi: a fast clustering algorithm for large biological networks
title_sort	spici: a fast clustering algorithm for large biological networks
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2853685/ https://www.ncbi.nlm.nih.gov/pubmed/20185405 http://dx.doi.org/10.1093/bioinformatics/btq078
work_keys_str_mv	AT jiangpeng spiciafastclusteringalgorithmforlargebiologicalnetworks AT singhmona spiciafastclusteringalgorithmforlargebiologicalnetworks

SPICi: a fast clustering algorithm for large biological networks

Ejemplares similares