
A scalable method for discovering significant subnetworks

BACKGROUND: Study of biological networks is an essential first step to understand the complex functions they govern in different organisms. The topology of interactions that define how biological networks operate is often determined through high-throughput experiments. Noisy nature of high-throughpu...


Bibliographic Details
Main Authors:	Hasan, Md Mahmudul; Kavurucu, Yusuf; Kahveci, Tamer
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2013
Subjects:	Research
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854656/ https://www.ncbi.nlm.nih.gov/pubmed/24565174 http://dx.doi.org/10.1186/1752-0509-7-S4-S3

_version_	1782294841270468608
author	Hasan, Md Mahmudul Kavurucu, Yusuf Kahveci, Tamer
author_facet	Hasan, Md Mahmudul Kavurucu, Yusuf Kahveci, Tamer
author_sort	Hasan, Md Mahmudul
collection	PubMed
description	BACKGROUND: Study of biological networks is an essential first step to understand the complex functions they govern in different organisms. The topology of interactions that define how biological networks operate is often determined through high-throughput experiments. Noisy nature of high-throughput experiments, however, can result in multiple alternative network topologies that explain this data equally well. One key step to resolve the differences is to identify the subnetworks which appear significantly more frequently in a biological network data set than expected. METHOD: We present a method named SiS (Significant Subnetworks) to find subnetworks with the largest probability to appear in a collection of biological networks. We define these subnetworks as the most probable subnetworks. SiS summarizes the interactions in the given collection of networks in a special template network. It uses the template network to guide the search for most probable subnetworks. It computes the lower and upper bound scores on how good the potential solutions are (i.e., the number of input networks that contain the subnetwork). As the search continues, it tightens the bound dynamically and prunes a massive number of unpromising solutions in that process. RESULTS AND CONCLUSIONS: Experiments on comprehensive data sets depict that the most probable subnetworks found by SiS in a large collection of networks are also very frequent as well. In metabolic network data set, we found that subnetworks in eukaryote are more conserved than those of prokaryote. SiS also scales well to large data sets and subnetworks and runs orders of magnitude faster than an existing method, MULE. Depending on the size of the subnetwork in the same data set, the running time of SiS ranges from a few seconds to minutes; MULE, on the other hand, runs either for hours or does not even finish in days. In human transcription regulatory network data set, SiS finds a large backbone subnetwork that appears frequently regardless of diverse cell types.
format	Online Article Text
id	pubmed-3854656
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-38546562013-12-16 A scalable method for discovering significant subnetworks Hasan, Md Mahmudul Kavurucu, Yusuf Kahveci, Tamer BMC Syst Biol Research BACKGROUND: Study of biological networks is an essential first step to understand the complex functions they govern in different organisms. The topology of interactions that define how biological networks operate is often determined through high-throughput experiments. Noisy nature of high-throughput experiments, however, can result in multiple alternative network topologies that explain this data equally well. One key step to resolve the differences is to identify the subnetworks which appear significantly more frequently in a biological network data set than expected. METHOD: We present a method named SiS (Significant Subnetworks) to find subnetworks with the largest probability to appear in a collection of biological networks. We define these subnetworks as the most probable subnetworks. SiS summarizes the interactions in the given collection of networks in a special template network. It uses the template network to guide the search for most probable subnetworks. It computes the lower and upper bound scores on how good the potential solutions are (i.e., the number of input networks that contain the subnetwork). As the search continues, it tightens the bound dynamically and prunes a massive number of unpromising solutions in that process. RESULTS AND CONCLUSIONS: Experiments on comprehensive data sets depict that the most probable subnetworks found by SiS in a large collection of networks are also very frequent as well. In metabolic network data set, we found that subnetworks in eukaryote are more conserved than those of prokaryote. SiS also scales well to large data sets and subnetworks and runs orders of magnitude faster than an existing method, MULE. Depending on the size of the subnetwork in the same data set, the running time of SiS ranges from a few seconds to minutes; MULE, on the other hand, runs either for hours or does not even finish in days. In human transcription regulatory network data set, SiS finds a large backbone subnetwork that appears frequently regardless of diverse cell types. BioMed Central 2013-10-23 /pmc/articles/PMC3854656/ /pubmed/24565174 http://dx.doi.org/10.1186/1752-0509-7-S4-S3 Text en Copyright © 2013 Hasan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Hasan, Md Mahmudul Kavurucu, Yusuf Kahveci, Tamer A scalable method for discovering significant subnetworks
title	A scalable method for discovering significant subnetworks
title_full	A scalable method for discovering significant subnetworks
title_fullStr	A scalable method for discovering significant subnetworks
title_full_unstemmed	A scalable method for discovering significant subnetworks
title_short	A scalable method for discovering significant subnetworks
title_sort	scalable method for discovering significant subnetworks
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854656/ https://www.ncbi.nlm.nih.gov/pubmed/24565174 http://dx.doi.org/10.1186/1752-0509-7-S4-S3
work_keys_str_mv	AT hasanmdmahmudul ascalablemethodfordiscoveringsignificantsubnetworks AT kavurucuyusuf ascalablemethodfordiscoveringsignificantsubnetworks AT kahvecitamer ascalablemethodfordiscoveringsignificantsubnetworks AT hasanmdmahmudul scalablemethodfordiscoveringsignificantsubnetworks AT kavurucuyusuf scalablemethodfordiscoveringsignificantsubnetworks AT kahvecitamer scalablemethodfordiscoveringsignificantsubnetworks


Similar Items