Cargando…

A scalable method for discovering significant subnetworks

BACKGROUND: Study of biological networks is an essential first step to understand the complex functions they govern in different organisms. The topology of interactions that define how biological networks operate is often determined through high-throughput experiments. Noisy nature of high-throughpu...

Descripción completa

Detalles Bibliográficos
Autores principales: Hasan, Md Mahmudul, Kavurucu, Yusuf, Kahveci, Tamer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854656/
https://www.ncbi.nlm.nih.gov/pubmed/24565174
http://dx.doi.org/10.1186/1752-0509-7-S4-S3
_version_ 1782294841270468608
author Hasan, Md Mahmudul
Kavurucu, Yusuf
Kahveci, Tamer
author_facet Hasan, Md Mahmudul
Kavurucu, Yusuf
Kahveci, Tamer
author_sort Hasan, Md Mahmudul
collection PubMed
description BACKGROUND: Study of biological networks is an essential first step to understand the complex functions they govern in different organisms. The topology of interactions that define how biological networks operate is often determined through high-throughput experiments. Noisy nature of high-throughput experiments, however, can result in multiple alternative network topologies that explain this data equally well. One key step to resolve the differences is to identify the subnetworks which appear significantly more frequently in a biological network data set than expected. METHOD: We present a method named SiS (Significant Subnetworks) to find subnetworks with the largest probability to appear in a collection of biological networks. We define these subnetworks as the most probable subnetworks. SiS summarizes the interactions in the given collection of networks in a special template network. It uses the template network to guide the search for most probable subnetworks. It computes the lower and upper bound scores on how good the potential solutions are (i.e., the number of input networks that contain the subnetwork). As the search continues, it tightens the bound dynamically and prunes a massive number of unpromising solutions in that process. RESULTS AND CONCLUSIONS: Experiments on comprehensive data sets depict that the most probable subnetworks found by SiS in a large collection of networks are also very frequent as well. In metabolic network data set, we found that subnetworks in eukaryote are more conserved than those of prokaryote. SiS also scales well to large data sets and subnetworks and runs orders of magnitude faster than an existing method, MULE. Depending on the size of the subnetwork in the same data set, the running time of SiS ranges from a few seconds to minutes; MULE, on the other hand, runs either for hours or does not even finish in days. In human transcription regulatory network data set, SiS finds a large backbone subnetwork that appears frequently regardless of diverse cell types.
format Online
Article
Text
id pubmed-3854656
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38546562013-12-16 A scalable method for discovering significant subnetworks Hasan, Md Mahmudul Kavurucu, Yusuf Kahveci, Tamer BMC Syst Biol Research BACKGROUND: Study of biological networks is an essential first step to understand the complex functions they govern in different organisms. The topology of interactions that define how biological networks operate is often determined through high-throughput experiments. Noisy nature of high-throughput experiments, however, can result in multiple alternative network topologies that explain this data equally well. One key step to resolve the differences is to identify the subnetworks which appear significantly more frequently in a biological network data set than expected. METHOD: We present a method named SiS (Significant Subnetworks) to find subnetworks with the largest probability to appear in a collection of biological networks. We define these subnetworks as the most probable subnetworks. SiS summarizes the interactions in the given collection of networks in a special template network. It uses the template network to guide the search for most probable subnetworks. It computes the lower and upper bound scores on how good the potential solutions are (i.e., the number of input networks that contain the subnetwork). As the search continues, it tightens the bound dynamically and prunes a massive number of unpromising solutions in that process. RESULTS AND CONCLUSIONS: Experiments on comprehensive data sets depict that the most probable subnetworks found by SiS in a large collection of networks are also very frequent as well. In metabolic network data set, we found that subnetworks in eukaryote are more conserved than those of prokaryote. SiS also scales well to large data sets and subnetworks and runs orders of magnitude faster than an existing method, MULE. Depending on the size of the subnetwork in the same data set, the running time of SiS ranges from a few seconds to minutes; MULE, on the other hand, runs either for hours or does not even finish in days. In human transcription regulatory network data set, SiS finds a large backbone subnetwork that appears frequently regardless of diverse cell types. BioMed Central 2013-10-23 /pmc/articles/PMC3854656/ /pubmed/24565174 http://dx.doi.org/10.1186/1752-0509-7-S4-S3 Text en Copyright © 2013 Hasan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Hasan, Md Mahmudul
Kavurucu, Yusuf
Kahveci, Tamer
A scalable method for discovering significant subnetworks
title A scalable method for discovering significant subnetworks
title_full A scalable method for discovering significant subnetworks
title_fullStr A scalable method for discovering significant subnetworks
title_full_unstemmed A scalable method for discovering significant subnetworks
title_short A scalable method for discovering significant subnetworks
title_sort scalable method for discovering significant subnetworks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854656/
https://www.ncbi.nlm.nih.gov/pubmed/24565174
http://dx.doi.org/10.1186/1752-0509-7-S4-S3
work_keys_str_mv AT hasanmdmahmudul ascalablemethodfordiscoveringsignificantsubnetworks
AT kavurucuyusuf ascalablemethodfordiscoveringsignificantsubnetworks
AT kahvecitamer ascalablemethodfordiscoveringsignificantsubnetworks
AT hasanmdmahmudul scalablemethodfordiscoveringsignificantsubnetworks
AT kavurucuyusuf scalablemethodfordiscoveringsignificantsubnetworks
AT kahvecitamer scalablemethodfordiscoveringsignificantsubnetworks