Cargando…
How threshold behaviour affects the use of subgraphs for network comparison
Motivation: A wealth of protein–protein interaction (PPI) data has recently become available. These data are organized as PPI networks and an efficient and biologically meaningful method to compare such PPI networks is needed. As a first step, we would like to compare observed networks to establishe...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935432/ https://www.ncbi.nlm.nih.gov/pubmed/20823329 http://dx.doi.org/10.1093/bioinformatics/btq386 |
_version_ | 1782186401677180928 |
---|---|
author | Rito, Tiago Wang, Zi Deane, Charlotte M. Reinert, Gesine |
author_facet | Rito, Tiago Wang, Zi Deane, Charlotte M. Reinert, Gesine |
author_sort | Rito, Tiago |
collection | PubMed |
description | Motivation: A wealth of protein–protein interaction (PPI) data has recently become available. These data are organized as PPI networks and an efficient and biologically meaningful method to compare such PPI networks is needed. As a first step, we would like to compare observed networks to established network models, under the aspect of small subgraph counts, as these are conjectured to relate to functional modules in the PPI network. We employ the software tool GraphCrunch with the Graphlet Degree Distribution Agreement (GDDA) score to examine the use of such counts for network comparison. Results: Our results show that the GDDA score has a pronounced dependency on the number of edges and vertices of the networks being considered. This should be taken into account when testing the fit of models. We provide a method for assessing the statistical significance of the fit between random graph models and biological networks based on non-parametric tests. Using this method we examine the fit of Erdös–Rényi (ER), ER with fixed degree distribution and geometric (3D) models to PPI networks. Under these rigorous tests none of these models fit to the PPI networks. The GDDA score is not stable in the region of graph density relevant to current PPI networks. We hypothesize that this score instability is due to the networks under consideration having a graph density in the threshold region for the appearance of small subgraphs. This is true for both geometric (3D) and ER random graph models. Such threshold behaviour may be linked to the robustness and efficiency properties of the PPI networks. Contact: tiago@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2935432 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-29354322010-09-08 How threshold behaviour affects the use of subgraphs for network comparison Rito, Tiago Wang, Zi Deane, Charlotte M. Reinert, Gesine Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: A wealth of protein–protein interaction (PPI) data has recently become available. These data are organized as PPI networks and an efficient and biologically meaningful method to compare such PPI networks is needed. As a first step, we would like to compare observed networks to established network models, under the aspect of small subgraph counts, as these are conjectured to relate to functional modules in the PPI network. We employ the software tool GraphCrunch with the Graphlet Degree Distribution Agreement (GDDA) score to examine the use of such counts for network comparison. Results: Our results show that the GDDA score has a pronounced dependency on the number of edges and vertices of the networks being considered. This should be taken into account when testing the fit of models. We provide a method for assessing the statistical significance of the fit between random graph models and biological networks based on non-parametric tests. Using this method we examine the fit of Erdös–Rényi (ER), ER with fixed degree distribution and geometric (3D) models to PPI networks. Under these rigorous tests none of these models fit to the PPI networks. The GDDA score is not stable in the region of graph density relevant to current PPI networks. We hypothesize that this score instability is due to the networks under consideration having a graph density in the threshold region for the appearance of small subgraphs. This is true for both geometric (3D) and ER random graph models. Such threshold behaviour may be linked to the robustness and efficiency properties of the PPI networks. Contact: tiago@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935432/ /pubmed/20823329 http://dx.doi.org/10.1093/bioinformatics/btq386 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Rito, Tiago Wang, Zi Deane, Charlotte M. Reinert, Gesine How threshold behaviour affects the use of subgraphs for network comparison |
title | How threshold behaviour affects the use of subgraphs for network comparison |
title_full | How threshold behaviour affects the use of subgraphs for network comparison |
title_fullStr | How threshold behaviour affects the use of subgraphs for network comparison |
title_full_unstemmed | How threshold behaviour affects the use of subgraphs for network comparison |
title_short | How threshold behaviour affects the use of subgraphs for network comparison |
title_sort | how threshold behaviour affects the use of subgraphs for network comparison |
topic | Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935432/ https://www.ncbi.nlm.nih.gov/pubmed/20823329 http://dx.doi.org/10.1093/bioinformatics/btq386 |
work_keys_str_mv | AT ritotiago howthresholdbehaviouraffectstheuseofsubgraphsfornetworkcomparison AT wangzi howthresholdbehaviouraffectstheuseofsubgraphsfornetworkcomparison AT deanecharlottem howthresholdbehaviouraffectstheuseofsubgraphsfornetworkcomparison AT reinertgesine howthresholdbehaviouraffectstheuseofsubgraphsfornetworkcomparison |