Cargando…

How threshold behaviour affects the use of subgraphs for network comparison

Motivation: A wealth of protein–protein interaction (PPI) data has recently become available. These data are organized as PPI networks and an efficient and biologically meaningful method to compare such PPI networks is needed. As a first step, we would like to compare observed networks to establishe...

Descripción completa

Detalles Bibliográficos
Autores principales: Rito, Tiago, Wang, Zi, Deane, Charlotte M., Reinert, Gesine
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935432/
https://www.ncbi.nlm.nih.gov/pubmed/20823329
http://dx.doi.org/10.1093/bioinformatics/btq386
_version_ 1782186401677180928
author Rito, Tiago
Wang, Zi
Deane, Charlotte M.
Reinert, Gesine
author_facet Rito, Tiago
Wang, Zi
Deane, Charlotte M.
Reinert, Gesine
author_sort Rito, Tiago
collection PubMed
description Motivation: A wealth of protein–protein interaction (PPI) data has recently become available. These data are organized as PPI networks and an efficient and biologically meaningful method to compare such PPI networks is needed. As a first step, we would like to compare observed networks to established network models, under the aspect of small subgraph counts, as these are conjectured to relate to functional modules in the PPI network. We employ the software tool GraphCrunch with the Graphlet Degree Distribution Agreement (GDDA) score to examine the use of such counts for network comparison. Results: Our results show that the GDDA score has a pronounced dependency on the number of edges and vertices of the networks being considered. This should be taken into account when testing the fit of models. We provide a method for assessing the statistical significance of the fit between random graph models and biological networks based on non-parametric tests. Using this method we examine the fit of Erdös–Rényi (ER), ER with fixed degree distribution and geometric (3D) models to PPI networks. Under these rigorous tests none of these models fit to the PPI networks. The GDDA score is not stable in the region of graph density relevant to current PPI networks. We hypothesize that this score instability is due to the networks under consideration having a graph density in the threshold region for the appearance of small subgraphs. This is true for both geometric (3D) and ER random graph models. Such threshold behaviour may be linked to the robustness and efficiency properties of the PPI networks. Contact: tiago@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2935432
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29354322010-09-08 How threshold behaviour affects the use of subgraphs for network comparison Rito, Tiago Wang, Zi Deane, Charlotte M. Reinert, Gesine Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: A wealth of protein–protein interaction (PPI) data has recently become available. These data are organized as PPI networks and an efficient and biologically meaningful method to compare such PPI networks is needed. As a first step, we would like to compare observed networks to established network models, under the aspect of small subgraph counts, as these are conjectured to relate to functional modules in the PPI network. We employ the software tool GraphCrunch with the Graphlet Degree Distribution Agreement (GDDA) score to examine the use of such counts for network comparison. Results: Our results show that the GDDA score has a pronounced dependency on the number of edges and vertices of the networks being considered. This should be taken into account when testing the fit of models. We provide a method for assessing the statistical significance of the fit between random graph models and biological networks based on non-parametric tests. Using this method we examine the fit of Erdös–Rényi (ER), ER with fixed degree distribution and geometric (3D) models to PPI networks. Under these rigorous tests none of these models fit to the PPI networks. The GDDA score is not stable in the region of graph density relevant to current PPI networks. We hypothesize that this score instability is due to the networks under consideration having a graph density in the threshold region for the appearance of small subgraphs. This is true for both geometric (3D) and ER random graph models. Such threshold behaviour may be linked to the robustness and efficiency properties of the PPI networks. Contact: tiago@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935432/ /pubmed/20823329 http://dx.doi.org/10.1093/bioinformatics/btq386 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
Rito, Tiago
Wang, Zi
Deane, Charlotte M.
Reinert, Gesine
How threshold behaviour affects the use of subgraphs for network comparison
title How threshold behaviour affects the use of subgraphs for network comparison
title_full How threshold behaviour affects the use of subgraphs for network comparison
title_fullStr How threshold behaviour affects the use of subgraphs for network comparison
title_full_unstemmed How threshold behaviour affects the use of subgraphs for network comparison
title_short How threshold behaviour affects the use of subgraphs for network comparison
title_sort how threshold behaviour affects the use of subgraphs for network comparison
topic Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935432/
https://www.ncbi.nlm.nih.gov/pubmed/20823329
http://dx.doi.org/10.1093/bioinformatics/btq386
work_keys_str_mv AT ritotiago howthresholdbehaviouraffectstheuseofsubgraphsfornetworkcomparison
AT wangzi howthresholdbehaviouraffectstheuseofsubgraphsfornetworkcomparison
AT deanecharlottem howthresholdbehaviouraffectstheuseofsubgraphsfornetworkcomparison
AT reinertgesine howthresholdbehaviouraffectstheuseofsubgraphsfornetworkcomparison