Cargando…

Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8

Graphlets are small connected induced subgraphs of a larger graph G. Graphlets are now commonly used to quantify local and global topology of networks in the field. Methods exist to exhaustively enumerate all graphlets (and their orbits) in large networks as efficiently as possible using orbit count...

Descripción completa

Detalles Bibliográficos
Autores principales: Hasan, Adib, Chung, Po-Chien, Hayes, Wayne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5568234/
https://www.ncbi.nlm.nih.gov/pubmed/28832661
http://dx.doi.org/10.1371/journal.pone.0181570
_version_ 1783258819783032832
author Hasan, Adib
Chung, Po-Chien
Hayes, Wayne
author_facet Hasan, Adib
Chung, Po-Chien
Hayes, Wayne
author_sort Hasan, Adib
collection PubMed
description Graphlets are small connected induced subgraphs of a larger graph G. Graphlets are now commonly used to quantify local and global topology of networks in the field. Methods exist to exhaustively enumerate all graphlets (and their orbits) in large networks as efficiently as possible using orbit counting equations. However, the number of graphlets in G is exponential in both the number of nodes and edges in G. Enumerating them all is already unacceptably expensive on existing large networks, and the problem will only get worse as networks continue to grow in size and density. Here we introduce an efficient method designed to aid statistical sampling of graphlets up to size k = 8 from a large network. We define graphettes as the generalization of graphlets allowing for disconnected graphlets. Given a particular (undirected) graphette g, we introduce the idea of the canonical graphette [Image: see text] as a representative member of the isomorphism group Iso(g) of g. We compute the mapping [Image: see text] , in the form of a lookup table, from all 2(k(k − 1)/2) undirected graphettes g of size k ≤ 8 to their canonical representatives [Image: see text] , as well as the permutation that transforms g to [Image: see text] . We also compute all automorphism orbits for each canonical graphette. Thus, given any k ≤ 8 nodes in a graph G, we can in constant time infer which graphette it is, as well as which orbit each of the k nodes belongs to. Sampling a large number N of such k-sets of nodes provides an approximation of both the distribution of graphlets and orbits across G, and the orbit degree vector at each node.
format Online
Article
Text
id pubmed-5568234
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55682342017-09-09 Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8 Hasan, Adib Chung, Po-Chien Hayes, Wayne PLoS One Research Article Graphlets are small connected induced subgraphs of a larger graph G. Graphlets are now commonly used to quantify local and global topology of networks in the field. Methods exist to exhaustively enumerate all graphlets (and their orbits) in large networks as efficiently as possible using orbit counting equations. However, the number of graphlets in G is exponential in both the number of nodes and edges in G. Enumerating them all is already unacceptably expensive on existing large networks, and the problem will only get worse as networks continue to grow in size and density. Here we introduce an efficient method designed to aid statistical sampling of graphlets up to size k = 8 from a large network. We define graphettes as the generalization of graphlets allowing for disconnected graphlets. Given a particular (undirected) graphette g, we introduce the idea of the canonical graphette [Image: see text] as a representative member of the isomorphism group Iso(g) of g. We compute the mapping [Image: see text] , in the form of a lookup table, from all 2(k(k − 1)/2) undirected graphettes g of size k ≤ 8 to their canonical representatives [Image: see text] , as well as the permutation that transforms g to [Image: see text] . We also compute all automorphism orbits for each canonical graphette. Thus, given any k ≤ 8 nodes in a graph G, we can in constant time infer which graphette it is, as well as which orbit each of the k nodes belongs to. Sampling a large number N of such k-sets of nodes provides an approximation of both the distribution of graphlets and orbits across G, and the orbit degree vector at each node. Public Library of Science 2017-08-23 /pmc/articles/PMC5568234/ /pubmed/28832661 http://dx.doi.org/10.1371/journal.pone.0181570 Text en © 2017 Hasan et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hasan, Adib
Chung, Po-Chien
Hayes, Wayne
Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8
title Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8
title_full Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8
title_fullStr Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8
title_full_unstemmed Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8
title_short Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8
title_sort graphettes: constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5568234/
https://www.ncbi.nlm.nih.gov/pubmed/28832661
http://dx.doi.org/10.1371/journal.pone.0181570
work_keys_str_mv AT hasanadib graphettesconstanttimedeterminationofgraphletandorbitidentityincludingpossiblydisconnectedgraphletsuptosize8
AT chungpochien graphettesconstanttimedeterminationofgraphletandorbitidentityincludingpossiblydisconnectedgraphletsuptosize8
AT hayeswayne graphettesconstanttimedeterminationofgraphletandorbitidentityincludingpossiblydisconnectedgraphletsuptosize8