Cargando…

Graphlet correlation distance to compare small graphs

Graph models are standard for representing mutual relationships between sets of entities. Often, graphs deal with a large number of entities with a small number of connections (e.g. social media relationships, infectious disease spread). The distances or similarities between such large graphs are kn...

Descripción completa

Detalles Bibliográficos
Autores principales: Roux, Jérôme, Bez, Nicolas, Rochet, Paul, Joo, Rocío, Mahévas, Stéphanie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931116/
https://www.ncbi.nlm.nih.gov/pubmed/36791120
http://dx.doi.org/10.1371/journal.pone.0281646
_version_ 1784889176461672448
author Roux, Jérôme
Bez, Nicolas
Rochet, Paul
Joo, Rocío
Mahévas, Stéphanie
author_facet Roux, Jérôme
Bez, Nicolas
Rochet, Paul
Joo, Rocío
Mahévas, Stéphanie
author_sort Roux, Jérôme
collection PubMed
description Graph models are standard for representing mutual relationships between sets of entities. Often, graphs deal with a large number of entities with a small number of connections (e.g. social media relationships, infectious disease spread). The distances or similarities between such large graphs are known to be well established by the Graphlet Correlation Distance (GCD). This paper deals with small graphs (with potentially high densities of connections) that have been somewhat neglected in the literature but that concern important fora like sociology, ecology and fisheries, to mention some examples. First, based on numerical experiments, we study the conditions under which Erdős-Rényi, Fitness Scale-Free, Watts-Strogatz small-world and geometric graphs can be distinguished by a specific GCD measure based on 11 orbits, the GCD(11). This is done with respect to the density and the order (i.e. the number of nodes) of the graphs when comparing graphs with the same and different orders. Second, we develop a randomization statistical test based on the GCD(11) to compare empirical graphs to the four possible null models used in this analysis and apply it to a fishing case study where graphs represent pairwise proximity between fishing vessels. The statistical test rules out independent pairing within the fleet studied which is a standard assumption in fisheries. It also illustrates the difficulty to identify similarities between real-world small graphs and graph models.
format Online
Article
Text
id pubmed-9931116
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-99311162023-02-16 Graphlet correlation distance to compare small graphs Roux, Jérôme Bez, Nicolas Rochet, Paul Joo, Rocío Mahévas, Stéphanie PLoS One Research Article Graph models are standard for representing mutual relationships between sets of entities. Often, graphs deal with a large number of entities with a small number of connections (e.g. social media relationships, infectious disease spread). The distances or similarities between such large graphs are known to be well established by the Graphlet Correlation Distance (GCD). This paper deals with small graphs (with potentially high densities of connections) that have been somewhat neglected in the literature but that concern important fora like sociology, ecology and fisheries, to mention some examples. First, based on numerical experiments, we study the conditions under which Erdős-Rényi, Fitness Scale-Free, Watts-Strogatz small-world and geometric graphs can be distinguished by a specific GCD measure based on 11 orbits, the GCD(11). This is done with respect to the density and the order (i.e. the number of nodes) of the graphs when comparing graphs with the same and different orders. Second, we develop a randomization statistical test based on the GCD(11) to compare empirical graphs to the four possible null models used in this analysis and apply it to a fishing case study where graphs represent pairwise proximity between fishing vessels. The statistical test rules out independent pairing within the fleet studied which is a standard assumption in fisheries. It also illustrates the difficulty to identify similarities between real-world small graphs and graph models. Public Library of Science 2023-02-15 /pmc/articles/PMC9931116/ /pubmed/36791120 http://dx.doi.org/10.1371/journal.pone.0281646 Text en © 2023 Roux et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Roux, Jérôme
Bez, Nicolas
Rochet, Paul
Joo, Rocío
Mahévas, Stéphanie
Graphlet correlation distance to compare small graphs
title Graphlet correlation distance to compare small graphs
title_full Graphlet correlation distance to compare small graphs
title_fullStr Graphlet correlation distance to compare small graphs
title_full_unstemmed Graphlet correlation distance to compare small graphs
title_short Graphlet correlation distance to compare small graphs
title_sort graphlet correlation distance to compare small graphs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931116/
https://www.ncbi.nlm.nih.gov/pubmed/36791120
http://dx.doi.org/10.1371/journal.pone.0281646
work_keys_str_mv AT rouxjerome graphletcorrelationdistancetocomparesmallgraphs
AT beznicolas graphletcorrelationdistancetocomparesmallgraphs
AT rochetpaul graphletcorrelationdistancetocomparesmallgraphs
AT joorocio graphletcorrelationdistancetocomparesmallgraphs
AT mahevasstephanie graphletcorrelationdistancetocomparesmallgraphs