Cargando…

An Information-Theoretic Bound on p-Values for Detecting Communities Shared between Weighted Labeled Graphs

Extraction of subsets of highly connected nodes (“communities” or modules) is a standard step in the analysis of complex social and biological networks. We here consider the problem of finding a relatively small set of nodes in two labeled weighted graphs that is highly connected in both. While many...

Descripción completa

Detalles Bibliográficos
Autores principales: Obradovic, Predrag, Kovačević, Vladimir, Li, Xiqi, Milosavljevic, Aleksandar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9601153/
https://www.ncbi.nlm.nih.gov/pubmed/37420347
http://dx.doi.org/10.3390/e24101329
_version_ 1784816987179843584
author Obradovic, Predrag
Kovačević, Vladimir
Li, Xiqi
Milosavljevic, Aleksandar
author_facet Obradovic, Predrag
Kovačević, Vladimir
Li, Xiqi
Milosavljevic, Aleksandar
author_sort Obradovic, Predrag
collection PubMed
description Extraction of subsets of highly connected nodes (“communities” or modules) is a standard step in the analysis of complex social and biological networks. We here consider the problem of finding a relatively small set of nodes in two labeled weighted graphs that is highly connected in both. While many scoring functions and algorithms tackle the problem, the typically high computational cost of permutation testing required to establish the p-value for the observed pattern presents a major practical obstacle. To address this problem, we here extend the recently proposed CTD (“Connect the Dots”) approach to establish information-theoretic upper bounds on the p-values and lower bounds on the size and connectedness of communities that are detectable. This is an innovation on the applicability of CTD, broadening its use to pairs of graphs.
format Online
Article
Text
id pubmed-9601153
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96011532022-10-27 An Information-Theoretic Bound on p-Values for Detecting Communities Shared between Weighted Labeled Graphs Obradovic, Predrag Kovačević, Vladimir Li, Xiqi Milosavljevic, Aleksandar Entropy (Basel) Article Extraction of subsets of highly connected nodes (“communities” or modules) is a standard step in the analysis of complex social and biological networks. We here consider the problem of finding a relatively small set of nodes in two labeled weighted graphs that is highly connected in both. While many scoring functions and algorithms tackle the problem, the typically high computational cost of permutation testing required to establish the p-value for the observed pattern presents a major practical obstacle. To address this problem, we here extend the recently proposed CTD (“Connect the Dots”) approach to establish information-theoretic upper bounds on the p-values and lower bounds on the size and connectedness of communities that are detectable. This is an innovation on the applicability of CTD, broadening its use to pairs of graphs. MDPI 2022-09-21 /pmc/articles/PMC9601153/ /pubmed/37420347 http://dx.doi.org/10.3390/e24101329 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Obradovic, Predrag
Kovačević, Vladimir
Li, Xiqi
Milosavljevic, Aleksandar
An Information-Theoretic Bound on p-Values for Detecting Communities Shared between Weighted Labeled Graphs
title An Information-Theoretic Bound on p-Values for Detecting Communities Shared between Weighted Labeled Graphs
title_full An Information-Theoretic Bound on p-Values for Detecting Communities Shared between Weighted Labeled Graphs
title_fullStr An Information-Theoretic Bound on p-Values for Detecting Communities Shared between Weighted Labeled Graphs
title_full_unstemmed An Information-Theoretic Bound on p-Values for Detecting Communities Shared between Weighted Labeled Graphs
title_short An Information-Theoretic Bound on p-Values for Detecting Communities Shared between Weighted Labeled Graphs
title_sort information-theoretic bound on p-values for detecting communities shared between weighted labeled graphs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9601153/
https://www.ncbi.nlm.nih.gov/pubmed/37420347
http://dx.doi.org/10.3390/e24101329
work_keys_str_mv AT obradovicpredrag aninformationtheoreticboundonpvaluesfordetectingcommunitiessharedbetweenweightedlabeledgraphs
AT kovacevicvladimir aninformationtheoreticboundonpvaluesfordetectingcommunitiessharedbetweenweightedlabeledgraphs
AT lixiqi aninformationtheoreticboundonpvaluesfordetectingcommunitiessharedbetweenweightedlabeledgraphs
AT milosavljevicaleksandar aninformationtheoreticboundonpvaluesfordetectingcommunitiessharedbetweenweightedlabeledgraphs
AT obradovicpredrag informationtheoreticboundonpvaluesfordetectingcommunitiessharedbetweenweightedlabeledgraphs
AT kovacevicvladimir informationtheoreticboundonpvaluesfordetectingcommunitiessharedbetweenweightedlabeledgraphs
AT lixiqi informationtheoreticboundonpvaluesfordetectingcommunitiessharedbetweenweightedlabeledgraphs
AT milosavljevicaleksandar informationtheoreticboundonpvaluesfordetectingcommunitiessharedbetweenweightedlabeledgraphs