Cargando…

The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks

Spectral inference on multiple networks is a rapidly-developing subfield of graph statistics. Recent work has demonstrated that joint, or simultaneous, spectral embedding of multiple independent networks can deliver more accurate estimation than individual spectral decompositions of those same netwo...

Descripción completa

Detalles Bibliográficos
Autores principales: Pantazis, Konstantinos, Athreya, Avanti, Arroyo, Jesús, Frost, William N., Hill, Evan S., Lyzinski, Vince
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10465120/
https://www.ncbi.nlm.nih.gov/pubmed/37645242
_version_ 1785098604492357632
author Pantazis, Konstantinos
Athreya, Avanti
Arroyo, Jesús
Frost, William N.
Hill, Evan S.
Lyzinski, Vince
author_facet Pantazis, Konstantinos
Athreya, Avanti
Arroyo, Jesús
Frost, William N.
Hill, Evan S.
Lyzinski, Vince
author_sort Pantazis, Konstantinos
collection PubMed
description Spectral inference on multiple networks is a rapidly-developing subfield of graph statistics. Recent work has demonstrated that joint, or simultaneous, spectral embedding of multiple independent networks can deliver more accurate estimation than individual spectral decompositions of those same networks. Such inference procedures typically rely heavily on independence assumptions across the multiple network realizations, and even in this case, little attention has been paid to the induced network correlation that can be a consequence of such joint embeddings. In this paper, we present a generalized omnibus embedding methodology and we provide a detailed analysis of this embedding across both independent and correlated networks, the latter of which significantly extends the reach of such procedures, and we describe how this omnibus embedding can itself induce correlation. This leads us to distinguish between inherent correlation—that is, the correlation that arises naturally in multisample network data—and induced correlation, which is an artifice of the joint embedding methodology. We show that the generalized omnibus embedding procedure is flexible and robust, and we prove both consistency and a central limit theorem for the embedded points. We examine how induced and inherent correlation can impact inference for network time series data, and we provide network analogues of classical questions such as the effective sample size for more generally correlated data. Further, we show how an appropriately calibrated generalized omnibus embedding can detect changes in real biological networks that previous embedding procedures could not discern, confirming that the effect of inherent and induced correlation can be subtle and transformative. By allowing for and deconstructing both forms of correlation, our methodology widens the scope of spectral techniques for network inference, with import in theory and practice.
format Online
Article
Text
id pubmed-10465120
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-104651202023-08-29 The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks Pantazis, Konstantinos Athreya, Avanti Arroyo, Jesús Frost, William N. Hill, Evan S. Lyzinski, Vince J Mach Learn Res Article Spectral inference on multiple networks is a rapidly-developing subfield of graph statistics. Recent work has demonstrated that joint, or simultaneous, spectral embedding of multiple independent networks can deliver more accurate estimation than individual spectral decompositions of those same networks. Such inference procedures typically rely heavily on independence assumptions across the multiple network realizations, and even in this case, little attention has been paid to the induced network correlation that can be a consequence of such joint embeddings. In this paper, we present a generalized omnibus embedding methodology and we provide a detailed analysis of this embedding across both independent and correlated networks, the latter of which significantly extends the reach of such procedures, and we describe how this omnibus embedding can itself induce correlation. This leads us to distinguish between inherent correlation—that is, the correlation that arises naturally in multisample network data—and induced correlation, which is an artifice of the joint embedding methodology. We show that the generalized omnibus embedding procedure is flexible and robust, and we prove both consistency and a central limit theorem for the embedded points. We examine how induced and inherent correlation can impact inference for network time series data, and we provide network analogues of classical questions such as the effective sample size for more generally correlated data. Further, we show how an appropriately calibrated generalized omnibus embedding can detect changes in real biological networks that previous embedding procedures could not discern, confirming that the effect of inherent and induced correlation can be subtle and transformative. By allowing for and deconstructing both forms of correlation, our methodology widens the scope of spectral techniques for network inference, with import in theory and practice. 2022 /pmc/articles/PMC10465120/ /pubmed/37645242 Text en https://creativecommons.org/licenses/by/4.0/License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v23/20-944.html.
spellingShingle Article
Pantazis, Konstantinos
Athreya, Avanti
Arroyo, Jesús
Frost, William N.
Hill, Evan S.
Lyzinski, Vince
The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks
title The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks
title_full The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks
title_fullStr The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks
title_full_unstemmed The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks
title_short The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks
title_sort importance of being correlated: implications of dependence in joint spectral inference across multiple networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10465120/
https://www.ncbi.nlm.nih.gov/pubmed/37645242
work_keys_str_mv AT pantaziskonstantinos theimportanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT athreyaavanti theimportanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT arroyojesus theimportanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT frostwilliamn theimportanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT hillevans theimportanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT lyzinskivince theimportanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT pantaziskonstantinos importanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT athreyaavanti importanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT arroyojesus importanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT frostwilliamn importanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT hillevans importanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks
AT lyzinskivince importanceofbeingcorrelatedimplicationsofdependenceinjointspectralinferenceacrossmultiplenetworks