Cargando…

A Useful Criterion on Studying Consistent Estimation in Community Detection

In network analysis, developing a unified theoretical framework that can compare methods under different models is an interesting problem. This paper proposes a partial solution to this problem. We summarize the idea of using a separation condition for a standard network and sharp threshold of the E...

Descripción completa

Detalles Bibliográficos
Autor principal: Qing, Huan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407257/
https://www.ncbi.nlm.nih.gov/pubmed/36010762
http://dx.doi.org/10.3390/e24081098
_version_ 1784774320625549312
author Qing, Huan
author_facet Qing, Huan
author_sort Qing, Huan
collection PubMed
description In network analysis, developing a unified theoretical framework that can compare methods under different models is an interesting problem. This paper proposes a partial solution to this problem. We summarize the idea of using a separation condition for a standard network and sharp threshold of the Erdös–Rényi random graph to study consistent estimation, and compare theoretical error rates and requirements on the network sparsity of spectral methods under models that can degenerate to a stochastic block model as a four-step criterion SCSTC. Using SCSTC, we find some inconsistent phenomena on separation condition and sharp threshold in community detection. In particular, we find that the original theoretical results of the SPACL algorithm introduced to estimate network memberships under the mixed membership stochastic blockmodel are sub-optimal. To find the formation mechanism of inconsistencies, we re-establish the theoretical convergence rate of this algorithm by applying recent techniques on row-wise eigenvector deviation. The results are further extended to the degree-corrected mixed membership model. By comparison, our results enjoy smaller error rates, lesser dependence on the number of communities, weaker requirements on network sparsity, and so forth. The separation condition and sharp threshold obtained from our theoretical results match the classical results, so the usefulness of this criterion on studying consistent estimation is guaranteed. Numerical results for computer-generated networks support our finding that spectral methods considered in this paper achieve the threshold of separation condition.
format Online
Article
Text
id pubmed-9407257
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94072572022-08-26 A Useful Criterion on Studying Consistent Estimation in Community Detection Qing, Huan Entropy (Basel) Article In network analysis, developing a unified theoretical framework that can compare methods under different models is an interesting problem. This paper proposes a partial solution to this problem. We summarize the idea of using a separation condition for a standard network and sharp threshold of the Erdös–Rényi random graph to study consistent estimation, and compare theoretical error rates and requirements on the network sparsity of spectral methods under models that can degenerate to a stochastic block model as a four-step criterion SCSTC. Using SCSTC, we find some inconsistent phenomena on separation condition and sharp threshold in community detection. In particular, we find that the original theoretical results of the SPACL algorithm introduced to estimate network memberships under the mixed membership stochastic blockmodel are sub-optimal. To find the formation mechanism of inconsistencies, we re-establish the theoretical convergence rate of this algorithm by applying recent techniques on row-wise eigenvector deviation. The results are further extended to the degree-corrected mixed membership model. By comparison, our results enjoy smaller error rates, lesser dependence on the number of communities, weaker requirements on network sparsity, and so forth. The separation condition and sharp threshold obtained from our theoretical results match the classical results, so the usefulness of this criterion on studying consistent estimation is guaranteed. Numerical results for computer-generated networks support our finding that spectral methods considered in this paper achieve the threshold of separation condition. MDPI 2022-08-09 /pmc/articles/PMC9407257/ /pubmed/36010762 http://dx.doi.org/10.3390/e24081098 Text en © 2022 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Qing, Huan
A Useful Criterion on Studying Consistent Estimation in Community Detection
title A Useful Criterion on Studying Consistent Estimation in Community Detection
title_full A Useful Criterion on Studying Consistent Estimation in Community Detection
title_fullStr A Useful Criterion on Studying Consistent Estimation in Community Detection
title_full_unstemmed A Useful Criterion on Studying Consistent Estimation in Community Detection
title_short A Useful Criterion on Studying Consistent Estimation in Community Detection
title_sort useful criterion on studying consistent estimation in community detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407257/
https://www.ncbi.nlm.nih.gov/pubmed/36010762
http://dx.doi.org/10.3390/e24081098
work_keys_str_mv AT qinghuan ausefulcriteriononstudyingconsistentestimationincommunitydetection
AT qinghuan usefulcriteriononstudyingconsistentestimationincommunitydetection