Cargando…
A Useful Criterion on Studying Consistent Estimation in Community Detection
In network analysis, developing a unified theoretical framework that can compare methods under different models is an interesting problem. This paper proposes a partial solution to this problem. We summarize the idea of using a separation condition for a standard network and sharp threshold of the E...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407257/ https://www.ncbi.nlm.nih.gov/pubmed/36010762 http://dx.doi.org/10.3390/e24081098 |
_version_ | 1784774320625549312 |
---|---|
author | Qing, Huan |
author_facet | Qing, Huan |
author_sort | Qing, Huan |
collection | PubMed |
description | In network analysis, developing a unified theoretical framework that can compare methods under different models is an interesting problem. This paper proposes a partial solution to this problem. We summarize the idea of using a separation condition for a standard network and sharp threshold of the Erdös–Rényi random graph to study consistent estimation, and compare theoretical error rates and requirements on the network sparsity of spectral methods under models that can degenerate to a stochastic block model as a four-step criterion SCSTC. Using SCSTC, we find some inconsistent phenomena on separation condition and sharp threshold in community detection. In particular, we find that the original theoretical results of the SPACL algorithm introduced to estimate network memberships under the mixed membership stochastic blockmodel are sub-optimal. To find the formation mechanism of inconsistencies, we re-establish the theoretical convergence rate of this algorithm by applying recent techniques on row-wise eigenvector deviation. The results are further extended to the degree-corrected mixed membership model. By comparison, our results enjoy smaller error rates, lesser dependence on the number of communities, weaker requirements on network sparsity, and so forth. The separation condition and sharp threshold obtained from our theoretical results match the classical results, so the usefulness of this criterion on studying consistent estimation is guaranteed. Numerical results for computer-generated networks support our finding that spectral methods considered in this paper achieve the threshold of separation condition. |
format | Online Article Text |
id | pubmed-9407257 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94072572022-08-26 A Useful Criterion on Studying Consistent Estimation in Community Detection Qing, Huan Entropy (Basel) Article In network analysis, developing a unified theoretical framework that can compare methods under different models is an interesting problem. This paper proposes a partial solution to this problem. We summarize the idea of using a separation condition for a standard network and sharp threshold of the Erdös–Rényi random graph to study consistent estimation, and compare theoretical error rates and requirements on the network sparsity of spectral methods under models that can degenerate to a stochastic block model as a four-step criterion SCSTC. Using SCSTC, we find some inconsistent phenomena on separation condition and sharp threshold in community detection. In particular, we find that the original theoretical results of the SPACL algorithm introduced to estimate network memberships under the mixed membership stochastic blockmodel are sub-optimal. To find the formation mechanism of inconsistencies, we re-establish the theoretical convergence rate of this algorithm by applying recent techniques on row-wise eigenvector deviation. The results are further extended to the degree-corrected mixed membership model. By comparison, our results enjoy smaller error rates, lesser dependence on the number of communities, weaker requirements on network sparsity, and so forth. The separation condition and sharp threshold obtained from our theoretical results match the classical results, so the usefulness of this criterion on studying consistent estimation is guaranteed. Numerical results for computer-generated networks support our finding that spectral methods considered in this paper achieve the threshold of separation condition. MDPI 2022-08-09 /pmc/articles/PMC9407257/ /pubmed/36010762 http://dx.doi.org/10.3390/e24081098 Text en © 2022 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Qing, Huan A Useful Criterion on Studying Consistent Estimation in Community Detection |
title | A Useful Criterion on Studying Consistent Estimation in Community Detection |
title_full | A Useful Criterion on Studying Consistent Estimation in Community Detection |
title_fullStr | A Useful Criterion on Studying Consistent Estimation in Community Detection |
title_full_unstemmed | A Useful Criterion on Studying Consistent Estimation in Community Detection |
title_short | A Useful Criterion on Studying Consistent Estimation in Community Detection |
title_sort | useful criterion on studying consistent estimation in community detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407257/ https://www.ncbi.nlm.nih.gov/pubmed/36010762 http://dx.doi.org/10.3390/e24081098 |
work_keys_str_mv | AT qinghuan ausefulcriteriononstudyingconsistentestimationincommunitydetection AT qinghuan usefulcriteriononstudyingconsistentestimationincommunitydetection |