five

Optimal Estimation of the Number of Network Communities

收藏
DataCite Commons2022-03-17 更新2024-07-29 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Optimal_Estimation_of_the_Number_of_Communities/19102907/2
下载链接
链接失效反馈
官方服务:
资源简介:
In network analysis, how to estimate the number of communities <i>K</i> is a fundamental problem. We consider a broad setting where we allow severe degree heterogeneity and a wide range of sparsity levels, and propose Stepwise Goodness of Fit (StGoF) as a new approach. This is a stepwise algorithm, where for m=1,2,…, we alternately use a community detection step and a goodness of fit (GoF) step. We adapt SCORE Jin for community detection, and propose a new GoF metric. We show that at step <i>m</i>, the GoF metric diverges to ∞ in probability for all <i>m</i> &lt; <i>K</i> and converges to <i>N</i>(0, 1) if <i>m</i> = <i>K</i>. This gives rise to a consistent estimate for <i>K</i>. Also, we discover the right way to define the signal-to-noise ratio (SNR) for our problem and show that consistent estimates for <i>K</i> do not exist if SNR→0, and StGoF is uniformly consistent for <i>K</i> if SNR→∞. Therefore, StGoF achieves the optimal phase transition. Similar stepwise methods are known to face analytical challenges. We overcome the challenges by using a different stepwise scheme in StGoF and by deriving sharp results that are not available before. The key to our analysis is to show that SCORE has the <i>Nonsplitting Property (NSP)</i>. Primarily due to a nontractable rotation of eigenvectors dictated by the Davis–Kahan sin (θ) theorem, the NSP is nontrivial to prove and requires new techniques we develop. Supplementary materials for this article are available online.
提供机构:
Taylor & Francis
创建时间:
2022-03-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作