电导
星团(航天器)
估计员
推论
光谱聚类
分拆(数论)
计算机科学
聚类分析
图划分
数学
拉普拉斯矩阵
图形
理论计算机科学
统计物理学
数据挖掘
物理
组合数学
人工智能
统计
程序设计语言
出处
期刊:Cornell University - arXiv
日期:2021-03-01
被引量:1
标识
DOI:10.48550/arxiv.2103.01470
摘要
Since network data commonly consists of observations from a single large network, researchers often partition the network into clusters in order to apply cluster-robust inference methods. Existing such methods require clusters to be asymptotically independent. Under mild conditions, we prove that, for this requirement to hold for network-dependent data, it is necessary and sufficient that clusters have low conductance, the ratio of edge boundary size to volume. This yields a simple measure of cluster quality. We find in simulations that when clusters have low conductance, cluster-robust methods control size better than HAC estimators. However, for important classes of networks lacking low-conductance clusters, the former can exhibit substantial size distortion. To determine the number of low-conductance clusters and construct them, we draw on results in spectral graph theory that connect conductance to the spectrum of the graph Laplacian. Based on these results, we propose to use the spectrum to determine the number of low-conductance clusters and spectral clustering to construct them.
科研通智能强力驱动
Strongly Powered by AbleSci AI