修剪
计算机科学
概率逻辑
线性子空间
钥匙(锁)
采样(信号处理)
图形
顶点(图论)
算法
维数(图论)
模式识别(心理学)
人工智能
理论计算机科学
数学
组合数学
滤波器(信号处理)
几何学
计算机安全
农学
计算机视觉
生物
作者
Giulia Preti,Gianmarco De Francisci Morales,Matteo Riondato
出处
期刊:ACM Transactions on Intelligent Systems and Technology
[Association for Computing Machinery]
日期:2023-04-13
卷期号:14 (3): 1-29
被引量:1
摘要
We present MaNIACS , a sampling-based randomized algorithm for computing high-quality approximations of the collection of the subgraph patterns that are frequent in a single, large, vertex-labeled graph, according to the Minimum Node Image-based (MNI) frequency measure. The output of MaNIACS comes with strong probabilistic guarantees, obtained by using the empirical Vapnik–Chervonenkis (VC) dimension, a key concept from statistical learning theory, together with strong probabilistic tail bounds on the difference between the frequency of a pattern in the sample and its exact frequency. MaNIACS leverages properties of the MNI-frequency to aggressively prune the pattern search space, and thus to reduce the time spent in exploring subspaces that contain no frequent patterns. In turn, this pruning leads to better bounds to the maximum frequency estimation error, which leads to increased pruning, resulting in a beneficial feedback effect. The results of our experimental evaluation of MaNIACS on real graphs show that it returns high-quality collections of frequent patterns in large graphs up to two orders of magnitude faster than the exact algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI