计算机科学
并行计算
图形
并发
巨量平行
理论计算机科学
大数据
有界函数
负载平衡(电力)
分布式计算
数据挖掘
数学
几何学
数学分析
网格
作者
Lyuheng Yuan,Da Yan,Wenwen Qu,Saugat Adhikari,Jalal Khalil,Cheng Long,Xiaoling Wang
摘要
Finding frequent subgraph patterns in a big graph is an important problem with many applications such as classifying chemical compounds and building indexes to speed up graph queries. Since this problem is NP-hard, some recent parallel systems have been developed to accelerate the mining. However, they often have a huge memory cost, very long running time, suboptimal load balancing, and possibly inaccurate results. In this paper, we propose an efficient system called T-FSM for parallel mining of frequent subgraph patterns in a big graph. T-FSM adopts a novel task-based execution engine design to ensure high concurrency, bounded memory consumption, and effective load balancing. It also supports a new anti-monotonic frequentness measure called Fraction-Score, which is more accurate than the widely used MNI measure. Our experiments show that T-FSM is orders of magnitude faster than SOTA systems for frequent subgraph pattern mining. Our system code has been released at https://github.com/lyuheng/T-FSM.
科研通智能强力驱动
Strongly Powered by AbleSci AI