修剪
可扩展性
序列(生物学)
计算机科学
任务(项目管理)
数据挖掘
航程(航空)
上下界
比例(比率)
人工智能
算法
模式识别(心理学)
数学
工程类
地理
数据库
数学分析
遗传学
农学
生物
地图学
系统工程
航空航天工程
作者
Zhenqiang Ye,Ziyang Li,Wenzhong Guo,Wensheng Gan,Shicheng Wan,Jiahui Chen
标识
DOI:10.1007/978-3-031-08530-7_68
摘要
In the real world, ordered sequence data is commonly seen, and sequence analysis plays an important role in a wide range of real applications, such as market basket analysis. The weight concept helps to find more interesting sequences, whereas they may be treated as meaningless patterns in sequential pattern mining. Therefore, how to effectively discover these high weighted sequences from a quantitative sequential database is an urgent task. Based on the remaining weight concept, we propose a novel algorithm called Fast Weighted Sequential Pattern Mining (FWSPM) by utilizing an upper-bound called the remaining sequence maximum weight. Based on this upper-bound, an effective pruning strategy is designed to reduce the search space and save memory cost. Experimental results on both real and synthetic datasets show that the designed FWSPM algorithm is more efficient than the existing algorithms, and also has good scalability on large-scale datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI