计算机科学
散列函数
特征(语言学)
特征哈希
超参数
局部敏感散列
人工智能
哈希表
相似性(几何)
模式识别(心理学)
窗口(计算)
特征向量
机器学习
数据挖掘
图像(数学)
哲学
语言学
计算机安全
双重哈希
操作系统
作者
Pengfei Ma,Youxi Wu,Yan Li,Lei Guo,He Jiang,Xingquan Zhu,Xindong Wu
出处
期刊:ACM Transactions on Knowledge Discovery From Data
[Association for Computing Machinery]
日期:2022-05-04
卷期号:16 (6): 1-24
被引量:15
摘要
As a novel deep learning model, gcForest has been widely used in various applications. However, current multi-grained scanning of gcForest produces many redundant feature vectors, and this increases the time cost of the model. To screen out redundant feature vectors, we introduce a hashing screening mechanism for multi-grained scanning and propose a model called HW-Forest which adopts two strategies: hashing screening and window screening. HW-Forest employs perceptual hashing algorithm to calculate the similarity between feature vectors in hashing screening strategy, which is used to remove the redundant feature vectors produced by multi-grained scanning and can significantly decrease the time cost and memory consumption. Furthermore, we adopt a self-adaptive instance screening strategy called window screening to improve the performance of our approach, which can achieve higher accuracy without hyperparameter tuning on different datasets. Our experimental results show that HW-Forest has higher accuracy than other models, and the time cost is also reduced.
科研通智能强力驱动
Strongly Powered by AbleSci AI