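Divergence (linguistics)
Collaborative filtering
Similarity (geometry)
Kullback-Leibler divergence
Similarity measure
Computer science
Measure (data warehouse)
Data mining
Flexibility (engineering)
Constraint (computer-aided design)
Artificial intelligence
Mathematics
Information retrieval
Recommender system
Statistics
Image (mathematics)
Geometry
Philosophy
Linguistics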
Authors
Jiangzhou Deng,Yong Wang,Junpeng Guo,Yongheng Deng,Jerry Gao,Younghee Park
Identifier
DOI:10.1177/0165551518808188
Abstract
In neighbourhood-based collaborative filtering (CF) algorithms, a user similarity measure is used to find other users similar to an active user. Most existing user similarity measures rely on co-rated items. However, sparse datasets contain too few co-rated items, which usually leads to poor prediction. In this article, a new similarity scheme is proposed that breaks free of the co-rated-item constraint. Moreover, an item similarity measure based on the Kullback–Leibler (KL) divergence is presented, which identifies the relation between items from the probability density distribution of their ratings. Because the KL-based item similarity makes full use of all ratings, it is more flexible on sparse datasets. A CF algorithm using the proposed similarity scheme is implemented and compared with several classic CF algorithms. The comparison results show that CF using the proposed similarity has better predictive performance. The similarity scheme is therefore a good solution to the sparsity problem and has great potential for application in recommendation systems.
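The idea of comparing items through the distribution of their ratings, rather than through co-rated entries, can be illustrated with a minimal sketch. This is not the authors' formula, which the abstract does not give. The sketch assumes a discrete 1 to 5 rating scale, add-one smoothing so that every rating value has non-zero probability, a symmetrised KL divergence, and a 1 / (1 + D) mapping from divergence to similarity. The names `rating_distribution`, `kl_divergence` and `item_similarity` are hypothetical.

```python
import math
from collections import Counter

RATING_SCALE = (1, 2, 3, 4, 5)  # assumption: discrete 1-5 star ratings


def rating_distribution(ratings, scale=RATING_SCALE, alpha=1.0):
    """Estimate the probability of each rating value for one item.

    alpha is an additive smoothing constant (an assumption of this sketch)
    that keeps every probability strictly positive.
    """
    counts = Counter(ratings)
    total = len(ratings) + alpha * len(scale)
    return [(counts.get(r, 0) + alpha) / total for r in scale]


def kl_divergence(p, q):
    """KL divergence D(p || q) for two discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


def item_similarity(ratings_i, ratings_j):
    """Map a symmetrised KL divergence to a similarity in (0, 1].

    The two rating lists may come from entirely different users, so no
    co-rated entries are required.
    """
    p = rating_distribution(ratings_i)
    q = rating_distribution(ratings_j)
    divergence = 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))
    return 1.0 / (1.0 + divergence)


if __name__ == "__main__":
    well_liked = [5, 5, 4]
    also_liked = [4, 5, 5, 3]
    disliked = [1, 1, 2]
    print(item_similarity(well_liked, also_liked))
    print(item_similarity(well_liked, disliked))
```

In this sketch, two items with similar rating profiles score higher than two items with opposite profiles, even when no user has rated both, which is the property the abstract attributes to the KL-based measure on sparse datasets.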