计算机科学
数据挖掘
时间戳
数据流
差别隐私
数据流挖掘
统计的
滑动窗口协议
出版
窗口(计算)
计算机安全
电信
统计
数学
广告
业务
操作系统
作者
Xuebin Ma,Shengyi Guan,Yanan Lang
标识
DOI:10.1109/trustcom60117.2023.00124
摘要
As the most common technique for mining and analyzing massive data, frequent itemset mining is widely used in various scenarios. However, when the data contains sensitive information, it will bring serious privacy leakage risks to mining and publishing it directly. Therefore, how to efficiently mine frequent itemset without privacy disclosure is a hot issue at present. In data stream, because the data between adjacent timestamps possess certain correlation, which makes it easier to leak privacy for frequent itemset mining in the data stream, and considering that frequent itemset will have combinatorial explosion problem in data stream, frequent closed itemset mining in the data stream will be a better choice. In this paper, we propose a differential privacy algorithm for mining frequent closed itemset in data stream, which is referred to as DPES. The algorithm uses a vertical mining method in each sliding window, which can quickly obtain the frequent closed itemset contained in the current window. Besides, to improve the data utility, we also design an adaptive privacy budget allocation strategy by computing the difference to decide whether the current timestamp should publish a low-noise statistic or an approximate statistic. Finally, we demonstrate that the DPES algorithm satisfies ε-difference privacy by privacy analysis, and the experimental results on several real datasets also show the effectiveness of the DPES algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI