差别隐私
计算机科学
匹配(统计)
数据挖掘
算法
数学
统计
作者
Miao Zhang,Xinxin Ye,Hai Deng
标识
DOI:10.1007/978-981-97-0798-0_8
摘要
Differential privacy synthetic data is one of the most effective methods for privacy preserving data release. However, the existing schemes still suffer from high computational complexity and inability to directly handle values of large domain size when synthesizing high-dimensional data. To mitigate this gap, we propose synthetic data generation for differential privacy using maximum weight matching (DPMWM), a method for automatically synthesizing tabular data in high-dimensional large domain size via differential privacy. Specifically, DPMWM uses differential privacy maximum weight matching for low-dimensional marginal selection and then automatically synthesizes multiple records based on the filtered marginals. The experimental results show that DPMWM outperforms the state-of-the-art in terms of accuracy for counting queries and classification tasks on datasets with larger domain size.
科研通智能强力驱动
Strongly Powered by AbleSci AI