计算机科学
信息隐私
数据建模
扩散
互联网隐私
计算机安全
数据库
物理
热力学
作者
Quanlong Guan,Yanchong Yu,Xiujie Huang,Liangda Fang,Chaobo He,Lusheng Wu,Weiqi Luo,Guanliang Chen
标识
DOI:10.1145/3589335.3651511
摘要
Educational Data Records (EDR) are crucial for capturing teaching behavior and student information, forming the basis for achieving educational intelligence. However, ensuring educational privacy has become a pressing concern, posing practical challenges to the use and sharing of educational data. To address the issue of EDR privacy preserving, we present EduSyn, a privacy data release scheme that utilizes generative diffusion models and differential privacy methods. Specifically, we adopt a diffusion modeling scheme that can be applied to both discrete and continuous types of data to accommodate the data characteristics of EDR, while an invariant Post Randomization (PRAM) perturbation method that satisfies local differential privacy is applied for data attributes that need to be specially protected before model training. We conduct comprehensive validation of this scheme within the domain of education applications, showcasing that EduSyn generates a superior private EDR dataset compared to similar generative methods and strikes a better privacy-utility trade-off.
科研通智能强力驱动
Strongly Powered by AbleSci AI