计算机科学
差别隐私
联合学习
原始数据
前提
信息隐私
特征(语言学)
机器学习
数据建模
数据挖掘
大数据
图层(电子)
人工智能
特征选择
医疗保健
数据共享
计算机安全
数据库
医学
语言学
哲学
化学
替代医学
有机化学
病理
经济
程序设计语言
经济增长
作者
Tanzir Ul Islam,Reza Ghasemi,Noman Mohammed
标识
DOI:10.1109/ccwc54503.2022.9720752
摘要
Federated Machine Learning (FL) can be used effectively in distributed datasets, where data owners hesitate to share their raw data, as a reliable approach to train an ML algorithm. However, in the case of sensitive healthcare datasets, additional privacy measures before feeding into machine learning mechanisms are also necessary. Our approach uses the federated learning framework, which removes the necessity of sharing patients' sensitive data in a raw format outside the premise. First, the data owners agree on a list of features selected by the correlation; then, after training the local models, the obtained local models are transmitted to the central server for aggregation. The differential privacy (DP) approach is adopted to perturb the local models before transmission to add an extra privacy layer. As a result, our framework achieves improved utility as the feature selection reduces the data dimension. Finally, based on the patient's genomic data, the framework establishes a practical healthcare application to privacy-predict certain heart failure/cancer diseases. application to predict certain heart failure diseases in a private manner.
科研通智能强力驱动
Strongly Powered by AbleSci AI