Data pricing for vertical federated learning: an approach based on data contribution

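Computer science; Stackelberg competition; Data modeling; Construct (Python library); Information privacy; Protection; Data mining; Machine learning; Data science; Artificial intelligence; Database; Computer security; Medicine; Nursing; Mathematics; Mathematical economics; Programming language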
Authors
Zhixian Zhang,Xinchao Li,Shiyou Yang
Identifier
DOI:10.1117/12.2681630
Abstract

Federated learning (FedL) has emerged as a privacy-aware alternative that lets multiple data providers collaborate on model training without accessing the original data. Vertical federated learning (VFedL), an important category of FedL, is primarily used to train a machine learning model on non-uniform data held by different providers. Although VFedL facilitates collaborative training while safeguarding data privacy, incentivizing providers of more valuable data to participate remains a daunting challenge, because practical deployments lack both a scientific data pricing scheme and a precise measure of each participant's data contribution. In this paper, we construct a scientific data pricing method based on each participant's data contribution score to the federated model, so that all data providers can be compensated fairly. First, we construct an accurate method for measuring the data contribution score of each federated participant to the global model, based on Shapley values with Monte Carlo optimization. Then, taking the data contribution score as the input variable, we formulate a Stackelberg-based data pricing game model for VFedL, with the hosts as the leader and the guest as the follower. We further solve the model and analyze the guest's optimal data usage strategy, based on the data contribution score, and the hosts' optimal data pricing strategy. Numerical experiments with the Federated Logistic Regression model show that our method precisely assesses the data contribution score of participants. These findings can also offer management guidance to FedL service providers.
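The abstract does not spell out how the Shapley-based contribution score is computed, so the following is only a minimal sketch of one common approach: Monte Carlo permutation sampling, which averages each participant's marginal gain over randomly ordered coalitions. Everything named here is an assumption for illustration, not the authors' method: the function `monte_carlo_shapley`, the participant names, and the toy value function `toy_value` (a stand-in for the performance of a federated model trained on a coalition's features).

```python
import random
from typing import Callable, Dict, FrozenSet, Sequence

# Hypothetical per-participant performance gains (illustrative numbers only).
BASE_GAIN = {"guest": 0.10, "host_a": 0.05, "host_b": 0.02}
# Hypothetical extra gain when both hosts contribute their features together.
SYNERGY = 0.06


def toy_value(coalition: FrozenSet[str]) -> float:
    """Stand-in for 'model performance when trained with this coalition'."""
    gain = sum(BASE_GAIN[p] for p in coalition)
    if {"host_a", "host_b"} <= coalition:
        gain += SYNERGY
    return gain


def monte_carlo_shapley(
    players: Sequence[str],
    value_fn: Callable[[FrozenSet[str]], float],
    num_permutations: int = 2000,
    seed: int = 0,
) -> Dict[str, float]:
    """Estimate Shapley values by averaging marginal gains over random orderings."""
    rng = random.Random(seed)
    totals = {p: 0.0 for p in players}
    order = list(players)
    for _ in range(num_permutations):
        rng.shuffle(order)
        coalition: set = set()
        previous = value_fn(frozenset(coalition))
        for p in order:
            # Marginal contribution of p when it joins the players before it.
            coalition.add(p)
            current = value_fn(frozenset(coalition))
            totals[p] += current - previous
            previous = current
    return {p: total / num_permutations for p, total in totals.items()}


if __name__ == "__main__":
    scores = monte_carlo_shapley(list(BASE_GAIN), toy_value)
    for name, score in scores.items():
        print(f"{name}: {score:.4f}")
```

In this toy setting the synergy term is shared equally between the two hosts in expectation, so the estimates approach each participant's base gain plus, for the hosts, half of the synergy. In a real VFedL deployment the value function would require retraining or re-evaluating the federated model per coalition, which is why sampling is used in place of enumerating every coalition.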