计算机科学
货币化
后悔
启发式
功能(生物学)
构造(python库)
大数据
匹配(统计)
任务(项目管理)
机器学习
人工智能
数据挖掘
宏观经济学
程序设计语言
管理
经济
统计
生物
进化生物学
数学
作者
Anish K. Agarwal,Munther A. Dahleh,Tuhin Sarkar
出处
期刊:Economics and Computation
日期:2019-06-17
被引量:83
标识
DOI:10.1145/3328526.3329589
摘要
In this work, we aim to design a data marketplace; a robust real-time matching mechanism to efficiently buy and sell training data for Machine Learning tasks. While the monetization of data and pre-trained models is an essential focus of industry today, there does not exist a market mechanism to price training data and match buyers to sellers while still addressing the associated (computational and other) complexity. The challenge in creating such a market stems from the very nature of data as an asset: (i) it is freely replicable; (ii) its value is inherently combinatorial due to correlation with signal in other data; (iii) prediction tasks and the value of accuracy vary widely; (iv) usefulness of training data is difficult to verify a priori without first applying it to a prediction task. As our main contributions we: (i) propose a mathematical model for a two-sided data market and formally define the key associated challenges; (ii) construct algorithms for such a market to function and analyze how they meet the challenges defined. We highlight two technical contributions: (i) a new notion of fairness required for cooperative games with freely replicable goods; (ii) a truthful, zero regret mechanism to auction a class of combinatorial goods based on utilizing Myerson's payment function and the Multiplicative Weights algorithm. These might be of independent interest.
科研通智能强力驱动
Strongly Powered by AbleSci AI