Multitask Learning and Bandits via Robust Statistics

计算机科学 估计员 后悔 Lasso(编程语言) 背景(考古学) 自举(财务) 机器学习 嵌入 人工智能 数学 计量经济学 统计 生物 万维网 古生物学
作者
Xu Kan,Hamsa Bastani
出处
期刊:Management Science [Institute for Operations Research and the Management Sciences]
被引量:3
标识
DOI:10.1287/mnsc.2022.00490
摘要

Decision makers often simultaneously face many related but heterogeneous learning problems. For instance, a large retailer may wish to learn product demand at different stores to solve pricing or inventory problems, making it desirable to learn jointly for stores serving similar customers; alternatively, a hospital network may wish to learn patient risk at different providers to allocate personalized interventions, making it desirable to learn jointly for hospitals serving similar patient populations. Motivated by real data sets, we study a natural setting where the unknown parameter in each learning instance can be decomposed into a shared global parameter plus a sparse instance-specific term. We propose a novel two-stage multitask learning estimator that exploits this structure in a sample-efficient way, using a unique combination of robust statistics (to learn across similar instances) and LASSO regression (to debias the results). Our estimator yields improved sample complexity bounds in the feature dimension d relative to commonly employed estimators; this improvement is exponential for “data-poor” instances, which benefit the most from multitask learning. We illustrate the utility of these results for online learning by embedding our multitask estimator within simultaneous contextual bandit algorithms. We specify a dynamic calibration of our estimator to appropriately balance the bias-variance trade-off over time, improving the resulting regret bounds in the context dimension d. Finally, we illustrate the value of our approach on synthetic and real data sets. This paper was accepted by J. George Shanthikumar, data science. Supplemental Material: The online appendix and data files are available at https://doi.org/10.1287/mnsc.2022.00490 .
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
斯文败类应助CV16采纳,获得10
刚刚
刚刚
The发布了新的文献求助10
1秒前
伟大的德玛完成签到,获得积分10
1秒前
Jara应助病毒遗传学采纳,获得10
1秒前
gzhcanadagz发布了新的文献求助10
1秒前
尘埃完成签到,获得积分10
2秒前
li完成签到,获得积分10
2秒前
陆离发布了新的文献求助10
3秒前
4秒前
辛勤的玉米完成签到,获得积分10
4秒前
4秒前
小周发布了新的文献求助10
4秒前
阿格发布了新的文献求助10
4秒前
研友_Z3vemn发布了新的文献求助10
4秒前
叶雨乐发布了新的文献求助10
5秒前
小蘑菇应助鉨汏闫采纳,获得10
5秒前
华仔应助Jason采纳,获得10
5秒前
英吉利25发布了新的文献求助10
5秒前
5秒前
6秒前
6秒前
6秒前
王博完成签到,获得积分10
8秒前
czx完成签到,获得积分20
8秒前
平淡的鸿煊完成签到,获得积分20
8秒前
llltc完成签到 ,获得积分10
8秒前
ding应助坚定小翠采纳,获得10
9秒前
123完成签到 ,获得积分10
9秒前
cc发布了新的文献求助10
10秒前
打屁飞发布了新的文献求助30
12秒前
12秒前
lzgjy完成签到,获得积分10
12秒前
研友_ngkyGn发布了新的文献求助10
12秒前
13秒前
yuyu完成签到,获得积分10
13秒前
14秒前
14秒前
15秒前
研友_Z3vemn完成签到,获得积分10
15秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Cronologia da história de Macau 1600
Decentring Leadership 1000
Lloyd's Register of Shipping's Approach to the Control of Incidents of Brittle Fracture in Ship Structures 1000
BRITTLE FRACTURE IN WELDED SHIPS 1000
Intentional optical interference with precision weapons (in Russian) Преднамеренные оптические помехи высокоточному оружию 1000
Atlas of Anatomy 5th original digital 2025的PDF高清电子版(非压缩版,大小约400-600兆,能更大就更好了) 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 计算机科学 化学工程 生物化学 物理 复合材料 内科学 催化作用 物理化学 光电子学 细胞生物学 基因 电极 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6184643
求助须知:如何正确求助?哪些是违规求助? 8011975
关于积分的说明 16664934
捐赠科研通 5283833
什么是DOI,文献DOI怎么找? 2816664
邀请新用户注册赠送积分活动 1796436
关于科研通互助平台的介绍 1660993