清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Shrinking the Upper Confidence Bound: A Dynamic Product Selection Problem for Urban Warehouses

后悔 上下界 计算机科学 数学优化 运筹学 产品(数学) 数学 机器学习 几何学 数学分析
作者
Rong Jin,David Simchi‐Levi,Li Wang,Xinshang Wang,Sen Yang
出处
期刊:Management Science [Institute for Operations Research and the Management Sciences]
卷期号:67 (8): 4756-4771 被引量:5
标识
DOI:10.1287/mnsc.2020.3773
摘要

The recent rising popularity of ultrafast delivery services on retail platforms fuels the increasing use of urban warehouses, whose proximity to customers makes fast deliveries viable. The space limit in urban warehouses poses a problem for such online retailers: the number of stock keeping units (SKUs) they carry is no longer “the more, the better,” yet it can still be significantly large, reaching hundreds or thousands in a product category. In this paper, we study algorithms for dynamically selecting a large number of products (i.e., SKUs) with top customer purchase probabilities on the fly, from an ocean of potential products to offer on retailers’ ultrafast delivery platforms. We distill the product selection problem into a semibandit model with linear generalization. There are in total N arms corresponding to N products, each with a feature vector of dimension d. The player pulls K arms in each period and observes the bandit feedback from each of the pulled arms. We focus on the setting where K is much greater than the number of total time periods T or the dimension of product features d. We first analyze a standard Upper Confidence Bound (UCB) algorithm and show its regret bound can be expressed as the sum of a T-independent part and a T-dependent part, which we refer to as “fixed cost” and “variable cost,” respectively. To reduce the fixed cost for large K values, we propose a novel online learning algorithm, which iteratively shrinks the upper confidence bounds within each period, and show its fixed cost is reduced by a factor of d. Moreover, we test the algorithms on an industrial data set from Alibaba Group. Experimental results show that our new algorithm reduces the total regret of the standard UCB algorithm by at least 10%. This paper was accepted by J. George Shanthikumar, big data analytics.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
随心所欲完成签到 ,获得积分10
15秒前
波里舞完成签到 ,获得积分0
30秒前
希望天下0贩的0应助Yuanyuan采纳,获得10
53秒前
1分钟前
Yuanyuan发布了新的文献求助10
1分钟前
冉亦完成签到,获得积分10
1分钟前
Jasper应助科研通管家采纳,获得10
2分钟前
神经完成签到,获得积分10
2分钟前
2分钟前
Radisson发布了新的文献求助10
2分钟前
junjie完成签到 ,获得积分10
3分钟前
Imran完成签到,获得积分10
3分钟前
3分钟前
Radisson完成签到,获得积分10
4分钟前
量子星尘发布了新的文献求助10
4分钟前
星辰大海应助Yuanyuan采纳,获得10
4分钟前
4分钟前
4分钟前
北辰zdx完成签到,获得积分10
4分钟前
4分钟前
Yuanyuan发布了新的文献求助10
4分钟前
北辰zdx发布了新的文献求助10
4分钟前
4分钟前
4分钟前
杨珊珊发布了新的文献求助10
4分钟前
上官若男应助Saikikus0采纳,获得10
4分钟前
Ava应助杨珊珊采纳,获得10
4分钟前
阿洁发布了新的文献求助10
5分钟前
sisi完成签到,获得积分20
5分钟前
5分钟前
orixero应助Kawhichan采纳,获得30
5分钟前
5分钟前
sisi发布了新的文献求助10
5分钟前
Saikikus0发布了新的文献求助10
5分钟前
何小盒发布了新的文献求助20
5分钟前
852应助Yuanyuan采纳,获得10
5分钟前
常有李完成签到,获得积分10
5分钟前
5分钟前
Yuanyuan发布了新的文献求助10
6分钟前
willcrystal完成签到 ,获得积分10
6分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Aerospace Standards Index - 2026 ASIN2026 3000
Polymorphism and polytypism in crystals 1000
Signals, Systems, and Signal Processing 610
Discrete-Time Signals and Systems 610
Research Methods for Business: A Skill Building Approach, 9th Edition 500
Social Work and Social Welfare: An Invitation(7th Edition) 410
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6051085
求助须知:如何正确求助?哪些是违规求助? 7854338
关于积分的说明 16267197
捐赠科研通 5196156
什么是DOI,文献DOI怎么找? 2780496
邀请新用户注册赠送积分活动 1763426
关于科研通互助平台的介绍 1645435