Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality

计算机科学 数学优化 渐近最优算法 动态定价 放松(心理学) 资源配置 国家(计算机科学) 适应性策略 动态规划 运筹学 数学 经济 微观经济学 算法 社会心理学 考古 历史 计算机网络 心理学
作者
David B. Brown,Jingwei Zhang
出处
期刊:Operations Research [Institute for Operations Research and the Management Sciences]
卷期号:70 (5): 3015-3033 被引量:10
标识
DOI:10.1287/opre.2021.2181
摘要

Allocating Resources Across Systems Coupled by Shared Information Many sequential decision problems involve repeatedly allocating a limited resource across subsystems that are jointly affected by randomly evolving exogenous factors. For example, in adaptive clinical trials, a decision maker needs to allocate patients to treatments in an effort to learn about the efficacy of treatments, but the number of available patients may vary randomly over time. In capital budgeting problems, firms may allocate resources to conduct R&D on new products, but funding budgets may evolve randomly. In many inventory management problems, firms need to allocate limited production capacity to satisfy uncertain demands at multiple locations, and these demands may be correlated due to vagaries in shared market conditions. In this paper, we develop a model involving “shared resources and signals” that captures these and potentially many other applications. The framework is naturally described as a stochastic dynamic program, but this problem is quite difficult to solve. We develop an approximation method based on a “dynamic fluid relaxation”: in this approximation, the subsystem state evolution is approximated by a deterministic fluid model, but the exogenous states (the signals) retain their stochastic evolution. We develop an algorithm for solving the dynamic fluid relaxation. We analyze the corresponding feasible policies and performance bounds from the dynamic fluid relaxation and show that these are asymptotically optimal as the number of subsystems grows large. We show that competing state-of-the-art approaches used in the literature on weakly coupled dynamic programs in general fail to provide asymptotic optimality. Finally, we illustrate the approach on the aforementioned dynamic capital budgeting and multilocation inventory management problems.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
爆米花应助mmmy采纳,获得10
1秒前
1秒前
852发布了新的文献求助20
1秒前
欣喜豌豆发布了新的文献求助10
2秒前
tiantianwang完成签到,获得积分10
3秒前
3秒前
沉静的黎昕完成签到,获得积分10
4秒前
lalaland发布了新的文献求助10
5秒前
哭泣吐司发布了新的文献求助10
6秒前
dlf完成签到,获得积分20
6秒前
852应助xlxlxl采纳,获得10
6秒前
Latti发布了新的文献求助10
6秒前
充电宝应助隐形的从阳采纳,获得30
7秒前
7秒前
8秒前
筱梦完成签到,获得积分10
8秒前
8秒前
luoguixun发布了新的文献求助10
8秒前
第五明月完成签到,获得积分10
9秒前
ZZzz完成签到,获得积分10
10秒前
yami完成签到,获得积分10
10秒前
凌波何处发布了新的文献求助10
11秒前
天天快乐应助任性云朵采纳,获得10
12秒前
浦肯野完成签到,获得积分10
13秒前
liu发布了新的文献求助10
13秒前
13秒前
刘楚卿完成签到,获得积分10
14秒前
大个应助kk采纳,获得10
14秒前
14秒前
ethereal发布了新的文献求助10
14秒前
桢桢树完成签到,获得积分10
15秒前
16秒前
共享精神应助Latti采纳,获得10
17秒前
浦肯野发布了新的文献求助10
17秒前
17秒前
17秒前
lalaland完成签到,获得积分10
18秒前
18秒前
健壮荠完成签到,获得积分10
20秒前
kbb应助咸鱼采纳,获得30
20秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Modern Epidemiology, Fourth Edition 5000
Handbook of pharmaceutical excipients, Ninth edition 5000
Digital Twins of Advanced Materials Processing 2000
Weaponeering, Fourth Edition – Two Volume SET 2000
Polymorphism and polytypism in crystals 1000
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 生物化学 化学工程 物理 计算机科学 复合材料 内科学 催化作用 物理化学 光电子学 电极 冶金 基因 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6023244
求助须知:如何正确求助?哪些是违规求助? 7649440
关于积分的说明 16172418
捐赠科研通 5171739
什么是DOI,文献DOI怎么找? 2767271
邀请新用户注册赠送积分活动 1750619
关于科研通互助平台的介绍 1637138