马尔可夫决策过程
数学优化
计算机科学
收入
定量配给
出租
过程(计算)
功能(生物学)
运筹学
启发式
持续时间(音乐)
灵敏度(控制系统)
马尔可夫过程
数学
经济
财务
操作系统
文学类
政治学
工程类
艺术
统计
医疗保健
生物
进化生物学
经济增长
法学
电子工程
作者
Xufeng Yang,Wen Jiao,Juliang Zhang,Hong Yan
摘要
This work explores the admission and capacity allocation for a leasing system with two types of equipment and three kinds of batch demands: elementary specified, premium specified, and unspecified demands. The demands arrive following mutually independent Poisson processes, and the rental duration of equipment follows a negative exponential distribution. The lessor can satisfy partially the specified demands with the required type of equipment and satisfy partially the unspecified demands with any type of equipment. The objective is to maximize the expected discounted revenue. We formulate this problem as a Markov decision process, prove the anti‐multimodularity of the value function, and characterize the structure of the optimal policy. We show that the optimal policy has a simple structure and is characterized by state‐dependent rationing and priority thresholds. Moreover, a solution algorithm is proposed to calculate the optimal policy. We study the impacts of the system state on the optimal action and find that the optimal action has limited sensitivity to the system state. Numerical studies are conducted to compare the performance of the optimal policy with that of two heuristic methods and to derive some managerial insights by analysis. We further discuss batch acceptance.
科研通智能强力驱动
Strongly Powered by AbleSci AI