后悔
库存控制
计算机科学
审查(临床试验)
上下界
控制(管理)
销售损失
计量经济学
正多边形
经济
运筹学
运营管理
数学
人工智能
数学分析
机器学习
几何学
摘要
We study inventory control involving lost sales and hence censored demand. In a long‐run average framework, the demand distribution is largely unknown. As long as the stationary inventory costs are strictly convex to the extent that the second lost item costs strictly more than the first one, the regret would be [Formula: see text]. Our discrete‐item setting has rendered the presence or absence of strong censoring indicators or equivalently, being knowledgeable or ignorant of one more demand request after the depletion of the inventory, a critical issue and any gradient‐based method designed for the continuous‐item case ineffective. We propose a policy that deliberately orders up to very high levels in designated learning periods and in the remaining doing periods, uses base‐stock levels tailored to near‐empirical distributions formed over the learning periods. A matching [Formula: see text] upper bound can be achieved by this policy. The results can hold even when items are nonperishable. Numerical experiments further illustrate the relative competitiveness of our separate learning‐doing policy.
科研通智能强力驱动
Strongly Powered by AbleSci AI