Optimal Policies for Dynamic Pricing and Inventory Control with Nonparametric Censored Demands

后悔上下界非参数统计估计员数学优化数学计算机科学计量经济学数理经济学统计数学分析

作者

Boxiao Chen,Yining Wang,Yuan Zhou

出处

期刊：Management Science [Institute for Operations Research and the Management Sciences]
日期：2023-08-31 卷期号：70 (5): 3362-3380 被引量：21

标识

DOI：10.1287/mnsc.2023.4859

摘要

We study the classic model of joint pricing and inventory control with lost sales over T consecutive review periods. The firm does not know the demand distribution a priori and needs to learn it from historical censored demand data. We develop nonparametric online learning algorithms that converge to the clairvoyant optimal policy at the fastest possible speed. The fundamental challenges rely on that neither zeroth-order nor first-order feedbacks are accessible to the firm and reward at any single price is not observable due to demand censoring. We propose a novel inversion method based on empirical measures to consistently estimate the difference of the instantaneous reward functions at two prices, directly tackling the fundamental challenge brought by censored demands. Based on this technical innovation, we design bisection and trisection search methods that attain an [Formula: see text] regret for the case with concave reward functions, and we design an active tournament elimination method that attains [Formula: see text] regret when the reward functions are nonconcave. We complement the [Formula: see text] regret upper bound with a matching [Formula: see text] regret lower bound. The lower bound is established by a novel information-theoretical argument based on generalized squared Hellinger distance, which is significantly different from conventional arguments that are based on Kullback-Leibler divergence. Both the upper bound technique based on the “difference estimator” and the lower bound technique based on generalized Hellinger distance are new in the literature, and can be potentially applied to solve other inventory or censored demand type problems that involve learning. This paper was accepted by Jeannette Song, operations management. Supplemental Material: The data files and online appendix are available at https://doi.org/10.1287/mnsc.2023.4859 .

求助该文献

最长约 10秒，即可获得该文献文件

Optimal Policies for Dynamic Pricing and Inventory Control with Nonparametric Censored Demands

今日热心研友