马尔可夫链
计算机科学
数学优化
链条(单位)
马尔可夫决策过程
运筹学
人工智能
马尔可夫过程
机器学习
数学
统计
物理
天文
作者
Shukai Li,Qi Luo,Zhiyuan Huang,Cong Shi
出处
期刊:Operations Research
[Institute for Operations Research and the Management Sciences]
日期:2024-05-15
标识
DOI:10.1287/opre.2022.0693
摘要
Assortment optimization finds many important applications in both brick-and-mortar and online retailing. Decision makers select a subset of products to offer to customers from a universe of substitutable products, based on the assumption that customers purchase according to a Markov chain choice model, which is a very general choice model encompassing many popular models. The existing literature predominantly assumes that the customer arrival process and the Markov chain choice model parameters are given as input to the stochastic optimization model. However, in practice, decision makers may not have this information and must learn them while maximizing the total expected revenue on the fly. In “Online Learning for Constrained Assortment Optimization under the Markov Chain Choice Model,” S. Li, Q. Luo, Z. Huang, and C. Shi developed a series of online learning algorithms for Markov chain choice-based assortment optimization problems with efficiency, as well as provable performance guarantees.
科研通智能强力驱动
Strongly Powered by AbleSci AI