后悔
共同价值拍卖
计算机科学
数理经济学
数学优化
经济
微观经济学
人工智能
计量经济学
机器学习
数学
作者
Yanjun Han,Tsachy Weissman,Zhengyuan Zhou
出处
期刊:Operations Research
[Institute for Operations Research and the Management Sciences]
日期:2024-07-08
被引量:4
标识
DOI:10.1287/opre.2020.0282
摘要
In “Optimal No-Regret Learning in Repeated First-Price Auctions,” Y. Han, W. Tsachy, and Z. Zhou study online learning in repeated first-price auctions where a bidder, only observing the winning bid at the end of each auction, learns to adaptively bid to maximize her cumulative payoff. To achieve this goal, the bidder faces censored feedback: If she wins the bid, then she is not able to observe the highest bid of the other bidders, which we assume is i.i.d. drawn from an unknown distribution. In this paper, they develop the first learning algorithm that achieves a near-optimal regret bound, by exploiting two structural properties of first-price auctions, that is, the specific feedback structure and payoff function.
科研通智能强力驱动
Strongly Powered by AbleSci AI