排名(信息检索)
计算机科学
强化学习
推荐系统
选择(遗传算法)
比例(比率)
功能(生物学)
机器学习
人工智能
量子力学
进化生物学
生物
物理
作者
Dingcheng Li,Xu Li,Jun Wang,Ping Li
标识
DOI:10.1145/3397271.3401238
摘要
In this paper, we propose a reinforcement learning based large scale multi-objective ranking system for optimizing short-video recommendation on an industrial video sharing platform. Multiple competing ranking objective and implicit selection bias in user feedback are the main challenges in real-world platform. In order to address those challenges, we integrate multi-gate mixture of experts and soft actor critic into the ranking system. We demonstrated that our proposed framework can greatly reduce the loss function compared with systems only based on single strategies.
科研通智能强力驱动
Strongly Powered by AbleSci AI