强化学习
计算机科学
多样性(政治)
推荐系统
特征(语言学)
人工智能
过程(计算)
机器学习
光学(聚焦)
社会学
人类学
语言学
哲学
物理
光学
操作系统
作者
Zihan Wang,Feng Shi,Daling Wang,Kaisong Song,Gang Wu,Yifei Zhang,Han Zhao,Yu Ge
出处
期刊:Research Square - Research Square
日期:2024-08-13
标识
DOI:10.21203/rs.3.rs-4692909/v1
摘要
Abstract Multi-round Conversational Recommendation (MRCR) system assists users in finding the items they need with the fewest dialogue rounds by inquiring about desired features or making tailored recommendations. Numerous models employ single-agent Reinforcement Learning (RL) to accomplish MRCR and improve recommendation accuracy. However, they overlook the diversity of conversational recommendations and primarily focus on popular features or items. It impacts the fair visibility of the items and results in an unbalanced user experience. We propose a diversity-enhanced conversational recommendation model (DECREC), which is built on our proposed multi-agent RL framework. Three agents col-laboratively determine the actions at each round of the MRCR and each agent autonomously explores and learns distinct facets of the task. Compared to a single agent, their collaboration fosters the exploration of a more extensive array of actions to improve diversity. Furthermore, we introduce a dynamic experience replay method that balances long-tail and head data ensuring each learning batch includes long-tail samples, keeping the model attentive to these less common but important data. Moreover, we integrate feature entropy into the feature value estimation process during training to encourage the model to explore a broader spectrum of features, thereby indirectly enhancing the diversity of recommendation results. Extensive experiments on four public datasets demonstrate that DECREC reduces bias in MRCR and achieves optimal recommendation diversity and accuracy. Our code is available at https://github.com/wzhwzhwzh0921/ DECREC.
科研通智能强力驱动
Strongly Powered by AbleSci AI