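Computer science
Reinforcement learning
Transfer of learning
Artificial intelligence
Learning classifier system
Error-driven learning
Machine learning
Authors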
Xiaoguang Li,Wan-Ting Ji,Jun Huang
Identifier
DOI:10.1016/j.engappai.2024.108488
Abstract
Similarity-based transfer learning for reinforcement learning has garnered attention for its potential to enhance target task learning. However, it faces significant challenges in efficiency and effectiveness, primarily stemming from sparse rewards, long trajectories, and strict similarity requirements. To address these problems, this paper proposes a local instance-based transfer learning method for reinforcement learning. Instead of relying on sparse rewards and long trajectories, the approach uses the Q values of local trajectories to evaluate similarity, thereby significantly enhancing transfer efficiency. Furthermore, by relaxing the strictness of the similarity requirement, three transfer policies are proposed to facilitate positive transfer. Extensive experimental results demonstrate the effectiveness and efficiency of the proposed method compared with traditional similarity-based transfer learning methods.
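The abstract does not give the similarity measure or the three transfer policies, so the following is only a minimal sketch of the general idea of comparing Q values over a short local window rather than a full trajectory. The function names `local_q_similarity` and `transfer_if_similar`, the tabular Q representation, the `1 / (1 + mean absolute difference)` score, and the threshold value are all assumptions made for illustration, not the authors' method.

```python
import numpy as np


def local_q_similarity(q_source, q_target, window):
    """Hypothetical similarity in (0, 1] between two Q-tables.

    `window` is a short list of (state, action) index pairs, standing in
    for a local trajectory segment. Identical Q values give a score of 1.0.
    """
    diffs = [abs(q_source[s, a] - q_target[s, a]) for s, a in window]
    return 1.0 / (1.0 + float(np.mean(diffs)))


def transfer_if_similar(q_source, q_target, window, threshold=0.5):
    """Hypothetical relaxed transfer rule (an assumption, not from the paper).

    Returns a copy of the target Q-table. If the local similarity reaches
    the threshold, the source Q values for the window are copied in.
    """
    q_new = q_target.copy()
    if local_q_similarity(q_source, q_target, window) >= threshold:
        for s, a in window:
            q_new[s, a] = q_source[s, a]
    return q_new


if __name__ == "__main__":
    # Toy 2-state, 2-action Q-tables invented for this example.
    q_src = np.array([[1.0, 0.0], [0.5, 0.5]])
    q_tgt = np.array([[1.0, 0.0], [0.5, 0.0]])
    segment = [(0, 0), (1, 1)]
    print(local_q_similarity(q_src, q_tgt, segment))
    print(transfer_if_similar(q_src, q_tgt, segment))
```

Lowering `threshold` in this sketch loosely mirrors the idea of relaxing a strict similarity requirement: more source instances qualify for transfer, at the cost of a higher risk of negative transfer.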