强化学习
事后诸葛亮
计算机科学
机器人
避障
运动规划
人工智能
机器学习
任务(项目管理)
路径(计算)
人机交互
移动机器人
工程类
系统工程
认知心理学
程序设计语言
心理学
作者
Jiaqi Wang,Huiyan Han,Xie Han,Liqun Kuang,Xiaowen Yang
出处
期刊:Displays
[Elsevier]
日期:2024-09-01
卷期号:84: 102796-102796
标识
DOI:10.1016/j.displa.2024.102796
摘要
Home service robots prioritize cost-effectiveness and convenience over the precision required for industrial tasks like autonomous driving, making their task execution more easily. Meanwhile, path planning tasks using Deep Reinforcement Learning(DRL) are commonly sparse reward problems with limited data utilization, posing challenges in obtaining meaningful rewards during training, consequently resulting in slow or challenging training. In response to these challenges, our paper introduces a lightweight end-to-end path planning algorithm employing with hindsight experience replay(HER). Initially, we optimize the reinforcement learning training process from scratch and map the complex high-dimensional action space and state space to the representative low-dimensional action space. At the same time, we improve the network structure to decouple the model navigation and obstacle avoidance module to meet the requirements of lightweight. Subsequently, we integrate HER and curriculum learning (CL) to tackle issues related to inefficient training. Additionally, we propose a multi-step hindsight experience replay (MS-HER) specifically for the path planning task, markedly enhancing both training efficiency and model generalization across diverse environments. To substantiate the enhanced training efficiency of the refined algorithm, we conducted tests within diverse Gazebo simulation environments. Results of the experiments reveal noteworthy enhancements in critical metrics, including success rate and training efficiency. To further ascertain the enhanced algorithm's generalization capability, we evaluate its performance in some "never-before-seen" simulation environment. Ultimately, we deploy the trained model onto a real lightweight robot for validation. The experimental outcomes indicate the model's competence in successfully executing the path planning task, even on a small robot with constrained computational resources.
科研通智能强力驱动
Strongly Powered by AbleSci AI