强化学习
避障
计算机科学
机械臂
任务(项目管理)
避碰
弹道
人工智能
运动规划
适应性
过程(计算)
控制(管理)
障碍物
运动(物理)
运动控制
机器人
控制工程
模拟
移动机器人
工程类
碰撞
计算机安全
政治学
物理
法学
系统工程
操作系统
生态学
生物
天文
作者
Shilin Yang,Qingling Wang
标识
DOI:10.23919/ccc55666.2022.9902722
摘要
The traditional control method of the robotic arm requires the operator to control it through a preset fixed trajectory according to the specific task and combined with the working environment, which requires a high-precision environment model. However, it lacks adaptability and cannot be applied to other working scenarios. This paper proposes an end-to-end robotic arm control method based on DRL(Deep Reinforcement Learning) to overcome the above problems. The strategy control module of the robotic arm uses the PPO(Proximal Policy Optimization) algorithm, so that the robotic arm has the ability to learn independently in a complex working environment and completes the adaptive control. In this paper, the reward shaping method is adopted in the training process of the agent, which accelerates the learning of the agent and makes the algorithm converge faster. The DRL algorithm can converge in a shorter time as shown in experimental results, and it has excellent performance in the motion planning, collision avoidance and overall strategy control of the robotic arm in the simulation environment.
科研通智能强力驱动
Strongly Powered by AbleSci AI