计算机科学
机械臂
机器人
控制(管理)
控制工程
控制系统
控制器(灌溉)
运动学
机器人学
标识
DOI:10.1007/978-981-16-6320-8_7
摘要
The traditional robotic arm control method is based on the precise mathematical model of the task and lacks adaptability. When the environment or task changes, the control effect is greatly compromised or even out of control. In recent years, Deep Reinforcement Learning (DRL), which has achieved great success in games, has been introduced into the control of robotic arms. TD3 (Twin Delayed Deep Deterministic Policy Gradient) is an improved algorithm based on DDPG (Deep Deterministic Policy Gradient). Like other DRL algorithms, TD3 also has the problem of low learning efficiency. This paper proposes a improved TD3 algorithm which can converge faster than TD3 algorithm in terms of reachability and obstacle avoidance. Finally, the improvement of the algorithm is verified by a simulation research on a 6-DOF ABB-IRB1200 robotic arm.
科研通智能强力驱动
Strongly Powered by AbleSci AI