Peduncle collision-free grasping based on deep reinforcement learning for tomato harvesting robot

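Peduncle (anatomy), Reinforcement learning, Artificial intelligence, Robot, Heuristic, Robot end effector, Computer science, Computer vision, Simulation, Collision, Function (biology), Control theory (sociology), Biology, Botany, Computer security, Control (management), Evolutionary biology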

Authors

Yajun Li,Qingchun Feng,Yifan Zhang,Chuanlang Peng,Yuhang Ma,Cheng Liu,Mengfei Ru,Jiahui Sun,Chunjiang Zhao

Source

Journal: Computers and Electronics in Agriculture [Elsevier]
Date: 2023-12-06 Volume/Issue: 216: 108488-108488 Cited by: 35

Identifier

DOI: 10.1016/j.compag.2023.108488

Abstract

Collision-free grasping of the thin, short peduncles connecting cherry tomato clusters to the main stem is crucial for tomato harvesting robots. Recognizing that the optimal operating posture varies from one peduncle to another, this study proposed a novel peduncle grasping posture decision model using deep reinforcement learning (DRL) for tomato harvesting manipulators, to overcome the collision issue caused by fixed-posture grasping. The model dynamically generates action sequences for the harvesting manipulator, ensuring that the end-effector approaches the peduncle along a collision-free path with the optimal grasping posture. Building upon prior research into the multi-task identification of tomato clusters, peduncles, and the main stem, a keypoint-based spatial pose description model for tomato bunches was devised, from which the optimal operating posture for the end-effector on the peduncle was established. An improved HER-SAC (Soft Actor-Critic with Hindsight Experience Replay) algorithm was then developed to guide the end-effector in collision-free grasping motions. The reward function of this algorithm incorporated end-effector posture constraints obtained from the optimal posture plane. In the training phase, a heuristic strategy model providing prior knowledge was combined with a dynamic gain module to avoid locally optimal policies, which together improved learning efficiency. In simulation, the method improved the peduncle grasping success rate by at least 14 % compared with SAC, HER-DDPG and HER-TD3. In identical scenarios, the improved HER-SAC reached the desired posture with at least 15.5 % fewer steps than the other algorithms. In field experiments conducted in tomato greenhouses, the robot achieved a harvesting success rate of 85.5 %, an increase of 57.3 % and 43.0 % over traditional methods with fixed horizontal and parallel-to-main-stem postures, respectively. The average operation time, from identification to successful harvesting, was 11.42 s. Our findings offer a promising solution for enhancing the efficiency of tomato-harvesting robots.
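
The abstract names Hindsight Experience Replay and a reward that includes end-effector posture constraints, but it does not give the reward formula or the replay details. The following is therefore only a minimal sketch of generic HER goal relabeling with a sparse pose-based reward, not the authors' improved HER-SAC. The function names (`angle_between`, `reward`, `her_relabel`), the tolerances, the episode layout, and the choice of the "future" relabeling strategy are all assumptions made for illustration.

```python
import math
import random


def angle_between(u, v):
    """Angle in radians between two non-zero 3D vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Clamp to guard against rounding slightly outside [-1, 1].
    cos = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.acos(cos)


def reward(achieved, goal, pos_tol=0.01, ang_tol=math.radians(10)):
    """Hypothetical sparse reward: 0 when both position and approach
    direction are within tolerance of the goal pose, otherwise -1."""
    (pos, direction), (goal_pos, goal_dir) = achieved, goal
    pos_ok = math.dist(pos, goal_pos) <= pos_tol
    ang_ok = angle_between(direction, goal_dir) <= ang_tol
    return 0.0 if pos_ok and ang_ok else -1.0


def her_relabel(episode, k=4, rng=random):
    """Generic HER with the 'future' strategy.

    episode: list of dicts with keys 'obs', 'action', 'achieved', 'goal',
    where 'achieved' and 'goal' are (position, direction) pairs.
    Returns the original transitions plus k relabeled copies of each,
    whose goal is a pose actually reached at the same or a later step.
    """
    transitions = []
    for t, step in enumerate(episode):
        transitions.append({**step, "reward": reward(step["achieved"], step["goal"])})
        for _ in range(k):
            future = episode[rng.randrange(t, len(episode))]
            new_goal = future["achieved"]
            transitions.append(
                {**step, "goal": new_goal, "reward": reward(step["achieved"], new_goal)}
            )
    return transitions
```

Relabeling turns failed grasp attempts into successful examples for the poses that were actually reached, which is the usual motivation for pairing HER with an off-policy learner such as SAC when rewards are sparse.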
