装箱问题
启发式
计算机科学
容器(类型理论)
强化学习
分类
箱子
包装问题
任务(项目管理)
数学优化
人工智能
算法
工程类
数学
机械工程
系统工程
作者
Ching‐Chang Wong,Tai-Ting Tsai,Can-Kun Ou
出处
期刊:Sensors
[MDPI AG]
日期:2024-08-20
卷期号:24 (16): 5370-5370
被引量:1
摘要
This study proposes a method named Hybrid Heuristic Proximal Policy Optimization (HHPPO) to implement online 3D bin-packing tasks. Some heuristic algorithms for bin-packing and the Proximal Policy Optimization (PPO) algorithm of deep reinforcement learning are integrated to implement this method. In the heuristic algorithms for bin-packing, an extreme point priority sorting method is proposed to sort the generated extreme points according to their waste spaces to improve space utilization. In addition, a 3D grid representation of the space status of the container is used, and some partial support constraints are proposed to increase the possibilities for stacking objects and enhance overall space utilization. In the PPO algorithm, some heuristic algorithms are integrated, and the reward function and the action space of the policy network are designed so that the proposed method can effectively complete the online 3D bin-packing task. Some experimental results illustrate that the proposed method has good results in achieving online 3D bin-packing tasks in some simulation environments. In addition, an environment with image vision is constructed to show that the proposed method indeed enables an actual robot manipulator to successfully and effectively complete the bin-packing task in a real environment.
科研通智能强力驱动
Strongly Powered by AbleSci AI