启发式
计算机科学
强化学习
可扩展性
仓库
拣选订单
启发式
订单(交换)
适应(眼睛)
人工智能
数据库
业务
营销
操作系统
光学
物理
财务
作者
Aleksandar Krnjaic,J. Hywel Thomas,Georgios Papoudakis,Lukas Schäfer,Peter Börsting,Stefano V. Albrecht
出处
期刊:Cornell University - arXiv
日期:2022-12-22
标识
DOI:10.48550/arxiv.2212.11498
摘要
We envision a warehouse in which dozens of mobile robots and human pickers work together to collect and deliver items within the warehouse. The fundamental problem we tackle, called the order-picking problem, is how these worker agents must coordinate their movement and actions in the warehouse to maximise performance (e.g. order throughput). Established industry methods using heuristic approaches require large engineering efforts to optimise for innately variable warehouse configurations. In contrast, multi-agent reinforcement learning (MARL) can be flexibly applied to diverse warehouse configurations (e.g. size, layout, number/types of workers, item replenishment frequency), as the agents learn through experience how to optimally cooperate with one another. We develop hierarchical MARL algorithms in which a manager assigns goals to worker agents, and the policies of the manager and workers are co-trained toward maximising a global objective (e.g. pick rate). Our hierarchical algorithms achieve significant gains in sample efficiency and overall pick rates over baseline MARL algorithms in diverse warehouse configurations, and substantially outperform two established industry heuristics for order-picking systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI