强化学习
计算机科学
利润(经济学)
数学优化
产品线
生产线
比例(比率)
工业工程
人工智能
制造工程
数学
机械工程
工程类
经济
微观经济学
物理
量子力学
作者
Jiacun Wang,GuiPeng Xi,Xiwang Guo,Shixin Liu,Shujin Qin,Henry Han
出处
期刊:Neurocomputing
[Elsevier]
日期:2023-12-11
卷期号:569: 127145-127145
被引量:4
标识
DOI:10.1016/j.neucom.2023.127145
摘要
With the rapid development of the economy and technology, the rate of product replacement has accelerated, resulting in a large number of products being discarded. Disassembly is an important way to recycle waste products, which is also helpful to reduce manufacturing costs and environmental pollution. The combination of a single-row linear disassembly line and a U-shaped disassembly line presents distinctive advantages within various application scenarios. The Hybrid Disassembly Line Balancing Problem (HDLBP) that considers the requirement of multi-skilled workers is addressed in this paper. A mathematical model is established to maximize the recovery profit according to the characteristics of the proposed problem. To facilitate the search for optimal solution, a new strategy for agents in reinforcement learning to interact with complex and changeable environments in real-time is developed, and deep reinforcement learning is used to complete the distribution of multi-products and disassembly tasks. On this basis, we propose a Soft Actor-Critic (SAC) algorithm to effectively address this problem. Compared with the Deep Deterministic Policy Gradient (DDPG) algorithm, Advantage Actor-Critic (A2C) algorithm, and Proximal Policy Optimization (PPO) algorithm, the results show that the SAC can get the approximate optimal result on small-scale cases. The performance of SAC is also better than DDPG, PPO, and A2C in solving large-scale disassembly cases.
科研通智能强力驱动
Strongly Powered by AbleSci AI