Reinforcement learning for Hybrid Disassembly Line Balancing Problems

强化学习 计算机科学 利润(经济学) 数学优化 产品线 生产线 比例(比率) 工业工程 人工智能 制造工程 数学 工程类 经济 微观经济学 物理 机械工程 量子力学
作者
Jiacun Wang,GuiPeng Xi,Xiwang Guo,Shixin Liu,Shujin Qin,Henry Han
出处
期刊:Neurocomputing [Elsevier BV]
卷期号:569: 127145-127145 被引量:6
标识
DOI:10.1016/j.neucom.2023.127145
摘要

With the rapid development of the economy and technology, the rate of product replacement has accelerated, resulting in a large number of products being discarded. Disassembly is an important way to recycle waste products, which is also helpful to reduce manufacturing costs and environmental pollution. The combination of a single-row linear disassembly line and a U-shaped disassembly line presents distinctive advantages within various application scenarios. The Hybrid Disassembly Line Balancing Problem (HDLBP) that considers the requirement of multi-skilled workers is addressed in this paper. A mathematical model is established to maximize the recovery profit according to the characteristics of the proposed problem. To facilitate the search for optimal solution, a new strategy for agents in reinforcement learning to interact with complex and changeable environments in real-time is developed, and deep reinforcement learning is used to complete the distribution of multi-products and disassembly tasks. On this basis, we propose a Soft Actor-Critic (SAC) algorithm to effectively address this problem. Compared with the Deep Deterministic Policy Gradient (DDPG) algorithm, Advantage Actor-Critic (A2C) algorithm, and Proximal Policy Optimization (PPO) algorithm, the results show that the SAC can get the approximate optimal result on small-scale cases. The performance of SAC is also better than DDPG, PPO, and A2C in solving large-scale disassembly cases.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
刚刚
1秒前
1秒前
1秒前
兴奋奇异果完成签到,获得积分10
2秒前
2秒前
3秒前
科研通AI6.2应助顺顺顺采纳,获得10
4秒前
4秒前
天真的梦容完成签到 ,获得积分10
4秒前
LLLLLispector发布了新的文献求助10
5秒前
5秒前
yydy完成签到,获得积分10
6秒前
安安发布了新的文献求助10
6秒前
6秒前
7秒前
7秒前
科研通AI6.3应助灿澈采纳,获得10
7秒前
8秒前
dongtan发布了新的文献求助10
9秒前
yang完成签到,获得积分10
9秒前
一一发布了新的文献求助10
9秒前
10秒前
11秒前
tq发布了新的文献求助10
11秒前
hql_sdu发布了新的文献求助10
12秒前
12秒前
七喜完成签到,获得积分10
12秒前
不会画画发布了新的文献求助10
13秒前
Orange应助zzzzlll采纳,获得10
14秒前
15秒前
魏笑白发布了新的文献求助10
16秒前
蜗壳ccc发布了新的文献求助10
16秒前
tq完成签到,获得积分10
18秒前
Lucas应助NATIESNAFTANG采纳,获得10
18秒前
18秒前
活力的之双完成签到,获得积分20
19秒前
19秒前
21秒前
高分求助中
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Bounds for Statistical Estimation in Semiparametric Models 500
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6466993
求助须知:如何正确求助?哪些是违规求助? 8273199
关于积分的说明 17640227
捐赠科研通 5542187
什么是DOI,文献DOI怎么找? 2908098
邀请新用户注册赠送积分活动 1885061
关于科研通互助平台的介绍 1733378