发布文献求助

已入深夜，您辛苦了！由于当前在线用户较少，发布求助请尽量完整的填写文献信息，科研通机器人24小时在线，伴您度过漫漫科研夜！祝你早点完成任务，早点休息，好梦！

Reinforcement learning for Hybrid Disassembly Line Balancing Problems

强化学习计算机科学利润（经济学）数学优化产品线生产线比例（比率）工业工程人工智能制造工程数学机械工程工程类经济微观经济学物理量子力学

作者

Jiacun Wang,GuiPeng Xi,Xiwang Guo,Shixin Liu,Shujin Qin,Henry Han

出处

期刊：Neurocomputing [Elsevier]
日期：2023-12-11 卷期号：569: 127145-127145 被引量：4

标识

DOI：10.1016/j.neucom.2023.127145

摘要

With the rapid development of the economy and technology, the rate of product replacement has accelerated, resulting in a large number of products being discarded. Disassembly is an important way to recycle waste products, which is also helpful to reduce manufacturing costs and environmental pollution. The combination of a single-row linear disassembly line and a U-shaped disassembly line presents distinctive advantages within various application scenarios. The Hybrid Disassembly Line Balancing Problem (HDLBP) that considers the requirement of multi-skilled workers is addressed in this paper. A mathematical model is established to maximize the recovery profit according to the characteristics of the proposed problem. To facilitate the search for optimal solution, a new strategy for agents in reinforcement learning to interact with complex and changeable environments in real-time is developed, and deep reinforcement learning is used to complete the distribution of multi-products and disassembly tasks. On this basis, we propose a Soft Actor-Critic (SAC) algorithm to effectively address this problem. Compared with the Deep Deterministic Policy Gradient (DDPG) algorithm, Advantage Actor-Critic (A2C) algorithm, and Proximal Policy Optimization (PPO) algorithm, the results show that the SAC can get the approximate optimal result on small-scale cases. The performance of SAC is also better than DDPG, PPO, and A2C in solving large-scale disassembly cases.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2024年影响因子查询已上线 (2024-6-20)

更新

大幅提高文件上传限制，最高150M (2024-4-1)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: cc发布了新的文献求助10

刚刚; Murphy完成签到，获得积分10

1秒前; 科研通AI2S上传了应助文件

2秒前; 爱静静给chen的求助进行了留言

10秒前; 小潘完成签到，获得积分10

18秒前; 北斗HH完成签到，获得积分10

20秒前; dai完成签到，获得积分10

22秒前; 研友_闾丘枫完成签到，获得积分10

23秒前; Orange上传了应助文件

23秒前; 半夏完成签到，获得积分10

25秒前; 半夏发布了新的文献求助10

29秒前; 嘟嘟嘟嘟完成签到，获得积分10

31秒前; zzzyyy的应助被dai采纳，获得10

32秒前; 追寻的莺完成签到，获得积分10

35秒前; 心灵美语兰完成签到，获得积分10

35秒前; 虚心的惮完成签到，获得积分10

38秒前; feixiangmeng发布了新的文献求助10

40秒前; 林狗完成签到，获得积分10

45秒前; Joseph_sss完成签到，获得积分10

45秒前; xiao完成签到，获得积分10

48秒前; Orange上传了应助文件

48秒前; 给好评完成签到，获得积分20

49秒前; 奋斗怀柔完成签到，获得积分10

49秒前; 给好评发布了新的文献求助10

52秒前; 爱静静上传了应助文件

53秒前; LOT完成签到，获得积分10

54秒前; 小马甲的应助被科研通管家采纳，获得10

1分钟前; eternity136的应助被科研通管家采纳，获得10

1分钟前; Lucas的应助被科研通管家采纳，获得10

1分钟前; 不甜完成签到，获得积分10

1分钟前; 科研通AI2S的应助被YUE采纳，获得10

1分钟前; 猪仔5号完成签到，获得积分10

1分钟前; 小孟要努力完成签到，获得积分10

1分钟前; TY完成签到，获得积分10

1分钟前; CodeCraft上传了应助文件

1分钟前; 科研通AI2.0上传了应助文件

1分钟前; bosslin完成签到，获得积分10

1分钟前; bonhiver完成签到，获得积分10

1分钟前; feixiangmeng完成签到，获得积分10

1分钟前; qiao完成签到，获得积分10

1分钟前

高分求助中: Sustainability in Tides Chemistry 2800; The Young builders of New china : the visit of the delegation of the WFDY to the Chinese People's Republic 1000; Rechtsphilosophie 1000; Bayesian Models of Cognition:Reverse Engineering the Mind 888; Le dégorgement réflexe des Acridiens 800; Defense against predation 800; Very-high-order BVD Schemes Using β-variable THINC Method 568

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3136964; 求助须知：如何正确求助？哪些是违规求助？ 2787951; 关于积分的说明 7783990; 捐赠科研通 2443991; 什么是DOI，文献DOI怎么找？ 1299549; 科研通“疑难数据库（出版商）”最低求助积分说明 625477; 版权声明 600954

今日热心研友

乐乐乐乐乐乐

睡觉晒太阳

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：826996720【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通