Adaptive optimal process control with actor-critic design for energy-efficient batch machining subject to time-varying tool wear

机械加工 刀具磨损 元启发式 能源消耗 过程(计算) 机床 强化学习 能量(信号处理) 计算机科学 批量生产 工程类 数学优化 控制工程 机械工程 人工智能 数学 电气工程 操作系统 统计
作者
Qinge Xiao,Zhile Yang,Yingfeng Zhang,Pai Zheng
出处
期刊:Journal of Manufacturing Systems [Elsevier BV]
卷期号:67: 80-96 被引量:12
标识
DOI:10.1016/j.jmsy.2023.01.005
摘要

Batch machining systems are essential for improving productivity and quality, but they consume considerable amounts of energy due to the continuous interaction with machine tools, workpieces, and cutting tools. In contrast to single-piece machining that has a short production cycle, the tool wear impacts in batch machining systems on energy consumption cannot be underestimated. However, few studies have focused on adaptive process control subject to time-varying tool wear because process optimization has always been previously considered a static problem. As an alternative to metaheuristic algorithms, reinforcement learning (RL) offers an attractive means for solving such a dynamic, high-dimensional, and high-coupling problem. In the case of turning cylindrical parts, an energy-efficient decision model is developed for the process control of pass operations of batch machining. The decision variables are decoupled by reformulating the problem as the Markov decision process, wherein the tool wear experiences dynamic changes. To solve the problem, an actor-critic RL framework with multi-constraint and multi-objective design is developed. Based on the framework, a dynamic process control method is proposed where the RL agent observes workpiece features, machining requirements, and tool wear states (inputs) and adaptively selects the control parameters such as cutting speed, feed rate, and cutting rate (outputs), with the aim to conserve energy. Two application tests and comparisons against metaheuristic methods are performed. The results indicate that the method can further reduce energy by over 20% compared with energy-efficient optimization ignoring tool wear effects. The learning efficiency of RL is about three times faster than that of metaheuristics. The online sampling time is less than 0.1 millisecond, which facilitates real-time control of process parameters.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
阚乐乐完成签到 ,获得积分10
刚刚
蓝绝完成签到,获得积分10
2秒前
马耳完成签到,获得积分10
2秒前
kongchao008完成签到,获得积分10
4秒前
7秒前
健忘青牛完成签到 ,获得积分10
7秒前
7秒前
乐观的翠琴完成签到 ,获得积分10
9秒前
不知道叫个啥完成签到 ,获得积分10
9秒前
平常的半莲完成签到 ,获得积分10
12秒前
缥缈的雁枫完成签到,获得积分10
12秒前
杨杨杨完成签到 ,获得积分10
12秒前
竹竹发布了新的文献求助30
12秒前
无痕梦完成签到 ,获得积分10
18秒前
梅夕阳完成签到,获得积分10
20秒前
季秋完成签到,获得积分10
20秒前
微雨若,,完成签到 ,获得积分10
20秒前
辛勤冬天应助xh采纳,获得10
20秒前
xue完成签到 ,获得积分10
21秒前
太少拿米完成签到,获得积分10
22秒前
竹竹完成签到,获得积分10
24秒前
25秒前
zhunyun完成签到 ,获得积分10
26秒前
TianFuAI完成签到,获得积分10
26秒前
科研通AI6.1应助研友_LMBAXn采纳,获得10
27秒前
院士完成签到,获得积分10
27秒前
温柔的曼梅完成签到 ,获得积分10
29秒前
WXF完成签到 ,获得积分10
30秒前
贝贝完成签到 ,获得积分10
30秒前
英勇雅琴完成签到 ,获得积分10
32秒前
8R60d8应助科研通管家采纳,获得10
32秒前
8R60d8应助科研通管家采纳,获得10
32秒前
baozeNG完成签到,获得积分10
32秒前
Nexus应助科研通管家采纳,获得10
32秒前
Nexus应助科研通管家采纳,获得10
32秒前
Niko完成签到,获得积分10
34秒前
小宇完成签到,获得积分10
39秒前
40秒前
柔叶完成签到 ,获得积分10
40秒前
嘟嘟豆806完成签到 ,获得积分0
42秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Introduction to Helicopter and Tiltrotor Flight Simulation, Second Edition 2500
卤化钙钛矿人工突触的研究 2000
Моделирование процессов самоорганизации в кристаллообразующих системах 1000
History of U.S. Space Surveillance and Satellite Cataloging 1000
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6508422
求助须知:如何正确求助?哪些是违规求助? 8301411
关于积分的说明 17721814
捐赠科研通 5609198
什么是DOI,文献DOI怎么找? 2921779
邀请新用户注册赠送积分活动 1898969
关于科研通互助平台的介绍 1761581