Adaptive optimal process control with actor-critic design for energy-efficient batch machining subject to time-varying tool wear

机械加工刀具磨损元启发式能源消耗过程（计算）机床强化学习能量（信号处理）计算机科学批量生产工程类数学优化控制工程机械工程人工智能数学电气工程操作系统统计

作者

Qinge Xiao,Zhile Yang,Yingfeng Zhang,Pai Zheng

出处

期刊：Journal of Manufacturing Systems [Elsevier]
日期：2023-01-16 卷期号：67: 80-96 被引量：12

标识

DOI：10.1016/j.jmsy.2023.01.005

摘要

Batch machining systems are essential for improving productivity and quality, but they consume considerable amounts of energy due to the continuous interaction with machine tools, workpieces, and cutting tools. In contrast to single-piece machining that has a short production cycle, the tool wear impacts in batch machining systems on energy consumption cannot be underestimated. However, few studies have focused on adaptive process control subject to time-varying tool wear because process optimization has always been previously considered a static problem. As an alternative to metaheuristic algorithms, reinforcement learning (RL) offers an attractive means for solving such a dynamic, high-dimensional, and high-coupling problem. In the case of turning cylindrical parts, an energy-efficient decision model is developed for the process control of pass operations of batch machining. The decision variables are decoupled by reformulating the problem as the Markov decision process, wherein the tool wear experiences dynamic changes. To solve the problem, an actor-critic RL framework with multi-constraint and multi-objective design is developed. Based on the framework, a dynamic process control method is proposed where the RL agent observes workpiece features, machining requirements, and tool wear states (inputs) and adaptively selects the control parameters such as cutting speed, feed rate, and cutting rate (outputs), with the aim to conserve energy. Two application tests and comparisons against metaheuristic methods are performed. The results indicate that the method can further reduce energy by over 20% compared with energy-efficient optimization ignoring tool wear effects. The learning efficiency of RL is about three times faster than that of metaheuristics. The online sampling time is less than 0.1 millisecond, which facilitates real-time control of process parameters.

求助该文献

Adaptive optimal process control with actor-critic design for energy-efficient batch machining subject to time-varying tool wear

今日热心研友