A Deep Reinforcement Learning Approach to Improve the Learning Performance in Process Control

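Reinforcement learning; Computer science; PID controller; Adaptability; Process (computing); Controller (irrigation); Artificial intelligence; Nonlinear system; Control theory (sociology); Bellman equation; State (computer science); Temporal difference learning; Control (management); Machine learning; Control engineering; Mathematical optimization; Algorithm; Mathematics; Engineering; Physics; Operating system; Biology; Quantum mechanics; Ecology; Temperature control; Agronomy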

Authors

Yaoyao Bao, Yuanming Zhu, Feng Qian

Source

Journal: Industrial & Engineering Chemistry Research [American Chemical Society]
Date: 2021-04-06 Volume (Issue): 60 (15): 5504-5515 Citations: 43

Identifier

DOI: 10.1021/acs.iecr.0c05678

Abstract

Advanced model-based control methods are widely used in industrial process control, but sustaining their performance requires regular maintenance of the underlying model. Reinforcement learning can instead update its policy online from data observed while interacting with the environment. Because a fast and stable learning process is needed to improve the controller's adaptability, we propose an improved deep deterministic actor-critic predictor, in which the immediate reward is separated from the action-value function to provide the actor with reliable gradient information at early stages of learning. An expectation form of the policy gradient is then developed under the assumption that the state follows a normal distribution. Simulation results show that the proposed algorithm learns faster and more stably than state-of-the-art deep reinforcement learning (DRL) algorithms. The resulting policy also outperforms fine-tuned proportional-integral-derivative (PID) and linear model predictive controllers, especially for nonlinear processes. These results indicate that the improved DRL controller has the potential to become an important tool in practical applications.
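
The idea of separating the immediate reward from the action-value function can be illustrated with the standard deterministic policy gradient identities. The block below is only a sketch under assumed notation, not the paper's own formulation: the discount factor \gamma, the deterministic policy \mu_\theta, and the name P for the discounted future-return term are all choices made here for illustration, and the authors' exact definitions may differ.

```latex
% Assumed notation (not taken from the abstract):
%   \mu_\theta : deterministic policy,  \gamma : discount factor,
%   P(s,a)     : expected action value at the next state under \mu_\theta.

% Bellman equation for a deterministic policy:
Q^{\mu}(s_t, a_t) = r(s_t, a_t)
  + \gamma \, \mathbb{E}_{s_{t+1}}\!\left[ Q^{\mu}\big(s_{t+1}, \mu_\theta(s_{t+1})\big) \right]

% Name the future-return term separately from the immediate reward:
P(s_t, a_t) := \mathbb{E}_{s_{t+1}}\!\left[ Q^{\mu}\big(s_{t+1}, \mu_\theta(s_{t+1})\big) \right],
\qquad
Q^{\mu}(s_t, a_t) = r(s_t, a_t) + \gamma \, P(s_t, a_t)

% Deterministic policy gradient with the action gradient split into two parts:
\nabla_\theta J(\theta)
  = \mathbb{E}_{s}\!\left[ \nabla_\theta \mu_\theta(s) \,
      \Big( \nabla_a r(s,a) + \gamma \, \nabla_a P(s,a) \Big)\Big|_{a = \mu_\theta(s)} \right]
```

One plausible reading of the abstract is that the \nabla_a r(s,a) term can be estimated reliably even when the bootstrapped term P is still poorly fitted early in training, so the actor receives a usable gradient signal from the start.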
