计算机科学
强化学习
稳健性(进化)
控制器(灌溉)
人工智能
农学
生物化学
生物
基因
化学
作者
Jiawen Li,Haoyang Cui,Wei Jiang
标识
DOI:10.1016/j.engappai.2023.105818
摘要
In order to sustain solid oxide fuel cell (SOFC) net output power and prevent violation of oxygen excess ratio (OER) constraint and fuel utilization (FU) constraint, a data-driven gas supply system coordination management method is proposed. Accordingly, a population evolution-based multi-agent double delay deep deterministic policy gradient (PE-MA4DPG) algorithm is introduced. The artificial intelligence design of the algorithm is guided by the concepts of imitation learning and curriculum learning, whereby different agents of different combinations are trained in different environments, thus improving the robustness of the coordination strategy. In this algorithm, the hydrogen controller and the air controller are treated as two agents. The centralized training enables agents with different objectives to coordinate with each other. The effectiveness of the proposed algorithm is demonstrated in three experiments, wherein the proposed algorithm is compared with a group of existing algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI