强化学习
需求方
能源管理
钢筋
能量(信号处理)
计算机科学
业务
工程类
环境经济学
人工智能
经济
结构工程
统计
数学
作者
Jiejie Liu,Yanan Ma,Ying Chen,Chunxia Zhao,Xianyang Meng,Jiangtao Wu
出处
期刊:Energy
[Elsevier BV]
日期:2025-02-15
卷期号:319: 135056-135056
被引量:19
标识
DOI:10.1016/j.energy.2025.135056
摘要
Incorporating multiple flexible resources and energy sharing into the regional integrated energy system (RIES) provides an attractive pathway for resilience enhancement. However, traditional model-based optimization methods are not sufficiently flexible to deal with benefit games of multiple entities and complex multi-energy flows of RIES. Therefore, this work proposes a cooperative energy management framework using multi-agent deep reinforcement learning (MADRL) for optimal operation. Firstly, the collaborative optimization between shared energy storage, IES energy stations and users is developed, in which users could make subjective decisions to participate in demand response and the shared energy storage is employed to coordinate energy balance. Secondly, the cooperative optimization is formulated as a Markov decision process . The multi-agent twin delayed deep deterministic policy (MATD3) is leveraged to tackle the optimal scheduling problem, aiming at operation profits and user satisfaction. Thirdly, an imitation actor-attention critic (IAAC) mechanism is proposed, which could assist actors in learning effective strategies and generate more accurate state-action value function of critics. The results show that the proposed IAAC-MATD3 algorithm exhibits the fastest convergence compared with baseline algorithms. The operation cost of cooperation optimization is better than those of three baseline scenarios and achieves an improvement of 43.7 %, 19.9 %, and 34.6 %, respectively. • A cooperative operation strategy among shared energy storage, IES energy stations and users of RIES is proposed. • The multi-agent deep reinforcement learning is used to solve optimal energy management problem. • An imitation actor-attention critic mechanism is proposed to enhance the training performance of agents. • The proposed algorithm is better than those of the baseline algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI