强化学习
计算机科学
地铁列车时刻表
微电网
过程(计算)
水准点(测量)
人工智能
弹性(材料科学)
网格
适应性
弹道
机器学习
控制工程
控制(管理)
工程类
操作系统
地理
物理
天文
热力学
生物
数学
生态学
大地测量学
几何学
出处
期刊:IEEE Transactions on Sustainable Energy
[Institute of Electrical and Electronics Engineers]
日期:2022-02-04
卷期号:13 (2): 1062-1072
被引量:58
标识
DOI:10.1109/tste.2022.3148236
摘要
Microgrids can be operated in island mode during utility grid outages to support service restoration and improve system resilience. To schedule and dispatch distributed energy resources (DERs) in an islanded microgrid, conventional model-based methods rely on accurate distribution network models and lack generalization and adaptability. Data-driven methods are promising for DER coordination but face practical challenges such as potential hazards to microgrids during online training and insufficient online training opportunities due to low outage rates. This paper presents a novel two-stage learning framework to identify an optimal restoration strategy. The proposed framework builds on the deep deterministic policy gradient from demonstrations, which is a dataset that contains a trajectory of states and the associated expert actions. At the pre-training stage, imitation learning is applied to equip the control agent with expert experiences to guarantee acceptable initial performance. At the online training stage, action clipping, reward shaping, and expert demonstrations are leveraged to ensure safe exploration while accelerating the training process. The proposed method is illustrated using the IEEE 123-node system and compared with a representative model-based method and the standard deep deterministic policy gradient method to prove solution accuracy and demonstrate increased computational efficiency.
科研通智能强力驱动
Strongly Powered by AbleSci AI