强化学习
模块化设计
计算机科学
背景(考古学)
网格
分布式计算
钥匙(锁)
软件
软件框架
人工智能
软件系统
基于构件的软件工程
操作系统
古生物学
几何学
数学
生物
作者
David Biagioni,Xiangyu Zhang,Dylan Wald,Deepthi Vaidhynathan,Rohit Chintala,Jennifer King,Ahmed S. Zamzam
出处
期刊:Cornell University - arXiv
日期:2021-01-01
被引量:5
标识
DOI:10.48550/arxiv.2111.05969
摘要
We present the PowerGridworld software package to provide users with a lightweight, modular, and customizable framework for creating power-systems-focused, multi-agent Gym environments that readily integrate with existing training frameworks for reinforcement learning (RL). Although many frameworks exist for training multi-agent RL (MARL) policies, none can rapidly prototype and develop the environments themselves, especially in the context of heterogeneous (composite, multi-device) power systems where power flow solutions are required to define grid-level variables and costs. PowerGridworld is an open-source software package that helps to fill this gap. To highlight PowerGridworld's key features, we present two case studies and demonstrate learning MARL policies using both OpenAI's multi-agent deep deterministic policy gradient (MADDPG) and RLLib's proximal policy optimization (PPO) algorithms. In both cases, at least some subset of agents incorporates elements of the power flow solution at each time step as part of their reward (negative cost) structures.
科研通智能强力驱动
Strongly Powered by AbleSci AI