强化学习
纳什均衡
计算机科学
数学优化
汉密尔顿-雅各比-贝尔曼方程
趋同(经济学)
多智能体系统
贝尔曼方程
图形
数学
理论计算机科学
人工智能
经济增长
经济
作者
Mohammed Abouheaf,Frank L. Lewis
标识
DOI:10.1109/cdc.2013.6760804
摘要
This paper studies a class of multi-agent graphical games denoted by differential graphical games, where interactions between agents are prescribed by a communication graph structure. Ideas from cooperative control are given to achieve synchronization among the agents to a leader dynamics. New coupled Bellman and Hamilton-Jacobi-Bellman equations are developed for this class of games using Integral Reinforcement Learning. Nash solutions are given in terms of solutions to a set of coupled continuous-time Hamilton-Jacobi-Bellman equations. A multi-agent policy iteration algorithm is given to learn the Nash solution in real time without knowing the complete dynamic models of the agents. A proof of convergence for this algorithm is given. An online multi-agent method based on policy iterations is developed using a critic network to solve all the Hamilton-Jacobi-Bellman equations simultaneously for the graphical game.
科研通智能强力驱动
Strongly Powered by AbleSci AI