遏制(计算机编程)
计算机科学
贝尔曼方程
趋同(经济学)
数学优化
最优控制
启发式
动态规划
功能(生物学)
控制(管理)
强化学习
纳什均衡
动作(物理)
人工神经网络
数学
人工智能
算法
量子力学
进化生物学
生物
物理
经济增长
经济
程序设计语言
标识
DOI:10.1016/j.neucom.2018.06.011
摘要
This paper deals with the model-free optimal containment control problem for a class of linear multi-agent systems (MASs). In the existing results concerning containment control of MASs, the dynamics of the MASs is required to be completely known. Differently, in this paper, we propose a new distributed self-learning control scheme based on action dependent heuristic dynamic programming (ADHDP) to achieve containment control, where the model of MASs is no longer needed. The containment control problem is first transformed into a regulation problem on the dynamics of the designed local containment error. The policy iteration method based on the designed state-action value function (also called the Q-function) is introduced to deal with such a regulation problem. The convergence analysis of this policy iteration method is also given. Neural network (NN) based actor-critic framework is adopted to approximate the optimal Q-functions and the optimal control policies for facilitating the implementation of the proposed method. It shows that the approximated control policies achieve the containment control and satisfy the global Nash equilibrium. Finally, the simulation results are provided to demonstrate the effectiveness of the developed approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI