反推
控制理论(社会学)
强化学习
非线性系统
计算机科学
有界函数
人工神经网络
跟踪误差
控制器(灌溉)
趋同(经济学)
贝尔曼方程
鞍点
数学优化
数学
自适应控制
控制(管理)
人工智能
数学分析
物理
量子力学
农学
经济
生物
经济增长
几何学
作者
Jia Long,Dengxiu Yu,Guoxing Wen,Li Li,Zhen Wang,C. L. Philip Chen
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2022-01-01
卷期号:: 1-14
被引量:6
标识
DOI:10.1109/tnnls.2022.3177461
摘要
In this article, the game-based backstepping control method is proposed for the high-order nonlinear multi-agent system with unknown dynamic and input saturation. Reinforcement learning (RL) is employed to get the saddle point solution of the tracking game between each agent and the reference signal for achieving robust control. Specifically, the approximate optimal solution of the established Hamilton-Jacobi-Isaacs (HJI) equation is obtained by policy iteration for each subsystem, and the single network adaptive critic (SNAC) architecture is used to reduce the computational burden. In addition, based on the separation operation of the error term from the derivative of the value function, we achieve the different proportions of the two agents in the game to realize the regulation of the final equilibrium point. Different from the general use of the neural network for system identification, the unknown nonlinear dynamic term is approximated based on the state difference obtained by the command filter. Furthermore, a sufficient condition is established to guarantee that the whole system and each subsystem included are uniformly ultimately bounded. Finally, simulation results are given to show the effectiveness of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI