零和博弈
零(语言学)
计算机科学
纳什均衡
控制理论(社会学)
控制(管理)
数学优化
人工智能
数学
语言学
哲学
作者
Jun Zhao,Yongfeng Lv,Ziliang Zhao
出处
期刊:IEEE Transactions on Circuits and Systems Ii-express Briefs
[Institute of Electrical and Electronics Engineers]
日期:2021-09-13
卷期号:69 (3): 1437-1441
被引量:18
标识
DOI:10.1109/tcsii.2021.3112050
摘要
Although optimal control with full state-feedback has been well studied, online solving output-feedback optimal control problem is difficult, in particular for learning online Nash equilibrium solution of the continuous-time (CT) two-player zero-sum differential games. For this purpose, we propose an adaptive learning algorithm to address this trick problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback control counterpart. An adaptive online learning method is proposed to approximate the solution to the MGARE through online data, where two operations (i.e., vectorization and Kronecker's product) can be adopted to reconstruct the MGARE. Only system output information is needed to implement developed learning algorithm. Simulation results are carried out to exemplify the proposed control and learning method.
科研通智能强力驱动
Strongly Powered by AbleSci AI