背景(考古学)
地平线
国家(计算机科学)
功能(生物学)
零和博弈
零(语言学)
计算机科学
最优控制
应用数学
数学
算法
数学优化
纳什均衡
生物
进化生物学
哲学
古生物学
语言学
几何学
作者
Mingxiang Liu,Qianqian Cai,Dandan Li,Wei Meng,Minyue Fu
标识
DOI:10.1016/j.neucom.2023.01.050
摘要
In this paper, we present a Q-learning framework for solving finite-horizon zero-sum game problems involving the H∞ control of linear system without knowing the dynamics. Research in the past mainly focused on solving problems in infinite horizon with completely measurable state. However, in the practical engineering, the system state is not always directly accessible, and it is difficult to solve the time-varying Riccati equation associated with the finite-horizon setting directly either. The main contribution of the proposed model-free algorithm is to determine the optimal output feedback policies without measurement state in finite-horizon setting. To achieve this goal, we first describe the Q-function caused by finite-horizon problems in the context of state feedback, then we parameterize the Q-functions as input–output vectors functions. Finally, the numerical examples on aircraft dynamics demonstrate the algorithm's efficiency.
科研通智能强力驱动
Strongly Powered by AbleSci AI