趋同(经济学)
单调函数
加速度
计算机科学
迭代学习控制
控制理论(社会学)
非线性系统
数学优化
收敛速度
序列(生物学)
区间(图论)
功能(生物学)
数学
控制(管理)
人工智能
钥匙(锁)
数学分析
物理
计算机安全
经典力学
量子力学
组合数学
进化生物学
生物
经济
遗传学
经济增长
作者
Ding Wang,Yuan Wang,Mingming Ha,Jin Ren,Junfei Qiao
摘要
Abstract In this article, an adaptive critic scheme with a novel performance index function is developed to solve the tracking control problem, which eliminates the tracking error and possesses the adjustable convergence rate in the offline learning process. Under some conditions, the convergence and monotonicity of the accelerated value function sequence can be guaranteed. Combining the advantages of the adjustable and general value iteration schemes, an integrated algorithm is proposed with a fast guaranteed convergence, which involves two stages, namely the acceleration stage and the convergence stage. Moreover, an effective approach is given to adaptively determine the acceleration interval. With this operation, the fast convergence of the new value iteration scheme can be fully utilized. Finally, compared with the general value iteration, the numerical results are presented to verify the fast convergence and the tracking performance of the developed adaptive critic design.
科研通智能强力驱动
Strongly Powered by AbleSci AI