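Backstepping
Control theory (sociology)
Lyapunov function
Nonlinear system
Lyapunov redesign
Control-Lyapunov function
Tracking (education)
Computer science
Adaptive control
State (computer science)
Control (management)
Physics
Artificial intelligence
Algorithm
Psychology
Pedagogy
Quantum mechanics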
Authors
Bolong Zhu,Ning Xu,Guangdeng Zong,Xudong Zhao
Abstract
In this article, the problem of adaptive optimal tracking control is studied for nonlinear strict‐feedback systems. While not directly measurable, the states of these systems are subject to both time‐varying and asymmetric constraints. Bypassing the conventional barrier Lyapunov function method, the constrained system is transformed into its unconstrained counterpart, thereby obviating the need for feasibility conditions. A specially designed reinforcement learning (RL) algorithm, featuring an observer‐critic‐actor architecture, is deployed in an adaptive optimal control scheme to ensure the stabilization of the converted unconstrained system. Within this architecture, the observer estimates the unmeasurable system states, the critic evaluates the control performance, and the actor executes the control actions. Furthermore, enhancements to the RL algorithm lead to relaxed conditions of persistent excitation, and the design methodology for the observer overcomes the restrictions imposed by the Hurwitz equation. The Lyapunov stability theorem is applied for two primary purposes: to ascertain the boundedness of all signals within the closed‐loop system, and to ensure the accuracy of the output signal in tracking the desired reference trajectory. Finally, numerical and practical simulations are provided to corroborate the effectiveness of the proposed control strategy.