反推
强化学习
观察员(物理)
计算机科学
控制理论(社会学)
人工神经网络
非线性系统
理论(学习稳定性)
Lyapunov稳定性
国家观察员
多智能体系统
李雅普诺夫函数
协议(科学)
自适应控制
国家(计算机科学)
数学优化
控制(管理)
数学
人工智能
算法
医学
机器学习
物理
病理
量子力学
替代医学
作者
Jie Lan,Yan‐Jun Liu,Dengxiu Yu,Guoxing Wen,Shaocheng Tong,Lei Liu
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2022-04-13
卷期号:35 (3): 3144-3155
被引量:36
标识
DOI:10.1109/tnnls.2022.3158085
摘要
This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.
科研通智能强力驱动
Strongly Powered by AbleSci AI