汉密尔顿-雅各比-贝尔曼方程
强化学习
正确性
计算机科学
趋同(经济学)
数学优化
同步(交流)
非线性系统
观察员(物理)
状态空间
国家(计算机科学)
控制理论(社会学)
最优控制
数学
控制(管理)
人工智能
算法
统计
经济
频道(广播)
物理
量子力学
经济增长
计算机网络
作者
Lina Xia,Qing Li,Ruizhuo Song,Hamidreza Modares
出处
期刊:IEEE/CAA Journal of Automatica Sinica
[Institute of Electrical and Electronics Engineers]
日期:2021-12-10
卷期号:9 (3): 520-532
被引量:33
标识
DOI:10.1109/jas.2021.1004359
摘要
The asymmetric input-constrained optimal synchronization problem of heterogeneous unknown nonlinear multiagent systems (MASs) is considered in the paper. Intuitively, a state-space transformation is performed such that satisfaction of symmetric input constraints for the transformed system guarantees satisfaction of asymmetric input constraints for the original system. Then, considering that the leader's information is not available to every follower, a novel distributed observer is designed to estimate the leader's state using only exchange of information among neighboring followers. After that, a network of augmented systems is constructed by combining observers and followers dynamics. A nonquadratic cost function is then leveraged for each augmented system (agent) for which its optimization satisfies input constraints and its corresponding constrained Hamilton-Jacobi-Bellman (HJB) equation is solved in a data-based fashion. More specifically, a data-based off-policy reinforcement learning (RL) algorithm is presented to learn the solution to the constrained HJB equation without requiring the complete knowledge of the agents' dynamics. Convergence of the improved RL algorithm to the solution to the constrained HJB equation is also demonstrated. Finally, the correctness and validity of the theoretical results are demonstrated by a simulation example.
科研通智能强力驱动
Strongly Powered by AbleSci AI