强化学习
计算机科学
群(周期表)
跟踪(教育)
多智能体系统
控制(管理)
钢筋
人工智能
分布式计算
心理学
社会心理学
化学
教育学
有机化学
作者
Jun Li,Lianghao Ji,Cuijuan Zhang,Huaqing Li
标识
DOI:10.1016/j.ins.2022.07.181
摘要
In this paper, we study a class of optimal couple-group tracking control (OCGTC) problems for heterogeneous multi-agent systems (HeMASs) based on reinforcement learning (RL) method, whose goal is to minimize the local tracking errors (states) and control inputs (actions) of followers by learning the dynamic knowledge of a single leader. The weakly connected multi-agent network is randomly divided into coupled sub-networks, and each agent in the same sub-network cooperates to accomplish tracking control such that the positions and velocities of all the agents converge to the same value, while the agents from different subgroups compete with each other to dissimilar tracking goals. In particular, in the discussed HeMASs, we consider agents with unknown dynamics of first-order and second-order. To solve the algebraic Riccati equation (ARE), an policy-value-based actor-critic technique is applied. Using the Lyapunov-like theorem, we verify that the local tracking error and the estimated weights of actor-critic neural networks are deduced to be uniformly ultimately bounded. Eventually, several simulations demonstrate the correctness of the retrieved theoretical results.
科研通智能强力驱动
Strongly Powered by AbleSci AI