控制理论(社会学)
加权
数学优化
计算机科学
最优控制
贝尔曼方程
趋同(经济学)
解耦(概率)
纳什均衡
异步通信
数学
控制(管理)
控制工程
人工智能
工程类
医学
计算机网络
经济
放射科
经济增长
作者
Hao Zhang,Yan Li,Zhuping Wang,Yi Ding,Huaicheng Yan
出处
期刊:IEEE Transactions on Network Science and Engineering
[Institute of Electrical and Electronics Engineers]
日期:2023-08-29
卷期号:11 (1): 872-885
被引量:4
标识
DOI:10.1109/tnse.2023.3309816
摘要
Aiming at the distributed optimal control problem in presence of external disturbance, a data-driven iterative algorithm based on policy gradient is proposed for multi-agent systems (MAS) in this article. Taking into full consideration the influence of itself and local neighbor perturbation information, the optimization problem is replaced by a zero-sum game problem with control and disturbance policies. Then, the control behavior under distributed conditions is evaluated using the action-state value function to achieve decoupling of system parameters. Afterwards, adaptive dynamic programming (ADP) method is designed based on the actor-critic neural network structure to obtain the Nash equilibrium solution of the zero-sum game. The rules for updating the network weights are given by utilizing the residual weighting method based on both online and offline data. Besides, asynchronous update iteration control technique is developed to address the situation that other agents do not update their policies at the same time. Sufficient conditions for stability and convergence are established through a rigorous proof. The validity of the established data-driven control method is demonstrated by the simulation results.
科研通智能强力驱动
Strongly Powered by AbleSci AI