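Reinforcement learning
Computer science
Optimal control
Mathematical optimization
Controller (irrigation)
Nonlinear system
Control theory (sociology)
Optimization problem
Theory (learning stability)
Function (biology)
Train
Control (management)
Mathematics
Algorithm
Artificial intelligence
Physics
Cartography
Quantum mechanics
Machine learning
Evolutionary biology
Agronomy
Biology
Geography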
Authors
Fatemeh Mahdavi Golmisheh, Saeed Shamaghdari
Identifier
DOI:10.1016/j.amc.2023.128302
Abstract
This article formulates safe, optimal formation control of a heterogeneous nonlinear multi-agent system as a problem of distributed training with decentralized execution. The control objective is to guarantee safety while achieving optimal performance. This objective is achieved by introducing novel distributed optimization problems whose cost functions incorporate local control barrier functions (CBFs). Optimal performance is defined as the design of an optimal formation controller and is modeled by a cost function. A local CBF trains a safe controller so that the multi-agent system operates within the safe regions. Instead of solving constrained optimization problems, the method generates safe, optimal controllers from unconstrained optimization problems by utilizing the local CBFs. As a result, the presented approach has a lower computational cost than constrained optimization. It is proven that adding the local CBF to the cost function does not affect the optimality or stability of the proposed controller. A safe, optimal policy is derived iteratively using a new off-policy multi-agent reinforcement learning (MARL) algorithm that requires no knowledge of the agents' dynamics. Finally, the effectiveness of the proposed algorithm is evaluated through simulation of collision-free formation control of multiple quadrotors.
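The idea of folding a local CBF into the cost, so that safety is handled by an unconstrained problem, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the barrier h = ||p_i - p_j||^2 - d_safe^2, the reciprocal penalty 1/h, the quadratic tracking and effort terms, the weights q, r, and beta, and all function names are hypothetical. The abstract does not specify the form of the cost or of the CBF.

```python
import math


def barrier_value(p_i, p_j, d_safe):
    """Assumed barrier h: positive exactly when the agents are farther apart than d_safe."""
    dist_sq = sum((a - b) ** 2 for a, b in zip(p_i, p_j))
    return dist_sq - d_safe ** 2


def barrier_penalty(h):
    """Assumed reciprocal penalty: finite inside the safe set, unbounded at its boundary."""
    if h <= 0.0:
        return math.inf
    return 1.0 / h


def augmented_stage_cost(p_i, p_j, offset, u_i, d_safe, q=1.0, r=0.1, beta=0.5):
    """Formation-tracking cost plus control effort plus a weighted barrier penalty.

    Agent i is asked to sit at p_j + offset. No explicit constraint is imposed;
    the penalty alone makes near-collision states expensive.
    """
    err = [a - (b + o) for a, b, o in zip(p_i, p_j, offset)]
    tracking = q * sum(e * e for e in err)
    effort = r * sum(u * u for u in u_i)
    safety = beta * barrier_penalty(barrier_value(p_i, p_j, d_safe))
    return tracking + effort + safety


# Hypothetical two-agent example in the plane with a safety distance of 1.
far = augmented_stage_cost((2.0, 0.0), (0.0, 0.0), (2.0, 0.0), (0.0, 0.0), 1.0)
near = augmented_stage_cost((1.1, 0.0), (0.0, 0.0), (2.0, 0.0), (0.0, 0.0), 1.0)
unsafe = augmented_stage_cost((0.5, 0.0), (0.0, 0.0), (2.0, 0.0), (0.0, 0.0), 1.0)
print(far, near, unsafe)
```

In this sketch a minimizer of the augmented cost is steered away from the unsafe region because the penalty grows without bound as h approaches zero, which is the general mechanism that lets an unconstrained solver stand in for a constrained one. Whether the paper uses this particular penalty is not stated in the abstract.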