斯塔克伯格竞赛
计算机科学
强化学习
分布式计算
趋同(经济学)
群体行为
灵活性(工程)
人工智能
过程(计算)
理论(学习稳定性)
机器学习
统计
数学
数理经济学
经济
经济增长
操作系统
作者
Wenji He,Haipeng Yao,Tianle Mai,Fu Wang,Mohsen Guizani
出处
期刊:IEEE Transactions on Vehicular Technology
[Institute of Electrical and Electronics Engineers]
日期:2023-02-23
卷期号:72 (7): 9366-9380
被引量:29
标识
DOI:10.1109/tvt.2023.3246636
摘要
In the past decade, the unmanned aerial vehicles (UAVs) swarm has become a disruptive force reshaping our lives and work. In particular, advances in artificial intelligence have allowed multiple UAVs to coordinate their operations and work together to accomplish various complex tasks, one of which is Federated Learning (FL). As a promising distributed learning paradigm, FL can be adopted well with the limited resources and dynamic network topology of UAV swarms. However, the current FL's training process relies on homogeneous data paradigms, which require distributed UAVs to hold the same structure data. This ideal hypothesis can not apply to the heterogeneous UAV swarms. To tackle this problem, in this paper, we design a clustered federated learning (CFL) architecture, in which we cluster UAV swarms based on the similarities between the participants' optimization directions. Then, we formulate the model trading among model owners, cluster heads, and UAV workers as a three-stage Stackelberg game to optimize the allocation of the limited resources. We design a hierarchical reinforcement learning algorithm to search for the Stackelberg equilibrium under the clustered federated learning system. The performance evaluation demonstrates the uniqueness and stability of the proposed three-stage leader-follower game under the clustered framework, as well as the convergence and effectiveness of the reinforcement learning algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI