计算机科学
强化学习
马尔可夫决策过程
图形
人工智能
趋同(经济学)
人工神经网络
弹道
分布式计算
机器学习
运筹学
马尔可夫过程
理论计算机科学
统计
物理
数学
天文
工程类
经济
经济增长
作者
Ziren Xiao,Peisong Li,Chang Liu,Honghao Gao,Xinheng Wang
标识
DOI:10.1016/j.inffus.2024.102250
摘要
Multi-agent collaborative navigation is prevalent in modern transportation systems, including delivery logistics, warehouse automation, and personalised tourism, where multiple agents must converge at a common destination from different starting points. However, the challenges lie in optimising routes for multiple agents while dynamically adjusting the common destination in response to changing traffic conditions. Therefore, we propose a generic Multi-Agent Collaborative Navigation System (MACNS) to address the challenges. First, we formalise the solution of the problem and challenges into a Markov Decision Process (MDP), which is further developed as a training environment where a Deep Reinforcement Learning (DRL) agent can learn patterns efficiently. Second, the proposed framework integrates a Graph Neural Network (GNN) into the policy network of Proximal Policy Optimisation for the homogeneous decision-making of each individual agent, showing good generalisation and convergence speed. Finally, we demonstrate how MACNS can be applied and implemented in a real-world use case. Extensive simulations and real-world tests validate the effectiveness of the MACNS-based use case, showcasing its superiority over other state-of-the-art PPO-related methods in terms of planned routes and user experience.
科研通智能强力驱动
Strongly Powered by AbleSci AI