计算机科学
计算机网络
路由器
强化学习
网络数据包
分布式计算
网络拥塞
包转发
路由协议
核心路由器
单臂路由器
网络性能
链路状态包
处理延迟
传输延迟
人工智能
作者
Ruijin Ding,Yuwen Yang,Jun Liu,Hongyan Li,Feifei Gao
出处
期刊:2020 International Conference on Computing, Networking and Communications (ICNC)
日期:2020-02-01
被引量:10
标识
DOI:10.1109/icnc47757.2020.9049759
摘要
The continuous growth of the network data would lead to the increased network congestion and the throughput decline. In this paper, we investigate the packet routing problem based on deep multi-agent reinforcement learning, where each router chooses the next hop router by itself intelligently. We design the modified deep Q-network in each router to evaluate the neighbor routers. The routers, each acting as an agent, choose the next hop router based on their local observation. Then they transfer the packets to the chosen routers and receive the reward and the observation of the next hop routers. Using their experience, the routers learn to improve the packet routing strategy by updating their Q-networks. We demonstrate that with proper reward set and training mechanism, the routers in the network can work in a distributed way to reduce the computational complexity compared with the single-agent reinforcement learning based algorithm. And the proposed algorithm can further reduce the congestion probability and improve the network performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI