远程直接内存访问
计算机科学
块(置换群论)
操作系统
数学
几何学
作者
Yuzhen Su,Jiao Zhang,Zirui Wan,Pingping Lin,Yunpeng Zhang,Tian Pan,Tao Huang
标识
DOI:10.1109/iccc59590.2023.10507657
摘要
In Remote Direct Memory Access (RDMA) data center networks, incast has a serious impact on non-incast flows due to its many-to-one and synchronous characteristics. With the increasing link rate, the response speed of RDMA Congestion Control (CC) alone fails to effectively cope with incast. However, existing RDMA incast solutions have limitations. They are unable to control the incast in time and require extensive device modifications. We propose Hermes, an efficient incast control scheme based on Top-of-Rack (ToR) switches. Hermes achieves accurate incast identification with the shortest control loop through the traffic monitor and uses the Virtual Output Queue (VOQ) to converge quickly to the proper rate of incast traffic. The core idea of Hermes is to minimize the accumulation of incast traffic on core switches and improve the performance of non-incast flows. Hermes only modifies ToR switches and is compatible with existing RDMA CCs. We evaluate Hermes through our Testbed and NS3 simulations. The results show that Hermes can reduce the average and 99th Flow Completion Time (FCT) slowdown of incast victim flows by up to 20% and 55% respectively, compared with the state-of-the-art solution.
科研通智能强力驱动
Strongly Powered by AbleSci AI