重传
计算机网络
计算机科学
网络拥塞
显式拥塞通知
数据包丢失
网络数据包
流量控制(数据)
远程直接内存访问
解雇
吞吐量
实时计算
操作系统
慢启动
工程类
无线
机械工程
作者
Sirui Zhou,Yizhou Gong,Zhefan Fan,Yunfang Chen,Wei Zhang,Wanting Tian,Yunqu Liu
标识
DOI:10.1109/iccc56324.2022.10065950
摘要
In recent years, the use of RDMA in data center networks has increased significantly, with the largest deployments of the RoCEv2 protocol using the go-back-n scheme for loss recovery. However, it can waste time and bandwidth due to redundant packets and can even exacerbate network congestion levels. In this paper, we propose a combination of selective retransmission and DCQCN congestion control schemes (SR-DCQCN) to improve the loss recovery process in data centers, where the sender retransmits only the truly lost packets based on the information carried by the received NACK and the receiver achieves out-of-order packet reception with the support of bitmap. To improve loss recovery in data centers, we combine such a selective retransmission scheme with DCQCN, the default congestion control scheme for data centers, which includes DCQCN's ECN mechanism and its core congestion control algorithm. Through our experimental evaluation, SR-DCQCN reduces the 99%ile flow completion time by 51.9% compared to DCQCN when the network load utilization is 50%.
科研通智能强力驱动
Strongly Powered by AbleSci AI