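Computer science
Plume
Reinforcement learning
Generalization
Latency (audio)
Artificial intelligence
Real-time computing
Telecommunications
Meteorology
Mathematical analysis
Physics
Mathematics
Authors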
Dehui Wei, Jiao Zhang, Xuan Zhang, Chengyuan Huang
Source
Journal: China Communications
[Institute of Electrical and Electronics Engineers]
Date: 2022-05-10
Volume (Issue): 19 (12): 101-117
Citations: 1
Identifier
DOI:10.23919/jcc.2022.00.019
Abstract
Congestion control (CC) has always been an important issue in networking, and research interest in it has never diminished in either academia or industry. In recent years, owing to the rapid development of machine learning (ML), combining reinforcement learning (RL) with CC has produced striking results. However, these complicated schemes lack generalization and are too heavyweight in storage and computation to be deployed directly on mobile devices. To address these problems, we propose Plume, a high-performance, lightweight, and generalized RL-CC scheme. Plume uses a lightweight framework to reduce overhead while preserving the original performance. In addition, Plume modifies the framework parameters of the reward function during retraining so that the algorithm can be applied to a variety of scenarios. Evaluation results show that Plume retains almost all of the performance of the original model while reducing size and decision latency by more than 50% and 20%, respectively. Moreover, Plume performs better in some special scenarios.