计算机科学
强化学习
供应
可靠性(半导体)
有界函数
虚拟化
服务质量
跟踪(心理语言学)
数学优化
分布式计算
钥匙(锁)
约束(计算机辅助设计)
功能(生物学)
动态规划
软件
云计算
人工智能
算法
计算机网络
计算机安全
哲学
数学
数学分析
语言学
工程类
生物
操作系统
功率(物理)
量子力学
进化生物学
程序设计语言
机械工程
物理
作者
Yue Zeng,Zhihao Qu,Song Guo,Bin Tang,Baoliu Ye,Jing Li,Jie Zhang
出处
期刊:IEEE Transactions on Services Computing
[Institute of Electrical and Electronics Engineers]
日期:2023-05-31
卷期号:16 (5): 3651-3664
被引量:2
标识
DOI:10.1109/tsc.2023.3281759
摘要
As a key enabling technology for 5G, network function virtualization abstracts services into software-based service function chains (SFCs), facilitating mission-critical services with high-reliability requirements. However, it is challenging to cost-effectively provide reliable SFCs in dynamic environments due to delayed rewards caused by future SFC requests, limited infrastructure resources, and heterogeneity in hardware and software reliability. Although deep reinforcement learning (DRL) can effectively capture delayed rewards in dynamic environments, its trial-and-error exploration in a vast solution space with massive infeasible solutions may lead to frequent constraint violations and traps in poor local optima. To address these challenges, we propose a RuleDRL algorithm that combines the capability of DRL to capture delayed rewards and the strength of rule-based schemes to explore high-quality solutions without violating constraints. Specifically, we first formulate the reliable SFC provision problem as an integer nonlinear programming problem, which is proven to be NP-hard. Then, we jointly design DRL and rule-based schemes that are coupled to make the final decision and establish a bounded approximation ratio in general cases. Extensive trace-driven simulations show that RuleDRL can save the total cost by up to 65.67% and improve the SFC acceptance ratio by up to 82%, compared to the state-of-the-art solution.
科研通智能强力驱动
Strongly Powered by AbleSci AI