计算机科学
深度学习
推论
人工智能
分布式计算
地铁列车时刻表
有向无环图
云计算
机器学习
排队
边缘计算
GSM演进的增强数据速率
计算机网络
算法
操作系统
作者
Chaofeng Zhang,Mianxiong Dong,Kaoru Ota
出处
期刊:IEEE Transactions on Services Computing
[Institute of Electrical and Electronics Engineers]
日期:2022-03-01
卷期号:15 (2): 627-639
被引量:10
标识
DOI:10.1109/tsc.2021.3113184
摘要
As the inevitable part of intelligent service in the new era, the services for AI tasks themselves have received significant attention, which due to the urgency of energy and computing resources, is difficult to implement in a stable and widely distributed system and coordinately utilize remote edge devices and cloud. In this article, we introduce an AI-based holistic network optimization solution to schedule AI services. Our proposed deep Q-learning algorithm optimizes the overall throughput of AI co-inference tasks themselves by balancing the uneven computation resources and traffic conditions. We use a multi-hop DAG (Directed Acyclic Graph) to describe a deep neural network (DNN) based co-inference network structure and introduce the virtual queue to analyze the Lyapunov stability for the system. Then, a priority-based data forwarding strategy is proposed to maximize the bandwidth efficiency, and we develop a Real-time Deep Q-learning based Edge Forwarding Scheme Optimization Algorithm (RDFO) to maximize the overall task processing rate. Finally, we conduct the platform simulation for the distributed co-inference system. Through the comparison with other benchmarks, we testify to the optimality of our proposal.
科研通智能强力驱动
Strongly Powered by AbleSci AI