计算机科学
工作流程
分布式计算
云计算
工作流管理系统
调度(生产过程)
工作流技术
贪婪算法
动态优先级调度
算法
数学优化
数据库
操作系统
地铁列车时刻表
数学
作者
Xiaoyong Tang,Wenbiao Cao,Huiya Tang,Tan Deng,Jing Mei,Yi Liu,Shi Cheng,Meng Xia,Zeng Zeng
出处
期刊:IEEE Transactions on Parallel and Distributed Systems
[Institute of Electrical and Electronics Engineers]
日期:2022-09-01
卷期号:33 (9): 2079-2092
被引量:18
标识
DOI:10.1109/tpds.2021.3134247
摘要
In recent years, more and more large-scale data processing and computing workflow applications run on heterogeneous clouds. Such cloud applications with precedence-constrained tasks are usually deadline-constrained and their scheduling is an essential problem faced by cloud providers. Moreover, minimizing the workflow execution cost based on cloud billing periods is also a complex and challenging problem for clouds. In realizing this, we first model the workflow applications as I/O Data-aware Directed Acyclic Graph (DDAG), according to clouds with global storage systems. Then, we mathematically state this deadline-constrained workflow scheduling problem with the goal of minimum execution financial cost. We also prove that the time complexity of this problem is NP-hard by deducing from a multidimensional multiple-choice knapsack problem. Third, we propose a heuristic cost-efficient task scheduling strategy called CETSS, which includes workflow DDAG model building, task subdeadline initialization, greedy workflow scheduling algorithm, and task adjusting method. The greedy workflow scheduling algorithm mainly consists of dynamical task renting billing period sharing method and unscheduled task subdeadline relax technique. We perform rigorous simulations on some synthetic randomly generated applications and real-world applications, such as Epigenomics, CyberShake, and LIGO. The experimental results clearly demonstrate that our proposed heuristic CETSS outperforms the existing algorithms and can effective save the total workflow execution cost. In particular, CETSS is very suitable for large workflow applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI