计算机科学
云计算
可扩展性
分布式计算
领域(数学分析)
生产(经济)
桥(图论)
人工智能
机器学习
数据挖掘
数据库
操作系统
医学
内科学
数学分析
宏观经济学
经济
数学
作者
Haibo Mi,Huaimin Wang,Yangfan Zhou,Michael R. Lyu,Hubo Cai
出处
期刊:IEEE Transactions on Parallel and Distributed Systems
[Institute of Electrical and Electronics Engineers]
日期:2013-06-01
卷期号:24 (6): 1245-1255
被引量:108
摘要
Performance diagnosis is labor intensive in production cloud computing systems. Such systems typically face many real-world challenges, which the existing diagnosis techniques for such distributed systems cannot effectively solve. An efficient, unsupervised diagnosis tool for locating fine-grained performance anomalies is still lacking in production cloud computing systems. This paper proposes CloudDiag to bridge this gap. Combining a statistical technique and a fast matrix recovery algorithm, CloudDiag can efficiently pinpoint fine-grained causes of the performance problems, which does not require any domain-specific knowledge to the target system. CloudDiag has been applied in a practical production cloud computing systems to diagnose performance problems. We demonstrate the effectiveness of CloudDiag in three real-world case studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI