计算机科学
云计算
可扩展性
分布式计算
领域(数学分析)
生产(经济)
桥(图论)
人工智能
机器学习
数据挖掘
数据库
操作系统
医学
内科学
数学分析
宏观经济学
经济
数学
作者
Haibo Mi,Huaimin Wang,Yangfan Zhou,Michael R. Lyu,Hubo Cai
摘要
Performance diagnosis is labor intensive in production cloud computing systems. Such systems typically face many real-world challenges, which the existing diagnosis techniques for such distributed systems cannot effectively solve. An efficient, unsupervised diagnosis tool for locating fine-grained performance anomalies is still lacking in production cloud computing systems. This paper proposes CloudDiag to bridge this gap. Combining a statistical technique and a fast matrix recovery algorithm, CloudDiag can efficiently pinpoint fine-grained causes of the performance problems, which does not require any domain-specific knowledge to the target system. CloudDiag has been applied in a practical production cloud computing systems to diagnose performance problems. We demonstrate the effectiveness of CloudDiag in three real-world case studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI