计算机科学
公制(单位)
因果关系(物理学)
数据挖掘
路径(计算)
异常检测
根本原因
依赖关系(UML)
人工智能
机器学习
理论计算机科学
工程类
运营管理
物理
量子力学
可靠性工程
程序设计语言
作者
Xinrui Jiang,Yicheng Pan,Meng Ma,Ping Wang
标识
DOI:10.1145/3543507.3583338
摘要
Intensive monitoring and anomaly diagnosis have become a knotty problem for modern microservice architecture due to the dynamics of service dependency. While most previous studies rely heavily on ample monitoring metrics, we raise a fundamental but always neglected issue - the diagnostic metric integrity problem. This paper solves the problem by proposing MicroCU – a novel approach to diagnose microservice systems using very sparse API logs. We design a structure named dynamic causal curves to portray time-varying service dependencies and a temporal dynamics discovery algorithm based on Granger causal intervals. Our algorithm generates a smoother space of causal curves and designs the concept of causal unimodalization to calibrate the causality infidelities brought by missing metrics. Finally, a path search algorithm on dynamic causality graphs is proposed to pinpoint the root cause. Experiments on commercial system cases show that MicroCU outperforms many state-of-the-art approaches and reflects the superiorities of causal unimodalization to raw metric imputation.
科研通智能强力驱动
Strongly Powered by AbleSci AI