绩效指标
计算机科学
数据挖掘
修剪
根本原因分析
云计算
词根(语言学)
钥匙(锁)
根本原因
集合(抽象数据类型)
任务(项目管理)
异常检测
树(集合论)
人工智能
可靠性工程
数学
工程类
数学分析
语言学
哲学
计算机安全
管理
系统工程
农学
经济
生物
程序设计语言
操作系统
作者
Yaxing Li,Yuanqing Xia,Yufeng Zhan,Runze Gao,Chu-ge Wu
标识
DOI:10.1109/cac53003.2021.9727417
摘要
Additive key performance indicators (KPI) with multi-dimensional attributes are important monitoring indicators in internet companies. When the anomaly occurs in the overall KPI, it is critical but challenging to locate the root cause of the anomaly. There are mainly two important challenges in the task. Firstly, it is difficult to locate the root causes in two typical scenarios: the amount of anomalies is not obvious and two kinds of abnormal variations interact. Secondly, the number of KPI attribute value combinations is huge, which requires higher real-time performance of the algorithm. In this paper, a robust and rapid root cause location approach, RootMiner, is proposed to address the above challenges. Firstly, a new evaluation function is adopted to achieve good results in more complex scenarios. Secondly, a multi-tree data structure as well as pre-pruning strategy is applied to improve the calculation efficiency. Based on real data set from Alibaba Cloud Computing, the experiment results show that RootMiner achieves a great improvement in effectiveness, with an average improvement of 40% compared with the state-of-the-art. The results also show that RootMiner reduces the runtime from 10s to 1s on average.
科研通智能强力驱动
Strongly Powered by AbleSci AI