服务器
计算机科学
二元分析
异常检测
离群值
时间序列
系列(地层学)
主成分分析
熵(时间箭头)
数据挖掘
比例(比率)
模式识别(心理学)
人工智能
机器学习
地理
物理
万维网
生物
量子力学
古生物学
地图学
作者
Rob J. Hyndman,Earo Wang,Nikolay Laptev
标识
DOI:10.1109/icdmw.2015.104
摘要
It is becoming increasingly common for organizations to collect very large amounts of data over time, and to need to detect unusual or anomalous time series. For example, Yahoo has banks of mail servers that are monitored over time. Many measurements on server performance are collected every hour for each of thousands of servers. We wish to identify servers that are behaving unusually. We compute a vector of features on each time series, measuring characteristics of the series. The features may include lag correlation, strength of seasonality, spectral entropy, etc. Then we use a principal component decomposition on the features, and use various bivariate outlier detection methods applied to the first two principal components. This enables the most unusual series, based on their feature vectors, to be identified. The bivariate outlier detection methods used are based on highest density regions and α-hulls.
科研通智能强力驱动
Strongly Powered by AbleSci AI