异常检测
计算机科学
聚类分析
数据挖掘
网格
异常(物理)
高斯分布
水准点(测量)
确定数据集中的群集数
噪音(视频)
模式识别(心理学)
流式数据
人工智能
CURE数据聚类算法
相关聚类
数学
地质学
物理
大地测量学
量子力学
凝聚态物理
几何学
图像(数学)
作者
Beiji Zou,Kangkang Yang,Xiaoyan Kui,Jun Liu,Shenghui Liao,Wei Zhao
标识
DOI:10.1016/j.ins.2023.118989
摘要
A massive amount of real-time and evolving streaming data are produced from various devices and applications. Anomaly detection is one of the main tasks of streaming data mining with many practical applications. However, without prior knowledge, it is difficult to detect the anomaly accurately and quickly. In this paper, we propose an unsupervised anomaly detection algorithm (GC-ADS), which is based on the idea of grid clustering and Gaussian distribution. Specifically, the data space is first segmented using the grid structure, then the data points are mapped to grids and finally grids are clustered. The anomaly can be preliminarily judged according to the cluster density. To solve the problem that clustering cannot distinguish between noise and anomaly, based on the idea of data similarity and Gaussian distribution, a noise recognition model is designed. In addition, a data filtering model based on grid and sliding window is designed to save memory and retain valid information. The proposed method is compared with the state-of-the-art methods on the Numenta anomaly benchmark. Experimental results indicate that GC-ADS detects anomalies more accurately than other methods with less time cost.
科研通智能强力驱动
Strongly Powered by AbleSci AI