异常检测
离群值
线性子空间
计算机科学
子空间拓扑
水准点(测量)
数据挖掘
模式识别(心理学)
集合(抽象数据类型)
领域(数学)
数据集
人工智能
算法
数学
地理
纯数学
程序设计语言
大地测量学
几何学
作者
Ismael Cabero Fayos,Irene Epifanio,Ana Piérola,Alfredo Ballester
标识
DOI:10.1016/j.knosys.2021.106830
摘要
The problem of detecting outliers in multivariate data sets with continuous numerical features is addressed by a new method. This method combines projections into relevant subspaces by archetype analysis with a nearest neighbor algorithm, through an appropriate ensemble of the results. Our method is able to detect an anomaly in a simple data set with a linear correlation of two features, while other methods fail to recognize that anomaly. Our method performs among top in an extensive comparison with 23 state-of-the-art outlier detection algorithms with several benchmark data sets. Finally, a novel industrial data set is introduced, and an outlier analysis is carried out to improve the fit of footwear, since this kind of analysis has never been fully exploited in the anthropometric field.
科研通智能强力驱动
Strongly Powered by AbleSci AI