概念漂移
分类器(UML)
过采样
计算机科学
数据流
随机子空间法
数据流挖掘
人工智能
数据挖掘
集成学习
机器学习
选择(遗传算法)
模式识别(心理学)
带宽(计算)
计算机网络
电信
作者
Botao Jiao,Yinan Guo,Dunwei Gong,Qiuju Chen
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2022-06-22
卷期号:35 (1): 1278-1291
被引量:39
标识
DOI:10.1109/tnnls.2022.3183120
摘要
Ensemble learning, as a popular method to tackle concept drift in data stream, forms a combination of base classifiers according to their global performances. However, concept drift generally occurs in local data space, causing significantly different performances of a base classifier at different locations. Thus, employing global performance as a criterion to select base classifier is inappropriate. Moreover, data stream is often accompanied by class imbalance problem, which affects the classification accuracy of ensemble learning on minority instances. To drawback these problems, a dynamic ensemble selection for imbalanced data streams with concept drift (DES-ICD) is proposed. For data arrived in chunk-by-chunk, a novel synthetic minority oversampling technique with adaptive nearest neighbors (AnnSMOTE) is developed to generate new minority instances that conform to the new concept. Following that, DES-ICD creates a base classifier on newly arrived data chunk balanced by AnnSMOTE and merges it with historical base classifiers to form a candidate classifier pool. For each query instance, the optimal combination is constructed in terms of the performance of candidate classifiers in its neighborhood. Experimental results for nine synthetic and five real-world datasets show that the proposed method outperforms seven comparative methods on classification accuracy and tracks new concepts in an imbalanced data stream more preciously.
科研通智能强力驱动
Strongly Powered by AbleSci AI