分类器(UML)
非线性系统
计算机科学
线性分类器
人工智能
模式识别(心理学)
机器学习
数据挖掘
数据分类
统计分类
班级(哲学)
数学
物理
量子力学
作者
Huan Wan,Hui Wang,Bryan Scotney,Jun Liu,Xin Wei
标识
DOI:10.1016/j.ins.2023.119485
摘要
Linear classifiers are generally simpler and more explainable than their nonlinear variants. They can achieve satisfactory classification performance on linearly separable data, but not on nonlinear data. So, linear classifiers need extending, typically by modification of their algorithms, resulting in their nonlinear variants. In this paper we present one general method, cluster-based data relabelling (CBDR), that allows linear classifiers to work effectively on nonlinear data. CBDR partitions the data set into several non-overlapping class-specific clusters and relabels data by the clusters. A linear classifier can then be applied to the relabelled data to seek cluster-based linear decision boundaries instead of class-based decision boundaries. Extensive experimentation has demonstrated that CBDR can significantly enhance the classification performance of linear classifiers, and even outperform their nonlinear variants. Further experimentation has demonstrated that CBDR can also improve the classification performance of nonlinear classifiers. Most significant outperformance was observed on imbalanced data in both cases.
科研通智能强力驱动
Strongly Powered by AbleSci AI