计算机科学
特征选择
人工智能
特征(语言学)
支持向量机
对偶(语法数字)
领域(数学)
班级(哲学)
机器学习
算法
网(多面体)
模式识别(心理学)
数学
艺术
哲学
语言学
文学类
纯数学
几何学
作者
Ling Zhuang,Honghua Dai,Xiaoshu Hang
摘要
Fish-net algorithm is a novel field learning algorithm which derives classification rules by looking at the range of values of each attribute instead of the individual point values. In this paper, we present a Feature Selection Fish-net learning algorithm to solve the Dual Imbalance problem on text classification. Dual imbalance includes the instance imbalance and feature imbalance. The instance imbalance is caused by the unevenly distributed classes and feature imbalance is due to the different document length. The proposed approach consists of two phases: (1) select a feature subset which consists of the features that are more supportive to difficult minority class; (2) construct classification rules based on the original Fish-net algorithm. Our experimental results on Reuters21578 show that the proposed approach achieves better balanced accuracy rate on both majority and minority class than Naive Bayes MultiNomial and SVM.
科研通智能强力驱动
Strongly Powered by AbleSci AI