人工智能
过采样
过度拟合
机器学习
计算机科学
支持向量机
分类器(UML)
特征向量
人工神经网络
模式识别(心理学)
数据挖掘
带宽(计算)
计算机网络
作者
Kefan Wang,Jing An,Zhen Wei,Can Cui,Xianghua Ma,Chao Ma,Hanqiu Bao
标识
DOI:10.3389/fbioe.2021.802712
摘要
Imbalanced classification is widespread in the fields of medical diagnosis, biomedicine, smart city and Internet of Things. The imbalance of data distribution makes traditional classification methods more biased towards majority classes and ignores the importance of minority class. It makes the traditional classification methods ineffective in imbalanced classification. In this paper, a novel imbalance classification method based on deep learning and fuzzy support vector machine is proposed and named as DFSVM. DFSVM first uses a deep neural network to obtain an embedding representation of the data. This deep neural network is trained by using triplet loss to enhance similarities within classes and differences between classes. To alleviate the effects of imbalanced data distribution, oversampling is performed in the embedding space of the data. In this paper, we use an oversampling method based on feature and center distance, which can obtain more diverse new samples and prevent overfitting. To enhance the impact of minority class, we use a fuzzy support vector machine (FSVM) based on cost-sensitive learning as the final classifier. FSVM assigns a higher misclassification cost to minority class samples to improve the classification quality. Experiments were performed on multiple biological datasets and real-world datasets. The experimental results show that DFSVM has achieved promising classification performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI