计算机科学
特征选择
依赖关系(UML)
利用
人工智能
特征(语言学)
冗余(工程)
相互信息
相关性(法律)
机器学习
选择(遗传算法)
数据挖掘
模糊逻辑
模式识别(心理学)
哲学
语言学
计算机安全
政治学
法学
操作系统
作者
Jinghua Liu,Yaojin Lin,Weiping Ding,Hongbo Zhang,Ji‐Xiang Du
标识
DOI:10.1109/tfuzz.2022.3182441
摘要
Multilabel feature selection (MFS) has received widespread attention in various big data applications. However, most of the existing methods either explicitly or implicitly assume that all labels are given in advance before feature selection starts; or that all labels are independent. In fact, in many practical applications, the available labels usually arrive dynamically, and they may be interdependent with each other. Moreover, labels may be generated dynamically in a minibatch manner, which makes it more difficult to explore label dependency. In this article, we propose a novel fuzzy mutual information-based multilabel feature selection approach MSDS, which is able to solve single streaming label, minibatch streaming labels, and exploit label dependency simultaneously. In specific, we first promote fuzzy mutual information to be suitable for multilabel learning. This model can effectively consider the relationship between two labels, and has good applicability for measuring the relationship between multiple labels. Then, we analyze feature relevance and feature redundancy based on the combination of label dependency and streaming labels, which helps to facilitate the selection of high-quality feature subsets. Finally, a feature conversion is designed to fuse the representative features of new arrival streaming labels. Comprehensive experiments on twelve multilabel datasets clearly reveal the superiority of the proposed method against two streaming labels based algorithms and five state-of-the-art static label space based algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI