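Computer science
Feature selection
Data mining
Feature (linguistics)
Measure (data warehouse)
Extension (predicate logic)
Rough set
Selection (genetic algorithm)
Set (abstract data type)
Relation (database)
Class (philosophy)
Data set
Machine learning
Artificial intelligence
Philosophy
Programming language
Linguistics
Authors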
Jie Zhao,Yun Ling,Faliang Huang,Jiahai Wang,Eric Wing Kuen See-To
Identifier
DOI:10.1016/j.patcog.2023.110125
Abstract
Tolerance Rough Set (TRS) theory is commonly employed for feature selection with incomplete data. However, TRS has limitations such as ignoring uncertainty, which often leads to the inclusion of redundant features and diminished classification accuracy. To address these limitations, we propose an extension called Subrelation Tolerance Class (STC). STC decomposes the tolerance relation into two subrelations, enabling a two-stage certainty measurement. This approach progressively filters out certain regions, thereby reducing computational space requirements, and introduces a new significance measure that considers both certain and uncertain information. Leveraging STC and our proposed measure, we develop an incremental feature selection algorithm capable of handling incomplete streaming data. We conduct experiments on real-world datasets and compare the performance with existing algorithms to validate the superiority of our method. The experimental results show that our algorithm reduces the execution time by over 89.78% compared to the baselines while maintaining the classification accuracy.
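For readers unfamiliar with the tolerance relation that TRS builds on, the following is a minimal Python sketch of the classic definition for incomplete data: two objects are tolerant on a feature subset if, for every feature, their values are equal or at least one of them is missing. It is not the STC method or the incremental algorithm proposed in the paper. The function names, the use of `None` as the missing-value marker, and the toy data are all assumptions made for illustration.

```python
from typing import Optional, Sequence, Set, List

MISSING = None  # assumed marker for a missing value (hypothetical choice)
Row = Sequence[Optional[str]]


def tolerant(x: Row, y: Row, attrs: Sequence[int]) -> bool:
    """True if x and y agree on every attribute in attrs, treating a
    missing value as compatible with anything."""
    return all(
        x[a] is MISSING or y[a] is MISSING or x[a] == y[a]
        for a in attrs
    )


def tolerance_classes(data: Sequence[Row], attrs: Sequence[int]) -> List[Set[int]]:
    """For each object, the set of object indices tolerant with it."""
    return [
        {j for j, y in enumerate(data) if tolerant(x, y, attrs)}
        for x in data
    ]


def positive_region(data: Sequence[Row], labels: Sequence[str],
                    attrs: Sequence[int]) -> Set[int]:
    """Objects whose whole tolerance class carries a single label,
    i.e. objects that can be classified with certainty under attrs."""
    classes = tolerance_classes(data, attrs)
    return {
        i for i, cls in enumerate(classes)
        if len({labels[j] for j in cls}) == 1
    }


# Toy incomplete decision table: two features, one missing value.
data = [
    ("a", "x"),
    ("a", MISSING),
    ("b", "x"),
    ("b", "y"),
]
labels = ["p", "p", "q", "p"]

print(tolerance_classes(data, [0, 1]))
print(positive_region(data, labels, [0]))
print(positive_region(data, labels, [0, 1]))
```

In this toy table, feature 0 alone cannot separate objects 2 and 3, which share the value `"b"` but have different labels, while both features together place every object in a label-consistent tolerance class. Comparing such certain regions across candidate feature subsets is the kind of signal a rough-set significance measure relies on. The pairwise comparison here is quadratic in the number of objects and is only meant to make the definition concrete.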