粗集
熵(时间箭头)
数据挖掘
条件熵
计算机科学
信息图表
基于优势度的粗糙集方法
联合熵
数学
信息论
算法
人工智能
统计
二元熵函数
最大熵原理
物理
量子力学
最大熵热力学
作者
Haixin Huang,Zhaowen Li,Fang Liu,Ching‐Feng Wen
标识
DOI:10.1016/j.ins.2024.120334
摘要
Due to limitations in data acquisition, data in real life often contains a wealth of uncertain information. Uncertainty measurement (UM) constructed within the framework of rough set theory (RST) is an important tool for processing uncertain information. Some basic UMs in RST such as classification precision, rough membership degree, dependence degree, and attribute importance cannot accurately measure the uncertainty of a hybrid information system (HIS). For example, dependence degree only considers the information provided by the lower approximation of the decision and ignores the upper approximation, which may lead to some information loss. In addition to these basic UMs, some extended entropy-based UMs such as rough entropy, information entropy and conditional entropy are also frequently used to measure the uncertainty of a HIS. However, these three UMs also have their own drawbacks. For instance, rough entropy is sensitive to the distribution of hybrid data. When the distribution of hybrid data is uneven, the calculation results of rough entropy may be greatly affected, leading to a decrease in measurement accuracy. This paper proposes four new UMs in a HIS and provides an application in attribute reduction. First of all, a distance function is defined to deal with each type of attribute in a HIS and construct a tolerance relation. On this basis, four UMs are listed to measure the uncertainty of a HIS. Next, the strength and weakness of the proposed UMs are verified by statistical analysis. Subsequently, the UM with the best performance is selected to design an attribute reduction algorithm. Finally, the designed algorithm is compared with other five attribute reduction algorithms to show its superior performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI