马尔可夫毯
特征选择
水准点(测量)
计算机科学
特征(语言学)
约束(计算机辅助设计)
集合(抽象数据类型)
条件独立性
独立性(概率论)
算法
贝叶斯网络
选择(遗传算法)
机器学习
人工智能
马尔可夫链
数据挖掘
马尔可夫模型
数学
变阶马尔可夫模型
语言学
哲学
统计
几何学
大地测量学
程序设计语言
地理
作者
Niantai Wang,Haoran Liu,Liyue Zhang,Yanbin Cai,Qianrui Shi
标识
DOI:10.1016/j.knosys.2023.111216
摘要
The Markov blanket (MB) represents a crucial concept in a Bayesian network (BN) and is theoretically the optimal solution to the feature selection problem. Methods based on conditional independence (CI) tests are prevalent for MB discovery. Currently, the main challenge is how to improve both the efficiency and effectiveness of this type of method. In this paper, we propose a novel divide-and-conquer discovery algorithm, loose-to-strict MB (LSMB), to discover MBs faster while maintaining high accuracy. LSMB first discovers the approximate parent–child (PC) and spouse sets of a target variable via the loose CI test strategy, a constraint for the condition set of CI tests. Then, by strict CI tests, i.e., without constraint for the size of the condition set, LSMB first removes non-MB nodes in the discovered approximate PC set and then categorizes and removes non-MB nodes in the discovered approximate spouse set. Finally, LSMB combines the discovered sets to obtain the MB of the target. Experiments on benchmark BN datasets show that LSMB can improve the efficiency of MB discovery while maintaining higher accuracy than the state-of-the-art MB discovery algorithms, and experiments on real-world datasets demonstrate the excellent performance of LSMB in feature selection.
科研通智能强力驱动
Strongly Powered by AbleSci AI