特征选择
特征(语言学)
马尔可夫毯
计算机科学
可解释性
人工智能
机器学习
数据挖掘
选择(遗传算法)
流式数据
可预测性
模式识别(心理学)
数学
统计
马尔可夫链
马尔可夫模型
哲学
马尔可夫性质
语言学
作者
Dianlong You,Ruiqi Li,Shunpan Liang,Mucun Sun,Xinju Ou,F. Yuan,Limin Shen,Xindong Wu
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2023-03-01
卷期号:34 (3): 1563-1577
被引量:9
标识
DOI:10.1109/tnnls.2021.3105585
摘要
Recently, causal feature selection (CFS) has attracted considerable attention due to its outstanding interpretability and predictability performance. Such a method primarily includes the Markov blanket (MB) discovery and feature selection based on Granger causality. Representatively, the max–min MB (MMMB) can mine an optimal feature subset, i.e., MB; however, it is unsuitable for streaming features. Online streaming feature selection (OSFS) via online process streaming features can determine parents and children (PC), a subset of MB; however, it cannot mine the MB of the target attribute ( $T$ ), i.e., a given feature, thus resulting in insufficient prediction accuracy. The Granger selection method (GSM) establishes a causal matrix of all features by performing excessively time; however, it cannot achieve a high prediction accuracy and only forecasts fixed multivariate time series data. To address these issues, we proposed an online CFS for streaming features (OCFSSFs) that mine MB containing PC and spouse and adopt the interleaving PC and spouse learning method. Furthermore, it distinguishes between PC and spouse in real time and can identify children with parents online when identifying spouses. We experimentally evaluated the proposed algorithm on synthetic datasets using precision, recall, and distance. In addition, the algorithm was tested on real-world and time series datasets using classification precision, the number of selected features, and running time. The results validated the effectiveness of the proposed algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI