马尔可夫毯
特征选择
计算机科学
马尔可夫链
算法
变量(数学)
可扩展性
集合(抽象数据类型)
机器学习
选择(遗传算法)
数据挖掘
人工智能
马尔可夫模型
数学
变阶马尔可夫模型
数学分析
数据库
程序设计语言
作者
Constantin F. Aliferis,Ioannis Tsamardinos,Alexander Statnikov
出处
期刊:PubMed
日期:2003-01-01
卷期号:: 21-5
被引量:138
摘要
We introduce a novel, sound, sample-efficient, and highly-scalable algorithm for variable selection for classification, regression and prediction called HITON. The algorithm works by inducing the Markov Blanket of the variable to be classified or predicted. A wide variety of biomedical tasks with different characteristics were used for an empirical evaluation. Namely, (i) bioactivity prediction for drug discovery, (ii) clinical diagnosis of arrhythmias, (iii) bibliographic text categorization, (iv) lung cancer diagnosis from gene expression array data, and (v) proteomics-based prostate cancer detection. State-of-the-art algorithms for each domain were selected for baseline comparison.(1) HITON reduces the number of variables in the prediction models by three orders of magnitude relative to the original variable set while improving or maintaining accuracy. (2) HITON outperforms the baseline algorithms by selecting more than two orders-of-magnitude smaller variable sets than the baselines, in the selected tasks and datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI