特征选择
生物标志物发现
计算机科学
判别式
特征(语言学)
选择(遗传算法)
人工智能
集成学习
生物标志物
模式识别(心理学)
代谢组学
机器学习
数据挖掘
生物信息学
蛋白质组学
生物
基因
哲学
生物化学
语言学
作者
Aliasghar Shahrjooihaghighi,Hichem Frigui,Xiang Zhang,Xiaoli Wei,Bo Shi,Craig J. McClain
标识
DOI:10.1145/3297280.3297283
摘要
Biomarker discovery, i.e., identifying the discriminative features that are responsible for alteration of a biological system, is often solved by feature selection implemented by machine learning approaches. While many individual feature selection methods are used in biomarker discovery, the nature of omics data (small number of samples, large number of features, and noisy data) makes each of those individual feature selection algorithms unstable. In this paper, we investigate various ensemble feature selection methods to improve the reliability of the molecular biomarker selection by combining the complementary information of multiple feature selection methods. We compare the performance of different ensemble approaches and evaluate their performances using a metabolomics dataset containing three sample groups. Our results indicate that our ensemble approach outperforms the individual feature selection algorithms and provides more stable results.
科研通智能强力驱动
Strongly Powered by AbleSci AI