特征选择
计算机科学
特征(语言学)
理论(学习稳定性)
集合(抽象数据类型)
数据集
选择(遗传算法)
数据挖掘
人工智能
简单(哲学)
机器学习
模式识别(心理学)
语言学
认识论
哲学
程序设计语言
作者
Luca Pestarino,Giovanni Fiorito,Silvia Polidoro,Paolo Víneis,Andrea Cavalli,Sergio Decherchi
标识
DOI:10.1109/ijcnn52387.2021.9533806
摘要
Feature selection is a prominent activity when dealing with classification/regression problems in biological and omics data. Despite the effort devoted to this issue theoretically, feature selection stability within and across methods is often overlooked in practice. This is a compelling issue because a unique or at least stable answer is needed in clinical scenarios. Here, we analyse in detail a multiomics small sample data set, the Oxford Street II data set, and discuss how existing methods perform in terms of the usual metrics but also in terms of intra and inter feature selection stability. To mitigate the observed instability, we propose a simple unsupervised feature prefiltering, achieving promising results.
科研通智能强力驱动
Strongly Powered by AbleSci AI