Lasso(编程语言)
理论(学习稳定性)
校准
选择(遗传算法)
弹性网正则化
计算机科学
块(置换群论)
图形模型
回归
数据挖掘
特征选择
机器学习
人工智能
数学
统计
几何学
万维网
作者
Barbara Bodinier,Sarah Filippi,Therese Haugdahl Nøst,Julien Chiquet,Marc Chadeau‐Hyam
标识
DOI:10.1093/jrsssc/qlad058
摘要
Abstract Stability selection represents an attractive approach to identify sparse sets of features jointly associated with an outcome in high-dimensional contexts. We introduce an automated calibration procedure via maximisation of an in-house stability score and accommodating a priori-known block structure (e.g. multi-OMIC) data. It applies to [Least Absolute Shrinkage Selection Operator (LASSO)] penalised regression and graphical models. Simulations show our approach outperforms non-stability-based and stability selection approaches using the original calibration. Application to multi-block graphical LASSO on real (epigenetic and transcriptomic) data from the Norwegian Women and Cancer study reveals a central/credible and novel cross-OMIC role of LRRN3 in the biological response to smoking. Proposed approaches were implemented in the R package sharp.
科研通智能强力驱动
Strongly Powered by AbleSci AI