可解释性
选择(遗传算法)
特征选择
数据挖掘
光学(聚焦)
计算机科学
数学
回归
选型
线性回归
机器学习
人工智能
统计
物理
光学
作者
Mingyu Gao,Canhong Wen
标识
DOI:10.1080/00949655.2022.2029444
摘要
As a tool for producing meaningful and interpretable results, subset or variable selection has been well studied in modern statistics. However, most of the existing methods focus on the independent data and cannot directly extend to the network-linked data where samples are connected with each other. To this end, we propose a subset selection method in the linear regression model by incorporating the network information into the intercept term, which can achieve automatic subset selection and have good network structural interpretability simultaneously. Based on this, we develop an efficient algorithm to recover the true subset, as well as determine subgroups. Simulation studies demonstrate that the proposal outperforms the state-of-art methods in estimation and selection accuracy. We also apply the proposed method on data from the national longitudinal study of adolescent health and show the superiority of selecting variables alone a network by a smaller model size and more accurate prediction.
科研通智能强力驱动
Strongly Powered by AbleSci AI