Oblique and rotation double random forest

随机森林 斜格 数学 多元随机变量 决策树 计算机科学 人工智能 模式识别(心理学) 算法 随机变量 统计 哲学 语言学
作者
M. A. Ganaie,M. Tanveer,Ponnuthurai Nagaratnam Suganthan,Václav Snåšel
出处
期刊:Neural Networks [Elsevier]
卷期号:153: 496-517 被引量:33
标识
DOI:10.1016/j.neunet.2022.06.012
摘要

Random Forest is an ensemble of decision trees based on the bagging and random subspace concepts. As suggested by Breiman, the strength of unstable learners and the diversity among them are the ensemble models' core strength. In this paper, we propose two approaches known as oblique and rotation double random forests. In the first approach, we propose rotation based double random forest. In rotation based double random forests, transformation or rotation of the feature space is generated at each node. At each node different random feature subspace is chosen for evaluation, hence the transformation at each node is different. Different transformations result in better diversity among the base learners and hence, better generalization performance. With the double random forest as base learner, the data at each node is transformed via two different transformations namely, principal component analysis and linear discriminant analysis. In the second approach, we propose oblique double random forest. Decision trees in random forest and double random forest are univariate, and this results in the generation of axis parallel split which fails to capture the geometric structure of the data. Also, the standard random forest may not grow sufficiently large decision trees resulting in suboptimal performance. To capture the geometric properties and to grow the decision trees of sufficient depth, we propose oblique double random forest. The oblique double random forest models are multivariate decision trees. At each non-leaf node, multisurface proximal support vector machine generates the optimal plane for better generalization performance. Also, different regularization techniques (Tikhonov regularization, axis-parallel split regularization, Null space regularization) are employed for tackling the small sample size problems in the decision trees of oblique double random forest. The proposed ensembles of decision trees produce trees with bigger size compared to the standard ensembles of decision trees as bagging is used at each non-leaf node which results in improved performance. The evaluation of the baseline models and the proposed oblique and rotation double random forest models is performed on benchmark 121 UCI datasets and real-world fisheries datasets. Both statistical analysis and the experimental results demonstrate the efficacy of the proposed oblique and rotation double random forest models compared to the baseline models on the benchmark datasets.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
无我完成签到,获得积分10
刚刚
啦啦啦啦发布了新的文献求助10
1秒前
1秒前
1秒前
1秒前
1秒前
1秒前
长青发布了新的文献求助10
1秒前
why完成签到,获得积分10
1秒前
柳叶笺发布了新的文献求助10
2秒前
2秒前
2秒前
进击的然发布了新的文献求助30
2秒前
无花果应助所谓采纳,获得10
3秒前
3秒前
Doss发布了新的文献求助30
3秒前
六百六十六完成签到,获得积分10
3秒前
science发布了新的文献求助10
4秒前
4秒前
4秒前
科研通AI6.1应助yu采纳,获得10
5秒前
5秒前
5秒前
agont完成签到,获得积分10
5秒前
Owen应助咕噜采纳,获得10
5秒前
唠叨的富完成签到,获得积分10
6秒前
追风应助长野采纳,获得10
6秒前
温暖元容发布了新的文献求助10
6秒前
Waley完成签到 ,获得积分10
7秒前
万能图书馆应助LeKuai采纳,获得10
7秒前
小郭发布了新的文献求助10
7秒前
tskylarium发布了新的文献求助10
7秒前
ZHR完成签到 ,获得积分10
7秒前
Gary发布了新的文献求助10
8秒前
KY发布了新的文献求助10
8秒前
8秒前
Garcia发布了新的文献求助10
8秒前
Lishuhuiii发布了新的文献求助30
9秒前
9秒前
乔杰发布了新的文献求助10
9秒前
高分求助中
Modern Epidemiology, Fourth Edition 5000
Kinesiophobia : a new view of chronic pain behavior 5000
Molecular Biology of Cancer: Mechanisms, Targets, and Therapeutics 3000
Digital Twins of Advanced Materials Processing 2000
Propeller Design 2000
Weaponeering, Fourth Edition – Two Volume SET 2000
Handbook of pharmaceutical excipients, Ninth edition 1500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 化学工程 生物化学 物理 计算机科学 内科学 复合材料 催化作用 物理化学 光电子学 电极 冶金 细胞生物学 基因
热门帖子
关注 科研通微信公众号,转发送积分 6010713
求助须知:如何正确求助?哪些是违规求助? 7556949
关于积分的说明 16134672
捐赠科研通 5157432
什么是DOI,文献DOI怎么找? 2762388
邀请新用户注册赠送积分活动 1740990
关于科研通互助平台的介绍 1633476