单变量
随机森林
计算机科学
分类
背景(考古学)
多元统计
纵向数据
实施
数据挖掘
机器学习
软件
数据科学
人工智能
软件工程
古生物学
程序设计语言
生物
作者
Jianchang Hu,Silke Szymczak
摘要
In longitudinal studies variables are measured repeatedly over time, leading to clustered and correlated observations. If the goal of the study is to develop prediction models, machine learning approaches such as the powerful random forest (RF) are often promising alternatives to standard statistical methods, especially in the context of high-dimensional data. In this paper, we review extensions of the standard RF method for the purpose of longitudinal data analysis. Extension methods are categorized according to the data structures for which they are designed. We consider both univariate and multivariate response longitudinal data and further categorize the repeated measurements according to whether the time effect is relevant. Even though most extensions are proposed for low-dimensional data, some can be applied to high-dimensional data. Information of available software implementations of the reviewed extensions is also given. We conclude with discussions on the limitations of our review and some future research directions.
科研通智能强力驱动
Strongly Powered by AbleSci AI