贝叶斯信息准则
聚类分析
混合模型
选型
期望最大化算法
随机效应模型
正规化(语言学)
数学
样本量测定
高斯分布
统计
计算机科学
人工智能
最大似然
医学
荟萃分析
物理
量子力学
内科学
作者
Luoying Yang,Tong Tong Wu
出处
期刊:Biometrics
[Wiley]
日期:2022-04-28
卷期号:79 (2): 761-774
被引量:4
摘要
We propose a model-based clustering method for high-dimensional longitudinal data via regularization in this paper. This study was motivated by the Trial of Activity in Adolescent Girls (TAAG), which aimed to examine multilevel factors related to the change of physical activity by following up a cohort of 783 girls over 10 years from adolescence to early adulthood. Our goal is to identify the intrinsic grouping of subjects with similar patterns of physical activity trajectories and the most relevant predictors within each group. The previous analyses conducted clustering and variable selection in two steps, while our new method can perform the tasks simultaneously. Within each cluster, a linear mixed-effects model (LMM) is fitted with a doubly penalized likelihood to induce sparsity for parameter estimation and effect selection. The large-sample joint properties are established, allowing the dimensions of both fixed and random effects to increase at an exponential rate of the sample size, with a general class of penalty functions. Assuming subjects are drawn from a Gaussian mixture distribution, model effects and cluster labels are estimated via a coordinate descent algorithm nested inside the Expectation-Maximization (EM) algorithm. Bayesian Information Criterion (BIC) is used to determine the optimal number of clusters and the values of tuning parameters. Our numerical studies show that the new method has satisfactory performance and is able to accommodate complex data with multilevel and/or longitudinal effects.
科研通智能强力驱动
Strongly Powered by AbleSci AI