典型相关
正规化(语言学)
相关性
多元统计
数学
计算
典型分析
计算机科学
算法
数据挖掘
人工智能
统计
几何学
作者
Elena Tuzhilina,Leonardo Tozzi,Trevor Hastie
标识
DOI:10.1177/1471082x211041033
摘要
Canonical correlation analysis (CCA) is a technique for measuring the association between two multivariate data matrices. A regularized modification of canonical correlation analysis (RCCA) which imposes an [Formula: see text] penalty on the CCA coefficients is widely used in applications with high-dimensional data. One limitation of such regularization is that it ignores any data structure, treating all the features equally, which can be ill-suited for some applications. In this article we introduce several approaches to regularizing CCA that take the underlying data structure into account. In particular, the proposed group regularized canonical correlation analysis (GRCCA) is useful when the variables are correlated in groups. We illustrate some computational strategies to avoid excessive computations with regularized CCA in high dimensions. We demonstrate the application of these methods in our motivating application from neuroscience, as well as in a small simulation example.
科研通智能强力驱动
Strongly Powered by AbleSci AI