张量(固有定义)
计算机科学
秩(图论)
吉布斯抽样
可扩展性
贝叶斯概率
分解
乘法函数
二进制数
水准点(测量)
采样(信号处理)
塔克分解
理论计算机科学
算法
数据挖掘
人工智能
数学
张量分解
组合数学
数据库
滤波器(信号处理)
纯数学
地理
数学分析
算术
生物
计算机视觉
生态学
大地测量学
作者
Piyush Rai,Yingjian Wang,Shengbo Guo,Gary Chen,David B. Dunson,Lawrence Carin
摘要
We present a scalable Bayesian framework for low-rank decomposition of multiway tensor data with missing observations. The key issue of pre-specifying the rank of the decomposition is sidestepped in a principled manner using a multiplicative gamma process prior. Both continuous and binary data can be analyzed under the framework, in a coherent way using fully conjugate Bayesian analysis. In particular, the analysis in the non-conjugate binary case is facilitated via the use of the Polya-Gamma sampling strategy which elicits closed-form Gibbs sampling updates. The resulting samplers are efficient and enable us to apply our framework to large-scale problems, with time-complexity that is linear in the number of observed entries in the tensor. This is especially attractive in analyzing very large but sparsely observed tensors with very few known entries. Moreover, our method admits easy extension to the supervised setting where entities in one or more tensor modes have labels. Our method outperforms several state-of-the-art tensor decomposition methods on various synthetic and benchmark real-world datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI