可解释性
插补(统计学)
缺少数据
计算机科学
人工智能
深度学习
概率逻辑
降维
时间序列
机器学习
高斯过程
多元统计
数据挖掘
维数之咒
模式识别(心理学)
高斯分布
量子力学
物理
作者
Vincent Fortuin,Dmitry Baranchuk,Gunnar Rätsch,Stephan Mandt
出处
期刊:Cornell University - arXiv
日期:2019-01-01
被引量:27
标识
DOI:10.48550/arxiv.1907.04155
摘要
Multivariate time series with missing values are common in areas such as healthcare and finance, and have grown in number and complexity over the years. This raises the question whether deep learning methodologies can outperform classical data imputation methods in this domain. However, naive applications of deep learning fall short in giving reliable confidence estimates and lack interpretability. We propose a new deep sequential latent variable model for dimensionality reduction and data imputation. Our modeling assumption is simple and interpretable: the high dimensional time series has a lower-dimensional representation which evolves smoothly in time according to a Gaussian process. The non-linear dimensionality reduction in the presence of missing data is achieved using a VAE approach with a novel structured variational approximation. We demonstrate that our approach outperforms several classical and deep learning-based data imputation methods on high-dimensional data from the domains of computer vision and healthcare, while additionally improving the smoothness of the imputations and providing interpretable uncertainty estimates.
科研通智能强力驱动
Strongly Powered by AbleSci AI