Autoencoder
Hyperparameter
Computer science
Artificial intelligence
Machine learning
Representation (politics)
Heuristic
Unsupervised learning
Feature learning
Latent variable
Pattern recognition (psychology)
MNIST database
Deep learning
Political science
Politics
Law
Authors
Irina Higgins, Loïc Matthey, Arka Pal, Christopher Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, Alexander Lerchner
Source
Venue: International Conference on Learning Representations (ICLR)
Date: 2017-04-24
Citations: 2118
Abstract
Learning an interpretable factorised representation of the independent data generative factors of the world without supervision is an important precursor for the development of artificial intelligence that is able to learn and reason in the same way that humans do. We introduce beta-VAE, a new state-of-the-art framework for automated discovery of interpretable factorised latent representations from raw image data in a completely unsupervised manner. Our approach is a modification of the variational autoencoder (VAE) framework. We introduce an adjustable hyperparameter beta that balances latent channel capacity and independence constraints with reconstruction accuracy. We demonstrate that beta-VAE with appropriately tuned beta > 1 qualitatively outperforms VAE (beta = 1), as well as state-of-the-art unsupervised (InfoGAN) and semi-supervised (DC-IGN) approaches to disentangled factor learning on a variety of datasets (CelebA, faces and chairs). Furthermore, we devise a protocol to quantitatively compare the degree of disentanglement learnt by different models, and show that our approach also significantly outperforms all baselines quantitatively. Unlike InfoGAN, beta-VAE is stable to train, makes few assumptions about the data and relies on tuning a single hyperparameter, which can be directly optimised through a hyperparameter search using weakly labelled data or through heuristic visual inspection for purely unsupervised data.
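The abstract describes beta-VAE as a standard VAE whose KL term is scaled by an adjustable hyperparameter beta, trading reconstruction accuracy against the independence (disentanglement) pressure on the latent code. The sketch below is a minimal illustration of that objective, not the authors' implementation; the function name, the Bernoulli pixel likelihood, and the default beta value are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def beta_vae_loss(x, x_recon, mu, log_var, beta=4.0):
    """Minimal sketch of the beta-VAE objective (hypothetical helper, not the paper's code).

    Minimises  -E_q(z|x)[log p(x|z)] + beta * KL(q(z|x) || p(z)).
    beta = 1 recovers the standard VAE; beta > 1 strengthens the KL penalty,
    encouraging a more factorised latent code at some cost in reconstruction.
    """
    # Reconstruction term: Bernoulli likelihood over pixels (assumed data model).
    recon = F.binary_cross_entropy(x_recon, x, reduction="sum")
    # KL divergence between the diagonal Gaussian q(z|x) and the unit Gaussian prior,
    # in closed form from the encoder's mean and log-variance.
    kl = -0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp())
    return recon + beta * kl
```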