MNIST数据库
计算机科学
生成语法
人工智能
代表(政治)
对抗制
扩展(谓词逻辑)
相互信息
机器学习
生成对抗网络
面子(社会学概念)
变化(天文学)
模式识别(心理学)
深度学习
社会学
物理
政治
程序设计语言
法学
天体物理学
社会科学
政治学
作者
Xi Chen,Yan Duan,Rein Houthooft,John Schulman,Ilya Sutskever,Pieter Abbeel
出处
期刊:Cornell University - arXiv
日期:2016-06-12
被引量:810
摘要
This paper describes InfoGAN, an information-theoretic extension to the Generative Adversarial Network that is able to learn disentangled representations in a completely unsupervised manner. InfoGAN is a generative adversarial network that also maximizes the mutual information between a small subset of the latent variables and the observation. We derive a lower bound to the mutual information objective that can be optimized efficiently, and show that our training procedure can be interpreted as a variation of the Wake-Sleep algorithm. Specifically, InfoGAN successfully disentangles writing styles from digit shapes on the MNIST dataset, pose from lighting of 3D rendered images, and background digits from the central digit on the SVHN dataset. It also discovers visual concepts that include hair styles, presence/absence of eyeglasses, and emotions on the CelebA face dataset. Experiments show that InfoGAN learns interpretable representations that are competitive with representations learned by existing fully supervised methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI