Computer science
Clipping (morphology)
Hyperparameter
Lipschitz continuity
Constraint (computer-aided design)
Generative grammar
Norm (philosophy)
Layer (electronics)
Mathematical optimization
Artificial intelligence
Algorithm
Mathematics
Linguistics
Mathematical analysis
Philosophy
Organic chemistry
Chemistry
Law
Political science
Geometry
Authors
Ishaan Gulrajani, Faruk Ahmed, Martín Arjovsky, Vincent Dumoulin, Aaron Courville
Abstract
Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN (WGAN) makes progress toward stable training of GANs, but sometimes can still generate only low-quality samples or fail to converge. We find that these problems are often due to the use of weight clipping in WGAN to enforce a Lipschitz constraint on the critic, which can lead to undesired behavior. We propose an alternative to clipping weights: penalize the norm of the gradient of the critic with respect to its input. Our proposed method performs better than standard WGAN and enables stable training of a wide variety of GAN architectures with almost no hyperparameter tuning, including 101-layer ResNets and language models over discrete data. We also achieve high quality generations on CIFAR-10 and LSUN bedrooms.
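The penalty the abstract describes can be written in a few lines. Below is a minimal PyTorch sketch, assuming a `critic` module and the commonly used two-sided penalty that drives the critic's gradient norm toward 1 on points interpolated between real and generated samples; the coefficient name `lambda_gp` and the interpolation scheme are illustrative assumptions, not quoted from the abstract.

```python
import torch

def gradient_penalty(critic, real, fake, lambda_gp=10.0):
    """Gradient penalty on the critic, sketched after the WGAN-GP idea."""
    # One uniform mixing coefficient per example, broadcast over remaining dims.
    eps = torch.rand(real.size(0), *([1] * (real.dim() - 1)), device=real.device)
    interp = (eps * real + (1.0 - eps) * fake).requires_grad_(True)
    scores = critic(interp)
    # Gradient of the critic's output with respect to its input.
    grads = torch.autograd.grad(
        outputs=scores,
        inputs=interp,
        grad_outputs=torch.ones_like(scores),
        create_graph=True,  # keep the graph so the penalty itself is differentiable
    )[0]
    # Per-example L2 norm of the input gradient.
    grad_norm = grads.flatten(start_dim=1).norm(2, dim=1)
    # Penalize deviation of the gradient norm from 1 (soft Lipschitz constraint).
    return lambda_gp * ((grad_norm - 1.0) ** 2).mean()
```

In training, this term would simply be added to the critic's Wasserstein loss before each critic update, replacing the weight clipping step of the original WGAN.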