Discriminator
Computer science
Generator (circuit theory)
Regression
Categorical variable
Benchmark (surveying)
Artificial intelligence
Scalar (mathematics)
Pattern recognition (psychology)
Algorithm
Machine learning
Mathematics
Statistics
Power (physics)
Telecommunications
Physics
Geometry
Geodesy
Quantum mechanics
Detector
Geography
Authors
Xin Ding,Yongwei Wang,Zuheng Xu,William J. Welch,Z. Jane Wang
Identifier
DOI:10.1109/tpami.2022.3228915
Abstract
This article focuses on conditional generative modeling (CGM) for image data with continuous, scalar conditions (termed regression labels). We propose the first model for this task, called the continuous conditional generative adversarial network (CcGAN). Existing conditional GANs (cGANs) are mainly designed for categorical conditions (e.g., class labels). Conditioning on regression labels is mathematically distinct and raises two fundamental problems: (P1) since there may be very few (even zero) real images for some regression labels, minimizing existing empirical versions of cGAN losses (a.k.a. empirical cGAN losses) often fails in practice; and (P2) since regression labels are scalar and infinitely many, conventional label input mechanisms (e.g., combining a hidden map of the generator/discriminator with a one-hot encoded label) are not applicable. We solve these problems by (S1) reformulating existing empirical cGAN losses to suit the continuous scenario, and (S2) proposing a naive label input (NLI) mechanism and an improved label input (ILI) mechanism to incorporate regression labels into the generator and the discriminator. The reformulation in (S1) leads to two novel empirical discriminator losses, termed the hard vicinal discriminator loss (HVDL) and the soft vicinal discriminator loss (SVDL) respectively, and a novel empirical generator loss. Hence, we propose four versions of CcGAN employing the different proposed losses and label input mechanisms. Error bounds for the discriminator trained with HVDL and SVDL, respectively, are derived under mild assumptions. To evaluate the performance of CcGANs, two new benchmark datasets (RC-49 and Cell-200) are created. A novel evaluation metric, the Sliding Fréchet Inception Distance, is also proposed to replace Intra-FID when Intra-FID is not applicable.
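The vicinal idea behind HVDL and SVDL can be illustrated with a small sketch. This is not the authors' exact formulation — the function names and the bandwidth parameters `kappa` and `nu` are illustrative — but it shows the core contrast: a hard vicinity keeps only real samples whose labels fall within a window around the target label, while a soft vicinity keeps every sample and down-weights it smoothly by label distance.

```python
import numpy as np

def hard_vicinal_weights(labels, y, kappa):
    """Hard vicinity: samples with |label - y| <= kappa get weight 1,
    all others weight 0 (hypothetical parameterization)."""
    labels = np.asarray(labels, dtype=float)
    return (np.abs(labels - y) <= kappa).astype(float)

def soft_vicinal_weights(labels, y, nu):
    """Soft vicinity: every sample contributes, down-weighted by a
    Gaussian-like kernel exp(-nu * (label - y)^2)."""
    labels = np.asarray(labels, dtype=float)
    return np.exp(-nu * (labels - y) ** 2)

labels = [0.0, 0.45, 0.5, 0.69, 1.0]
print(hard_vicinal_weights(labels, y=0.5, kappa=0.2))  # [0. 1. 1. 1. 0.]
print(soft_vicinal_weights(labels, y=0.5, nu=10.0))
```

Either weighting lets the discriminator loss at a target label borrow real images with nearby labels, which addresses (P1): even a label with zero exact real samples still has a non-empty (hard) or fully populated (soft) vicinity.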
Our extensive experiments on several benchmark datasets (i.e., RC-49, UTKFace, Cell-200, and Steering Angle with both low and high resolutions) support the following findings: the proposed CcGAN is able to generate diverse, high-quality samples from the image distribution conditional on a given regression label; and CcGAN substantially outperforms cGAN both visually and quantitatively.
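The label input problem (P2) can likewise be sketched: since a one-hot encoding of infinitely many scalar labels is impossible, a regression label must instead be mapped to a dense embedding and injected into a hidden representation. The snippet below is a minimal numpy illustration of that flavor of mechanism, not the paper's exact NLI/ILI architecture; the tiny MLP shapes and the additive channel-wise injection are assumptions for demonstration.

```python
import numpy as np

def embed_label(y, w1, w2):
    """Map a scalar regression label to a dense embedding via a
    tiny MLP (hypothetical shapes: w1 is (hidden,), w2 is
    (channels, hidden))."""
    h = np.maximum(0.0, y * w1)   # ReLU hidden layer, shape (hidden,)
    return w2 @ h                 # embedding, shape (channels,)

def condition_hidden_map(feat, y, w1, w2):
    """Inject the label by adding its embedding channel-wise to a
    hidden feature map of shape (channels, H, W)."""
    e = embed_label(y, w1, w2)
    return feat + e[:, None, None]

rng = np.random.default_rng(0)
w1 = rng.normal(size=8)
w2 = rng.normal(size=(4, 8))
feat = rng.normal(size=(4, 2, 2))
print(condition_hidden_map(feat, 0.7, w1, w2).shape)  # (4, 2, 2)
```

Because the embedding is a continuous function of `y`, nearby labels produce nearby conditioning signals, which is exactly what a one-hot scheme cannot provide for scalar labels.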