计算机科学
发电机(电路理论)
人工智能
对象(语法)
多样性(控制论)
语义学(计算机科学)
分割
图像(数学)
对抗制
模式识别(心理学)
计算机视觉
分辨率(逻辑)
鉴别器
生成语法
程序设计语言
电信
探测器
物理
量子力学
功率(物理)
作者
Ting-Chun Wang,Mingyu Li,Jun‐Yan Zhu,Andrew Tao,Jan Kautz,Bryan Catanzaro
出处
期刊:Computer Vision and Pattern Recognition
日期:2018-06-01
被引量:3249
标识
DOI:10.1109/cvpr.2018.00917
摘要
We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but the results are often limited to low-resolution and still far from realistic. In this work, we generate 2048 × 1024 visually appealing results with a novel adversarial loss, as well as new multi-scale generator and discriminator architectures. Furthermore, we extend our framework to interactive visual manipulation with two additional features. First, we incorporate object instance segmentation information, which enables object manipulations such as removing/adding objects and changing the object category. Second, we propose a method to generate diverse results given the same input, allowing users to edit the object appearance interactively. Human opinion studies demonstrate that our method significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.
科研通智能强力驱动
Strongly Powered by AbleSci AI