计算机科学
发电机(电路理论)
高保真
图像(数学)
弹丸
忠诚
特征(语言学)
人工智能
无线电频率
计算机视觉
模式识别(心理学)
声学
功率(物理)
电信
哲学
物理
有机化学
化学
量子力学
语言学
作者
Mengping Yang,Zhe Wang,Ziqiu Chi,Wei Feng
标识
DOI:10.1007/978-3-031-19784-0_1
摘要
Existing few-shot image generation approaches typically employ fusion-based strategies, either on the image or the feature level, to produce new images. However, previous approaches struggle to synthesize high-frequency signals with fine details, deteriorating the synthesis quality. To address this, we propose WaveGAN, a frequency-aware model for few-shot image generation. Concretely, we disentangle encoded features into multiple frequency components and perform low-frequency skip connections to preserve outline and structural information. Then we alleviate the generator’s struggles of synthesizing fine details by employing high-frequency skip connections, thus providing informative frequency information to the generator. Moreover, we utilize a frequency $$L_1$$ -loss on the generated and real images to further impede frequency information loss. Extensive experiments demonstrate the effectiveness and advancement of our method on three datasets. Noticeably, we achieve new state-of-the-art with FID 42.17, LPIPS 0.3868, FID 30.35, LPIPS 0.5076, and FID 4.96, LPIPS 0.3822 respectively on Flower, Animal Faces, and VGGFace. GitHub: https://github.com/kobeshegu/ECCV2022_WaveGAN .
科研通智能强力驱动
Strongly Powered by AbleSci AI