计算机科学
人工智能
棱锥(几何)
图像(数学)
语义相似性
模式识别(心理学)
合成
集合(抽象数据类型)
构造(python库)
自然语言处理
情报检索
数学
几何学
程序设计语言
作者
Assaf Shocher,Yossi Gandelsman,Inbar Mosseri,Michal Yarom,Michal Irani,William T. Freeman,Tali Dekel
出处
期刊:Cornell University - arXiv
日期:2020-01-01
标识
DOI:10.48550/arxiv.2003.06221
摘要
We present a novel GAN-based model that utilizes the space of deep features learned by a pre-trained classification model. Inspired by classical image pyramid representations, we construct our model as a Semantic Generation Pyramid -- a hierarchical framework which leverages the continuum of semantic information encapsulated in such deep features; this ranges from low level information contained in fine features to high level, semantic information contained in deeper features. More specifically, given a set of features extracted from a reference image, our model generates diverse image samples, each with matching features at each semantic level of the classification model. We demonstrate that our model results in a versatile and flexible framework that can be used in various classic and novel image generation tasks. These include: generating images with a controllable extent of semantic similarity to a reference image, and different manipulation tasks such as semantically-controlled inpainting and compositing; all achieved with the same model, with no further training.
科研通智能强力驱动
Strongly Powered by AbleSci AI