素描
计算机科学
草图识别
一致性(知识库)
生成语法
人工智能
图像翻译
任务(项目管理)
翻译(生物学)
领域(数学分析)
机器学习
自然语言处理
图像(数学)
算法
手势识别
数学分析
生物化学
手势
化学
经济
基因
数学
管理
信使核糖核酸
作者
Diego Donoso,José M. Saavedra
摘要
Sketch-based understanding is involved in human communication and cognitive development, making it essential in visual perception. A specific task in this domain is sketch-to-photo translation, where a model produces realistic images from simple drawings. To this end, large paired training datasets are commonly required, which is impractical in real applications. Thus, this work studies conditional generative models for sketch-to-photo translation, overcoming the lack of training datasets by a self-supervised approach that produces sketch-photo pairs from a target catalog. Our study shows the benefit of cycle-consistency loss and UNet architectures that, together with the proposed dataset generation, improve performance in real applications like eCommerce. Our results also reveal the weakness of conditional DDPMs for generating images resembling the input sketch, even though they achieve a high FID score.
科研通智能强力驱动
Strongly Powered by AbleSci AI