动漫
计算机科学
计算机图形学(图像)
图像(数学)
计算机视觉
人工智能
图像编辑
情报检索
作者
Zhenghao Song,Haoran Mo,Chengying Gao
摘要
Abstract Editing anime images via probabilities of attribute tags allows controlling the degree of the manipulation in an intuitive and convenient manner. Existing methods fall short in the progressive modification and preservation of unintended regions in the input image. We propose a controllable anime image editing framework based on adjusting the tag probabilities, in which a probability encoding network (PEN) is developed to encode the probabilities into features that capture continuous characteristic of the probabilities. Thus, the encoded features are able to direct the generative process of a pre‐trained diffusion model and facilitate the linear manipulation. We also introduce a local editing module that automatically identifies the intended regions and constrains the edits to be applied to those regions only, which preserves the others unchanged. Comprehensive comparisons with existing methods indicate the effectiveness of our framework in both one‐shot and linear editing modes. Results in additional applications further demonstrate the generalization ability of our approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI