素描
计算机科学
面子(社会学概念)
草图识别
人工智能
领域(数学)
失真(音乐)
质量(理念)
自然语言处理
人机交互
情报检索
语言学
算法
数学
手势识别
放大器
纯数学
哲学
手势
带宽(计算)
认识论
计算机网络
作者
Mengdi Dong,Chunlei Peng,Decheng Liu,Yu Zheng,Nannan Wang,Xinbo Gao
标识
DOI:10.1109/ijcb54206.2022.10007972
摘要
This paper proposes a method of modifying the face sketch with text descriptions. Face sketch is widely used in the criminal field and digital entertainment field. Forensic painters usually draw face sketches based on descriptions provided by witnesses or clients. However, drawing a face sketch often takes lots of time and effort. Existing face sketch synthesis studies have not considered text-based sketch manipulation, and we find that applying text-driven editing methods on natural images directly to face sketches causes severe distortion of generated results. Therefore, this paper proposes a novel text-based attribute manipulation method for face sketch synthesis, named SketchCLIP. Our approach adopts text-driven attribute manipulation by using the powerful Contrastive Language-Image Pre-Training (CLIP) model, which not only conforms to the current drawing process of face sketches but also does not require tedious manual operations and allows for more diverse modifications. Besides, we design an intra-modality fine-tuning module to eliminate distortion and improve the quality of the modified face sketch. Through extensive comparison experiments on public face sketch datasets, our method is demonstrated to be very excellent in the effectiveness of the face sketch processing and the quality of modified results.
科研通智能强力驱动
Strongly Powered by AbleSci AI