计算机科学
人工智能
对抗制
比例(比率)
透视图(图形)
机器学习
人机交互
计算机视觉
量子力学
物理
作者
Hyojin Bahng,Ali Jahanian,Swami Sankaranarayanan,Phillip Isola
出处
期刊:Cornell University - arXiv
日期:2022-01-01
被引量:20
标识
DOI:10.48550/arxiv.2203.17274
摘要
We investigate the efficacy of visual prompting to adapt large-scale models in vision. Following the recent approach from prompt tuning and adversarial reprogramming, we learn a single image perturbation such that a frozen model prompted with this perturbation performs a new task. Through comprehensive experiments, we demonstrate that visual prompting is particularly effective for CLIP and robust to distribution shift, achieving performance competitive with standard linear probes. We further analyze properties of the downstream dataset, prompt design, and output transformation in regard to adaptation performance. The surprising effectiveness of visual prompting provides a new perspective on adapting pre-trained models in vision. Code is available at http://hjbahng.github.io/visual_prompting .
科研通智能强力驱动
Strongly Powered by AbleSci AI