适配器(计算)
分割
计算机科学
人工智能
计算机视觉
影子(心理学)
任务(项目管理)
目标检测
图像分割
计算机硬件
工程类
心理学
系统工程
心理治疗师
作者
Tianrun Chen,Lanyun Zhu,Chaotao Ding,Runlong Cao,Yan Wang,Zejian Li,Lingyun Sun,Papa Mao,Ying Zang
出处
期刊:Cornell University - arXiv
日期:2023-01-01
被引量:23
标识
DOI:10.48550/arxiv.2304.09148
摘要
The emergence of large models, also known as foundation models, has brought significant advancements to AI research. One such model is Segment Anything (SAM), which is designed for image segmentation tasks. However, as with other foundation models, our experimental findings suggest that SAM may fail or perform poorly in certain segmentation tasks, such as shadow detection and camouflaged object detection (concealed object detection). This study first paves the way for applying the large pre-trained image segmentation model SAM to these downstream tasks, even in situations where SAM performs poorly. Rather than fine-tuning the SAM network, we propose \textbf{SAM-Adapter}, which incorporates domain-specific information or visual prompts into the segmentation network by using simple yet effective adapters. By integrating task-specific knowledge with general knowledge learnt by the large model, SAM-Adapter can significantly elevate the performance of SAM in challenging tasks as shown in extensive experiments. We can even outperform task-specific network models and achieve state-of-the-art performance in the task we tested: camouflaged object detection, shadow detection. We also tested polyp segmentation (medical image segmentation) and achieves better results. We believe our work opens up opportunities for utilizing SAM in downstream tasks, with potential applications in various fields, including medical image processing, agriculture, remote sensing, and more.
科研通智能强力驱动
Strongly Powered by AbleSci AI