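Adaptation (eye)
Object (grammar)
Computer science
Artificial intelligence
Computer vision
Geography
Cognitive psychology
Psychology
Neuroscience
Authors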
Shizhou Zhang,Dexuan Kong,Yinghui Xing,Yue Lu,Lingyan Ran,Guoqiang Liang,Hexu Wang,Yanning Zhang
Source
Journal: Cornell University - arXiv
Date: 2024-09-18
Citations: 1
Identifier
DOI: 10.48550/arxiv.2409.12421
Abstract
Camouflaged object detection (COD) aims to segment camouflaged objects, which exhibit patterns very similar to those of the surrounding environment. Recent research has shown that enhancing the feature representation with frequency information can greatly alleviate the ambiguity between foreground objects and the background. With the emergence of vision foundation models such as InternImage and the Segment Anything Model, adapting a pretrained model to COD tasks with a lightweight adapter module is a novel and promising research direction. Existing adapter modules mainly address feature adaptation in the spatial domain. In this paper, we propose a novel frequency-guided spatial adaptation method for the COD task. Specifically, we transform the input features of the adapter into the frequency domain. By grouping and interacting with frequency components located within non-overlapping circles in the spectrogram, different frequency components are dynamically enhanced or weakened, so that the intensity of image details and contour features is adaptively adjusted. At the same time, the features that help distinguish object from background are highlighted, indirectly implying the position and shape of the camouflaged object. We conduct extensive experiments on four widely adopted benchmark datasets, and the proposed method outperforms 26 state-of-the-art methods by large margins. Code will be released.
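The frequency grouping described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes the "non-overlapping circles" can be modeled as concentric rings around the zero-frequency component of a centered 2D spectrum, and it takes fixed per-band gains as input, whereas the paper adjusts the components dynamically. The function names `ring_masks` and `reweight_frequency_bands` and the parameter `band_gains` are hypothetical.

```python
import numpy as np


def ring_masks(height, width, num_bands):
    """Split a centered 2D spectrum into concentric, non-overlapping bands.

    Assumption: bands are equal-width rings measured from the zero-frequency
    bin, which sits at (height // 2, width // 2) after np.fft.fftshift.
    """
    ys = np.arange(height) - height // 2
    xs = np.arange(width) - width // 2
    radius = np.sqrt(ys[:, None] ** 2 + xs[None, :] ** 2)
    edges = np.linspace(0.0, radius.max(), num_bands + 1)
    # Comparing against the interior edges assigns every bin to exactly one band.
    band_index = np.digitize(radius, edges[1:-1])
    return [band_index == b for b in range(num_bands)]


def reweight_frequency_bands(feature, band_gains):
    """Scale each frequency band of a single 2D feature map by a fixed gain.

    Assumption: gains are given as plain numbers, ordered from the lowest
    band (index 0, containing the DC component) to the highest.
    """
    height, width = feature.shape
    masks = ring_masks(height, width, len(band_gains))
    # Move the zero-frequency component to the center of the spectrum.
    spectrum = np.fft.fftshift(np.fft.fft2(feature))
    gain_map = np.zeros((height, width))
    for mask, gain in zip(masks, band_gains):
        gain_map[mask] = gain
    adjusted = spectrum * gain_map
    # The gain map is radially symmetric, so the inverse transform is real
    # up to floating-point error; np.real discards that residue.
    return np.real(np.fft.ifft2(np.fft.ifftshift(adjusted)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feature = rng.standard_normal((32, 32))
    # Keep low frequencies, damp the middle band, boost the highest band.
    adjusted = reweight_frequency_bands(feature, [1.0, 0.5, 2.0])
    print(adjusted.shape)
```

Raising the gain of the outer bands strengthens fine detail and contours, while lowering it smooths them; setting every gain to 1 returns the input unchanged.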