人工智能
目标检测
计算机科学
计算机视觉
可分离空间
对象(语法)
视觉对象识别的认知神经科学
模式识别(心理学)
Viola–Jones对象检测框架
人脸检测
数学
面部识别系统
数学分析
作者
Bailing Yin,Xuying Zhang,Deng-Ping Fan,Shaohui Jiao,Ming‐Ming Cheng,Luc Van Gool,Qibin Hou
标识
DOI:10.1109/tpami.2024.3438565
摘要
How to identify and segment camouflaged objects from the background is challenging. Inspired by the multi-head self-attention in Transformers, we present a simple masked separable attention (MSA) for camouflaged object detection. We first separate the multi-head self-attention into three parts, which are responsible for distinguishing the camouflaged objects from the background using different mask strategies. Furthermore, we propose to capture high-resolution semantic representations progressively based on a simple top-down decoder with the proposed MSA to attain precise segmentation results. These structures plus a backbone encoder form a new model, dubbed CamoFormer. Extensive experiments show that CamoFormer achieves new state-of-the-art performance on three widely-used camouflaged object detection benchmarks. To better evaluate the performance of the proposed CamoFormer around the border regions, we propose to use two new metrics, i.e. BR-M and BR-F. There are on average ∼ 5% relative improvements over previous methods in terms of S-measure and weighted F-measure. Our code is available at https://github.com/HVision-NKU/CamoFormer.
科研通智能强力驱动
Strongly Powered by AbleSci AI