计算机科学
人工智能
模式识别(心理学)
特征(语言学)
特征提取
班级(哲学)
比例(比率)
卷积(计算机科学)
相似性(几何)
匹配(统计)
对象(语法)
数据挖掘
图像(数学)
数学
人工神经网络
哲学
语言学
物理
统计
量子力学
作者
Yutian Wang,Bin Yang,Xi Wang,Chao Liang,Jun Chen
出处
期刊:Neural Networks
[Elsevier]
日期:2024-04-01
卷期号:172: 106126-106126
被引量:2
标识
DOI:10.1016/j.neunet.2024.106126
摘要
This paper studies the class-agnostic counting problem, which aims to count objects regardless of their class, and relies only on a limited number of exemplar objects. Existing methods usually extract visual features from query and exemplar images, compute similarity between them using convolution operations, and finally use this information to estimate object counts. However, these approaches often overlook the scale information of the exemplar objects, leading to lower counting accuracy for objects with multi-scale characteristics. Additionally, convolution operations are local linear matching processes that may result in a loss of semantic information, which can limit the performance of the counting algorithm. To address these issues, we devise a new scale-aware transformer-based feature fusion module that integrates visual and scale information of exemplar objects and models similarity between samples and queries using cross-attention. Finally, we propose an object counting algorithm based on a feature extraction backbone, a feature fusion module and a density map regression head, called SATCount. Our experiments on the FSC-147 and the CARPK demonstrate that our model outperforms the state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI