To Choose or to Fuse? Scale Selection for Crowd Counting

特征(语言学) 计算机科学 比例(比率) 特征选择 保险丝(电气) 人工智能 像素 模式识别(心理学) 选择(遗传算法) 棱锥(几何) 航程(航空) 机器学习 数据挖掘 数学 工程类 航空航天工程 哲学 物理 电气工程 量子力学 语言学 几何学
作者
Qingyu Song,Chang’an Wang,Yabiao Wang,Ying Tai,Chengjie Wang,Jilin Li,Jian Wu,Jiayi Ma
出处
期刊:Proceedings of the ... AAAI Conference on Artificial Intelligence [Association for the Advancement of Artificial Intelligence (AAAI)]
卷期号:35 (3): 2576-2583 被引量:119
标识
DOI:10.1609/aaai.v35i3.16360
摘要

In this paper, we address the large scale variation problem in crowd counting by taking full advantage of the multi-scale feature representations in a multi-level network. We implement such an idea by keeping the counting error of a patch as small as possible with a proper feature level selection strategy, since a specific feature level tends to perform better for a certain range of scales. However, without scale annotations, it is sub-optimal and error-prone to manually assign the predictions for heads of different scales to specific feature levels. Therefore, we propose a Scale-Adaptive Selection Network (SASNet), which automatically learns the internal correspondence between the scales and the feature levels. Instead of directly using the predictions from the most appropriate feature level as the final estimation, our SASNet also considers the predictions from other feature levels via weighted average, which helps to mitigate the gap between discrete feature levels and continuous scale variation. Since the heads in a local patch share roughly a same scale, we conduct the adaptive selection strategy in a patch-wise style. However, pixels within a patch contribute different counting errors due to the various difficulty degrees of learning. Thus, we further propose a Pyramid Region Awareness Loss (PRA Loss) to recursively select the most hard sub-regions within a patch until reaching the pixel level. With awareness of whether the parent patch is over-estimated or under-estimated, the fine-grained optimization with the PRA Loss for these region-aware hard pixels helps to alleviate the inconsistency problem between training target and evaluation metric. The state-of-the-art results on four datasets demonstrate the superiority of our approach. The code will be available at: https://github.com/TencentYoutuResearch/CrowdCounting-SASNet.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
kobe完成签到,获得积分10
1秒前
茵陈完成签到,获得积分10
1秒前
re88完成签到,获得积分10
1秒前
充电宝应助太Crazy辣采纳,获得10
1秒前
1秒前
2秒前
852应助威武卿采纳,获得30
2秒前
安年完成签到,获得积分10
2秒前
谦秋宛代完成签到 ,获得积分10
2秒前
lumos发布了新的文献求助10
3秒前
perovskite发布了新的文献求助10
3秒前
慕青应助清浅时光采纳,获得10
3秒前
立青关注了科研通微信公众号
3秒前
4秒前
Boston发布了新的文献求助10
4秒前
於傲松应助唐唐采纳,获得10
4秒前
5秒前
5秒前
5秒前
orixero应助周颖采纳,获得10
5秒前
复杂冬菱完成签到 ,获得积分10
5秒前
Hi发布了新的文献求助10
6秒前
dew应助东方豁采纳,获得10
6秒前
星辰大海应助hmx采纳,获得10
6秒前
zaochuan发布了新的文献求助10
6秒前
852应助Achhz采纳,获得10
7秒前
re88发布了新的文献求助10
7秒前
田田田完成签到,获得积分10
7秒前
JamesPei应助愤怒的青旋采纳,获得10
7秒前
7秒前
ooo完成签到,获得积分10
8秒前
2微恙发布了新的文献求助10
8秒前
可爱的函函应助lim采纳,获得10
8秒前
Lucas应助dspan采纳,获得10
9秒前
yan发布了新的文献求助10
9秒前
SciGPT应助淡淡无色采纳,获得10
9秒前
安年发布了新的文献求助10
9秒前
10秒前
烟花应助董晓萱采纳,获得10
10秒前
HU发布了新的文献求助10
10秒前
高分求助中
The Wiley Blackwell Companion to Diachronic and Historical Linguistics 3000
HANDBOOK OF CHEMISTRY AND PHYSICS 106th edition 1000
ASPEN Adult Nutrition Support Core Curriculum, Fourth Edition 1000
Decentring Leadership 800
Signals, Systems, and Signal Processing 610
脑电大模型与情感脑机接口研究--郑伟龙 500
Genera Orchidacearum Volume 4: Epidendroideae, Part 1 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6287873
求助须知:如何正确求助?哪些是违规求助? 8106534
关于积分的说明 16956922
捐赠科研通 5352809
什么是DOI,文献DOI怎么找? 2844590
邀请新用户注册赠送积分活动 1821778
关于科研通互助平台的介绍 1678050