To Choose or to Fuse? Scale Selection for Crowd Counting

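Feature (linguistics), Computer science, Scale (ratio), Feature selection, Fuse (electrical), Artificial intelligence, Pixel, Pattern recognition (psychology), Selection (genetic algorithm), Pyramid (geometry), Range (aeronautics), Machine learning, Data mining, Mathematics, Engineering, Aerospace engineering, Philosophy, Physics, Electrical engineering, Quantum mechanics, Linguistics, Geometry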
Authors
Qingyu Song, Chang’an Wang, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Jian Wu, Jiayi Ma
Source
Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence [Association for the Advancement of Artificial Intelligence (AAAI)]
Volume (issue): 35 (3): 2576-2583 Cited by: 119
Identifier
DOI:10.1609/aaai.v35i3.16360
Abstract

In this paper, we address the large scale variation problem in crowd counting by taking full advantage of the multi-scale feature representations in a multi-level network. We implement this idea by keeping the counting error of a patch as small as possible with a proper feature level selection strategy, since a specific feature level tends to perform better for a certain range of scales. However, without scale annotations, it is sub-optimal and error-prone to manually assign the predictions for heads of different scales to specific feature levels. Therefore, we propose a Scale-Adaptive Selection Network (SASNet), which automatically learns the internal correspondence between the scales and the feature levels. Instead of directly using the predictions from the most appropriate feature level as the final estimation, our SASNet also considers the predictions from other feature levels via a weighted average, which helps to mitigate the gap between discrete feature levels and continuous scale variation. Since the heads in a local patch share roughly the same scale, we conduct the adaptive selection strategy in a patch-wise manner. However, pixels within a patch contribute different counting errors because they differ in learning difficulty. Thus, we further propose a Pyramid Region Awareness Loss (PRA Loss) to recursively select the hardest sub-regions within a patch until reaching the pixel level. With awareness of whether the parent patch is over-estimated or under-estimated, the fine-grained optimization with the PRA Loss for these region-aware hard pixels helps to alleviate the inconsistency problem between the training target and the evaluation metric. The state-of-the-art results on four datasets demonstrate the superiority of our approach. The code will be available at: https://github.com/TencentYoutuResearch/CrowdCounting-SASNet.
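
The two ideas in the abstract, patch-wise weighted fusion of per-level predictions and recursive selection of the hardest sub-region, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names (`fuse_patchwise`, `hardest_pixel`), the use of a softmax to turn per-patch scores into weights, the quadrant split, and the rule of following the parent's over/under-estimation sign are all assumptions made for illustration. Density maps are plain nested lists, and patch sizes are assumed to be powers of two.

```python
import math


def softmax(scores):
    """Turn raw per-level scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_patchwise(level_maps, patch_scores, patch):
    """Weighted average of per-level density maps, one weight set per patch.

    level_maps:   list of L density maps, each H x W (nested lists).
    patch_scores: patch_scores[py][px] holds L raw scores for that patch
                  (hypothetical output of a learned selection branch).
    patch:        side length of a square patch in pixels.
    """
    height, width = len(level_maps[0]), len(level_maps[0][0])
    fused = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            weights = softmax(patch_scores[y // patch][x // patch])
            fused[y][x] = sum(w * m[y][x] for w, m in zip(weights, level_maps))
    return fused


def region_error(pred, gt, y0, x0, size):
    """Signed counting error (predicted minus true count) of a square region."""
    return sum(
        pred[y][x] - gt[y][x]
        for y in range(y0, y0 + size)
        for x in range(x0, x0 + size)
    )


def hardest_pixel(pred, gt, y0, x0, size):
    """Descend from a patch to one pixel, always entering the quadrant whose
    error is largest in the direction of its parent's error (assumed rule)."""
    while size > 1:
        # +1 if the parent region is over-estimated, -1 if under-estimated.
        sign = 1.0 if region_error(pred, gt, y0, x0, size) >= 0 else -1.0
        half = size // 2
        quadrants = [(y0, x0), (y0, x0 + half), (y0 + half, x0), (y0 + half, x0 + half)]
        y0, x0 = max(
            quadrants,
            key=lambda q: sign * region_error(pred, gt, q[0], q[1], half),
        )
        size = half
    return y0, x0
```

In a training setup, the pixels returned by a procedure like `hardest_pixel` would receive extra weight in the loss; how the actual PRA Loss weights them is not specified in the abstract, so that step is left out here.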
