计算机科学
人工智能
分割
编码(集合论)
透视图(图形)
模式识别(心理学)
条件随机场
光学(聚焦)
网(多面体)
卷积神经网络
相似性(几何)
图像(数学)
计算机视觉
数学
几何学
物理
集合(抽象数据类型)
光学
程序设计语言
作者
Junyu Gao,Qi Wang,Xuelong Li
标识
DOI:10.1109/tcsvt.2019.2919139
摘要
Crowd counting from a single image is a challenging task due to high appearance similarity, perspective changes, and severe congestion. Many methods only focus on the local appearance features and they cannot handle the aforementioned challenges. In order to tackle them, we propose a perspective crowd counting network (PCC Net), which consists of three parts: 1) density map estimation (DME) focuses on learning very local features of density map estimation; 2) random high-level density classification (R-HDC) extracts global features to predict the coarse density labels of random patches in images; and 3) fore-/background segmentation (FBS) encodes mid-level features to segments the foreground and background. Besides, the Down, Up, Left, and Right (DULR) module is embedded in PCC Net to encode the perspective changes on four directions (DULR). The proposed PCC Net is verified on five mainstream datasets, which achieves the state-of-the-art performance on the one and attains the competitive results on the other four datasets. The source code is available at https://github.com/gjy3035/PCC-Net.
科研通智能强力驱动
Strongly Powered by AbleSci AI