子网
能见度
行人检测
计算机科学
检测阈值
行人
人工智能
任务(项目管理)
计算机视觉
模式识别(心理学)
算法
实时计算
工程类
计算机网络
物理
系统工程
运输工程
光学
作者
Yi Tang,Min Liu,Baopu Li,Yaonan Wang,Wanli Ouyang
出处
期刊:IEEE transactions on image processing
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:32: 3176-3187
被引量:8
标识
DOI:10.1109/tip.2023.3273853
摘要
Pedestrian detection is still a challenging task for computer vision, especially in crowded scenes where the overlaps between pedestrians tend to be large. The non-maximum suppression (NMS) plays an important role in removing the redundant false positive detection proposals while retaining the true positive detection proposals. However, the highly overlapped results may be suppressed if the threshold of NMS is lower. Meanwhile, a higher threshold of NMS will introduce a larger number of false positive results. To solve this problem, we propose an optimal threshold prediction (OTP) based NMS method that predicts a suitable threshold of NMS for each human instance. First, a visibility estimation module is designed to obtain the visibility ratio. Then, we propose a threshold prediction subnet to determine the optimal threshold of NMS automatically according to the visibility ratio and classification score. Finally, we re-formulate the objective function of the subnet and utilize the reward-guided gradient estimation algorithm to update the subnet. Comprehensive experiments on CrowdHuman and CityPersons show the superior performance of the proposed method in pedestrian detection, especially in crowded scenes.
科研通智能强力驱动
Strongly Powered by AbleSci AI