计算机科学
稳健性(进化)
行人检测
特征提取
滑动窗口协议
棱锥(几何)
行人
人工智能
探测器
特征(语言学)
模式识别(心理学)
目标检测
窗口(计算)
计算机视觉
数据挖掘
电信
物理
运输工程
工程类
操作系统
生物化学
化学
语言学
哲学
光学
基因
作者
Chi‐Yi Tsai,Run-Yu Wang,Yu‐Chen Chiu
标识
DOI:10.1016/j.neucom.2024.128357
摘要
Pedestrian detection is a critical research area in computer vision with practical applications. This paper addresses this key topic by providing a novel lightweight model named Shift Window-YOLOX (SW-YOLOX). The purpose of SW-YOLOX is to significantly enhance the robustness and real-time performance of pedestrian detection under practical application requirements. The proposed method incorporates a novel Shift Window-Mixed Attention Mechanism (SW-MAM), which combines spatial and channel attention for effective feature extraction. In addition, we introduce a novel up-sampling layer, PatchExpandingv2, to enhance spatial feature representation while maintaining computational efficiency. Furthermore, we propose a novel Shift Window-Path Aggregation Feature Pyramid Network (SW-PAFPN) to integrate with the YOLOX detector, further enhancing feature extraction and the robustness of pedestrian detection. Experimental results validated on challenging datasets such as CrowdHuman, MOT17Det, and MOT20Det demonstrate the competitive performance of the proposed SW-YOLOX compared to state-of-the-art methods and its pedestrian detection performance in crowded and complex scenes.
科研通智能强力驱动
Strongly Powered by AbleSci AI