行人检测
计算机科学
多光谱图像
块(置换群论)
特征提取
人工智能
模式识别(心理学)
特征(语言学)
行人
光学(聚焦)
比例(比率)
计算机视觉
数学
工程类
地理
语言学
哲学
物理
几何学
光学
地图学
运输工程
作者
Ying Zang,Chenglong Fu,Runlong Cao,Ying Zang,Xiao‐Jun Wu,Shigen Shen,Xiao‐Zhi Gao
标识
DOI:10.1016/j.asoc.2023.110768
摘要
Multispectral images can provide more information, so pedestrian detection based on multispectral images has received wide attention. Existing multispectral networks mainly focus on the misalignment of image pairs and the difference between modalities. However, these network structures lack effective information interaction between two feature streams and fail to consider the scale characteristics of pedestrian objects. To deal with this issue, we propose a high-performance network structure, which is called dual-stream interaction and multi-scale feature extraction network (DSI-MSE), and contains a dual-stream feature interaction (DSI) block, a multi-scale feature extraction (MSE) block and a detection (DET) block. The DSI block extracts the features through the dual-stream interaction of RGB images and thermal images, which fuses the intra-modal information and the inter-modal information. The MSE block is designed by multiple parallel branches for matching multiple scales of pedestrian, which enhances the expressiveness of features and refines richer feature expressions at different scales. Experimental results on KAIST and CVC-14 datasets demonstrate that the proposed DSI-MSE can obtain the state-of-the-art results on multi-spectral pedestrian detection tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI