计算机科学
行人检测
多光谱图像
模态(人机交互)
模式
人工智能
过程(计算)
保险丝(电气)
计算机视觉
行人
特征(语言学)
编码(集合论)
模式识别(心理学)
哲学
工程类
社会学
电气工程
操作系统
集合(抽象数据类型)
程序设计语言
语言学
社会科学
运输工程
作者
Kailai Zhou,Linsen Chen,Xun Cao
标识
DOI:10.1007/978-3-030-58523-5_46
摘要
Multispectral pedestrian detection is capable of adapting to insufficient illumination conditions by leveraging color-thermal modalities. On the other hand, it is still lacking of in-depth insights on how to fuse the two modalities effectively. Compared with traditional pedestrian detection, we find multispectral pedestrian detection suffers from modality imbalance problems which will hinder the optimization process of dual-modality network and depress the performance of detector. Inspired by this observation, we propose Modality Balance Network (MBNet) which facilitates the optimization process in a much more flexible and balanced manner. Firstly, we design a novel Differential Modality Aware Fusion (DMAF) module to make the two modalities complement each other. Secondly, an illumination aware feature alignment module selects complementary features according to the illumination conditions and aligns the two modality features adaptively. Extensive experimental results demonstrate MBNet outperforms the state-of-the-arts on both the challenging KAIST and CVC-14 multispectral pedestrian datasets in terms of the accuracy and the computational efficiency. Code is available at https://github.com/CalayZhou/MBNet .
科研通智能强力驱动
Strongly Powered by AbleSci AI