Multispectral Object Detection via Cross-Modal Conflict-Aware Learning

计算机科学 情态动词 多光谱图像 人工智能 语义学(计算机科学) 目标检测 机器学习 模式识别(心理学) 化学 高分子化学 程序设计语言
作者
Xiao He,Chang Tang,Xin Zou,Wei Zhang
标识
DOI:10.1145/3581783.3612651
摘要

Multispectral object detection has gained significant attention due to its potential in all-weather applications, particularly those involving visible (RGB) and infrared (IR) images. Despite substantial advancements in this domain, current methodologies primarily rely on rudimentary accumulation operations to combine complementary information from disparate modalities, overlooking the semantic conflicts that arise from the intrinsic heterogeneity among modalities. To address this issue, we propose a novel learning network, the Cross-modal Conflict-Aware Learning Network (CALNet), that takes into account semantic conflicts and complementary information within multi-modal input. Our network comprises two pivotal modules: the Cross-Modal Conflict Rectification Module (CCR) and the Selected Cross-modal Fusion (SCF) Module. The CCR module mitigates modal heterogeneity by examining contextual information of analogous pixels, thus alleviating multi-modal information with semantic conflicts. Subsequently, semantically coherent information is supplied to the SCF module, which fuses multi-modal features by assessing intra-modal importance to select semantically rich features and mining inter-modal complementary information. To assess the effectiveness of our proposed method, we develop a two-stream one-stage detector based on CALNet for multispectral object detection. Comprehensive experimental outcomes demonstrate that our approach considerably outperforms existing methods in resolving the cross-modal semantic conflict issue and achieving state-of-the-art accuracy in detection results.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
方俊驰发布了新的文献求助10
3秒前
4秒前
6秒前
英姑应助初夏采纳,获得10
10秒前
陈时懿发布了新的文献求助10
10秒前
清新的哈密瓜完成签到 ,获得积分10
12秒前
13秒前
吾身无拘完成签到,获得积分10
14秒前
15秒前
科研通AI2S应助认真的rain采纳,获得10
17秒前
田様应助morning采纳,获得10
18秒前
迷路以筠发布了新的文献求助10
18秒前
娃哈哈完成签到,获得积分10
20秒前
852应助糊涂的青烟采纳,获得10
22秒前
22秒前
小蘑菇应助Charail采纳,获得10
26秒前
深情安青应助c123采纳,获得10
26秒前
糖伯虎完成签到 ,获得积分10
28秒前
chengzi完成签到,获得积分10
30秒前
yiyi发布了新的文献求助10
31秒前
32秒前
33秒前
34秒前
小二郎应助柳成荫采纳,获得10
35秒前
luoyatu发布了新的文献求助10
38秒前
万能图书馆应助狗屁大侠采纳,获得10
39秒前
40秒前
蔺天宇完成签到,获得积分10
42秒前
认真的rain完成签到,获得积分10
44秒前
44秒前
44秒前
wanci应助明德zhuang采纳,获得30
45秒前
星辰大海应助luoyatu采纳,获得10
45秒前
shain发布了新的文献求助10
46秒前
新火应助Charail采纳,获得10
50秒前
yiyi完成签到,获得积分10
51秒前
初夏发布了新的文献求助10
51秒前
搬砖不断完成签到,获得积分10
52秒前
隐形曼青应助沉静的怜蕾采纳,获得10
52秒前
57秒前
高分求助中
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
A Chronicle of Small Beer: The Memoirs of Nan Green 1000
Understanding Autism and Autistic Functioning 950
From Rural China to the Ivy League: Reminiscences of Transformations in Modern Chinese History 900
Eric Dunning and the Sociology of Sport 850
QMS18Ed2 | process management. 2nd ed 800
Operative Techniques in Pediatric Orthopaedic Surgery 510
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2915256
求助须知:如何正确求助?哪些是违规求助? 2553517
关于积分的说明 6909030
捐赠科研通 2215300
什么是DOI,文献DOI怎么找? 1177645
版权声明 588353
科研通“疑难数据库(出版商)”最低求助积分说明 576466