A semantic-driven coupled network for infrared and visible image fusion

计算机科学 特征(语言学) 人工智能 融合 模式识别(心理学) 分割 过程(计算) 像素 计算机视觉 模态(人机交互) 代表(政治) 语义特征 语言学 哲学 政治 政治学 法学 操作系统
作者
Xiaowen Liu,Hongtao Huo,Jing Li,Shan Pang,Bowen Zheng
出处
期刊:Information Fusion [Elsevier BV]
卷期号:108: 102352-102352 被引量:71
标识
DOI:10.1016/j.inffus.2024.102352
摘要

In order to be adapted to high-level vision tasks, several infrared and visible image fusion methods cascade with the downstream network to enhance the semantic information of fusion results. However, due to the feature-level heterogeneities between fusion and downstream tasks, these methods suffer from the loss of pixel-level information and incomplete reconstruction of semantic-level information. To further improve the performance of fusion images in high-level vision tasks, we propose a semantic-driven coupled network for infrared and visible image fusion, terms as SDCFusion. Firstly, to address feature heterogeneity, we couple the segmentation and fusion networks into a joint framework such that both networks share the multi-level cross-modality coupled features. Based on the joint optimization of dual tasks, a joint action between fusion and downstream tasks is formed to force the cross-modality coupled features modeled on both pixel domain and semantic domain. Subsequently, to guide the semantic information reconstruction, we cascade two networks to form the semantic-based driven action, which continuously optimizes the fusion image to achieve semantic representation capacity. In addition, we introduce an adaptive training strategy to reduce the complexity of dual-task training. Specifically, an mIoU-based semantic measurement weight is designed to balance the joint action and driven action throughout the training process. We evaluate our method at both pixel information and semantic information levels, respectively. The qualitative and quantitative experiments verify the superiority of SDCFusion in terms of visual effects and metrics. The object detection and semantic segmentation experiments demonstrate that SDCFusion achieves superior performance in high-level vision tasks. The source code is available at https://github.com/XiaoW-Liu/SDCFusion.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
ding关注了科研通微信公众号
刚刚
Water完成签到,获得积分10
刚刚
bkagyin应助后悔药不可用采纳,获得10
刚刚
1秒前
整齐乐驹发布了新的文献求助10
2秒前
一一发布了新的文献求助100
3秒前
5秒前
曹超国发布了新的文献求助10
5秒前
MP应助沉默的下雨采纳,获得50
6秒前
6秒前
田様应助shimmer采纳,获得10
7秒前
领导范儿应助shimmer采纳,获得10
7秒前
大模型应助shimmer采纳,获得10
7秒前
Akim应助www采纳,获得10
8秒前
Wss完成签到 ,获得积分10
8秒前
BAMBOO完成签到,获得积分10
8秒前
淡定夜山完成签到,获得积分10
9秒前
科研小白完成签到,获得积分10
9秒前
爆米花应助卜谷雪采纳,获得10
9秒前
9秒前
10秒前
一点点关注了科研通微信公众号
10秒前
淡定夜山发布了新的文献求助10
12秒前
工藤应助hsialy采纳,获得10
12秒前
汉堡包应助bazinga采纳,获得30
12秒前
Song君发布了新的文献求助10
15秒前
16秒前
17秒前
临济知阳完成签到,获得积分10
18秒前
Owen应助冥冥之极为昭昭采纳,获得10
19秒前
李健应助顺利白竹采纳,获得10
20秒前
www发布了新的文献求助10
20秒前
20秒前
QYQ完成签到 ,获得积分10
21秒前
22秒前
uiai完成签到,获得积分20
22秒前
26秒前
娇1994完成签到,获得积分10
26秒前
26秒前
29秒前
高分求助中
Signals, Systems, and Signal Processing 610
Annie Ernaux: De la perte au corps glorieux 600
Petrology and Plate Tectonics,2025 500
Cardiopulmonary Bypass and Mechanical Support: Principles and Practice, Fifth Edition 400
Circular Polar Constellations Providing Continuous Single or Multiple Coverage Above a Specified Latitude 400
Burger's Medicinal Chemistry and Drug Discovery 400
Probability and Stochastic Processes 333
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6749118
求助须知:如何正确求助?哪些是违规求助? 8478625
关于积分的说明 18082015
捐赠科研通 6023947
什么是DOI,文献DOI怎么找? 3006044
邀请新用户注册赠送积分活动 1982921
关于科研通互助平台的介绍 1950627