A semantic-driven coupled network for infrared and visible image fusion

计算机科学 特征(语言学) 人工智能 融合 模式识别(心理学) 分割 过程(计算) 像素 计算机视觉 模态(人机交互) 代表(政治) 语义特征 语言学 哲学 政治 政治学 法学 操作系统
作者
Xiaowen Liu,Hongtao Huo,Jing Li,Shan Pang,Bowen Zheng
出处
期刊:Information Fusion [Elsevier BV]
卷期号:108: 102352-102352 被引量:71
标识
DOI:10.1016/j.inffus.2024.102352
摘要

In order to be adapted to high-level vision tasks, several infrared and visible image fusion methods cascade with the downstream network to enhance the semantic information of fusion results. However, due to the feature-level heterogeneities between fusion and downstream tasks, these methods suffer from the loss of pixel-level information and incomplete reconstruction of semantic-level information. To further improve the performance of fusion images in high-level vision tasks, we propose a semantic-driven coupled network for infrared and visible image fusion, terms as SDCFusion. Firstly, to address feature heterogeneity, we couple the segmentation and fusion networks into a joint framework such that both networks share the multi-level cross-modality coupled features. Based on the joint optimization of dual tasks, a joint action between fusion and downstream tasks is formed to force the cross-modality coupled features modeled on both pixel domain and semantic domain. Subsequently, to guide the semantic information reconstruction, we cascade two networks to form the semantic-based driven action, which continuously optimizes the fusion image to achieve semantic representation capacity. In addition, we introduce an adaptive training strategy to reduce the complexity of dual-task training. Specifically, an mIoU-based semantic measurement weight is designed to balance the joint action and driven action throughout the training process. We evaluate our method at both pixel information and semantic information levels, respectively. The qualitative and quantitative experiments verify the superiority of SDCFusion in terms of visual effects and metrics. The object detection and semantic segmentation experiments demonstrate that SDCFusion achieves superior performance in high-level vision tasks. The source code is available at https://github.com/XiaoW-Liu/SDCFusion.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
AOTUMAN发布了新的文献求助10
1秒前
1秒前
随风发布了新的文献求助10
1秒前
ting完成签到 ,获得积分10
2秒前
2秒前
3秒前
orixero应助summer烨采纳,获得30
4秒前
lili完成签到,获得积分10
4秒前
朴实山兰发布了新的文献求助10
5秒前
yy发布了新的文献求助10
6秒前
科研通AI6.4应助VV采纳,获得10
6秒前
阿刁发布了新的文献求助10
6秒前
CodeCraft应助yangtao采纳,获得10
7秒前
小高同学发布了新的文献求助10
7秒前
lili发布了新的文献求助10
7秒前
zhangzhang发布了新的文献求助10
8秒前
10秒前
10秒前
Re完成签到,获得积分10
10秒前
小蘑菇应助泡泡采纳,获得10
12秒前
13秒前
nicholas完成签到,获得积分10
13秒前
YT发布了新的文献求助10
14秒前
16秒前
jinyu完成签到 ,获得积分10
16秒前
高挑的魔镜完成签到 ,获得积分10
18秒前
18秒前
AOTUMAN完成签到,获得积分10
19秒前
wanci应助Xx采纳,获得10
20秒前
ding应助Xx采纳,获得10
20秒前
所所应助Xx采纳,获得10
20秒前
万能图书馆应助Xx采纳,获得10
20秒前
李健的小迷弟应助Xx采纳,获得10
20秒前
我是老大应助Xx采纳,获得10
20秒前
xbb0905发布了新的文献求助10
21秒前
21秒前
22秒前
SsHh完成签到,获得积分10
22秒前
哩哩完成签到,获得积分10
22秒前
22秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
AnnualResearch andConsultation Report of Panorama survey and Investment strategy onChinaIndustry 1000
卤化钙钛矿人工突触的研究 1000
Continuing Syntax 1000
Signals, Systems, and Signal Processing 610
简明药物化学习题答案 500
脑电大模型与情感脑机接口研究--郑伟龙 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6275283
求助须知:如何正确求助?哪些是违规求助? 8095044
关于积分的说明 16922145
捐赠科研通 5345223
什么是DOI,文献DOI怎么找? 2841901
邀请新用户注册赠送积分活动 1819135
关于科研通互助平台的介绍 1676400