A semantic-driven coupled network for infrared and visible image fusion

计算机科学特征（语言学）人工智能融合模式识别（心理学）分割过程（计算）像素计算机视觉模态（人机交互）代表（政治）语义特征语言学哲学政治政治学法学操作系统

作者

Xiaowen Liu,Hongtao Huo,Jing Li,Shan Pang,Bowen Zheng

出处

期刊：Information Fusion [Elsevier BV]
日期：2024-03-11 卷期号：108: 102352-102352 被引量：20

标识

DOI：10.1016/j.inffus.2024.102352

摘要

In order to be adapted to high-level vision tasks, several infrared and visible image fusion methods cascade with the downstream network to enhance the semantic information of fusion results. However, due to the feature-level heterogeneities between fusion and downstream tasks, these methods suffer from the loss of pixel-level information and incomplete reconstruction of semantic-level information. To further improve the performance of fusion images in high-level vision tasks, we propose a semantic-driven coupled network for infrared and visible image fusion, terms as SDCFusion. Firstly, to address feature heterogeneity, we couple the segmentation and fusion networks into a joint framework such that both networks share the multi-level cross-modality coupled features. Based on the joint optimization of dual tasks, a joint action between fusion and downstream tasks is formed to force the cross-modality coupled features modeled on both pixel domain and semantic domain. Subsequently, to guide the semantic information reconstruction, we cascade two networks to form the semantic-based driven action, which continuously optimizes the fusion image to achieve semantic representation capacity. In addition, we introduce an adaptive training strategy to reduce the complexity of dual-task training. Specifically, an mIoU-based semantic measurement weight is designed to balance the joint action and driven action throughout the training process. We evaluate our method at both pixel information and semantic information levels, respectively. The qualitative and quantitative experiments verify the superiority of SDCFusion in terms of visual effects and metrics. The object detection and semantic segmentation experiments demonstrate that SDCFusion achieves superior performance in high-level vision tasks. The source code is available at https://github.com/XiaoW-Liu/SDCFusion.

求助该文献

最长约 10秒，即可获得该文献文件

A semantic-driven coupled network for infrared and visible image fusion

今日热心研友