计算机科学
峰值信噪比
算法
编码器
图像融合
人工智能
卷积神经网络
均方误差
模式识别(心理学)
图像(数学)
数学
统计
操作系统
作者
Yangyang Zhao,Qingchun Zheng,Peihao Zhu,Xu Zhang,Wenpeng Ma
出处
期刊:IEEE Transactions on Circuits and Systems for Video Technology
[Institute of Electrical and Electronics Engineers]
日期:2023-07-19
卷期号:34 (3): 1712-1725
被引量:7
标识
DOI:10.1109/tcsvt.2023.3296745
摘要
Multimodal image fusion is one of the important research directions in the field of multimodal fusion. This technique can realize image and data enhancement by using complementary multimodal images and be widely used in medicine, industry, security and fire protection, automatic driving and consumer electronics. In this work, we propose a transformer-based universal fusion (TUFusion) algorithm, and it has a multidomain fusion capability. The advantage of TUFusion algorithm is the design of hybrid transformer and convolutional neural network (CNN) encoder structure and a new composite attention fusion strategy, which has the ability of global and local information integration. Compared with the classical state-of-the-art multimodal image fusion methods, the experimental result on multidomain data sets showed that the TUFusion algorithm has certain universality in image fusion. Meanwhile, the TUFusion algorithm we proposed achieves good values on peak signal to noise ratio (PSNR), root mean square error (RMSE) and structural similarity index measure (SSIM). The code of the TUFusion algorithm in this article is available at https://github.com/windrunners/TUFusion .
科研通智能强力驱动
Strongly Powered by AbleSci AI