计算机科学
人工智能
保险丝(电气)
噪音(视频)
降噪
光学(聚焦)
概率逻辑
图像融合
计算机视觉
模式识别(心理学)
干扰(通信)
融合
图像(数学)
特征提取
频道(广播)
电气工程
光学
物理
工程类
哲学
语言学
计算机网络
作者
Maolin Li Yiqun Li,Ronghao Pei,Tianyou Zheng,Yang Zhang,Weiwei Fu
标识
DOI:10.1016/j.eswa.2023.121664
摘要
Multi-focus image fusion (MFIF) is an image enhancement technology with broad application prospects that can effectively extend the depth-of-field of optical lenses. This paper presents a novel MFIF method, named FusionDiff, based on denoising diffusion probabilistic models (DDPM). FusionDiff uses DDPM to fuse two source images by iteratively performing multiple denoising operations. To predict noise accurately and train the model efficiently, a lightweight U-Net architecture is designed as the conditional noise predictor. FusionDiff does not depend on any specific activity level measurement method, fusion rule or complex feature extraction network. It overcomes many algorithm design and training difficulties in existing image fusion methods. FusionDiff is noise-resistant and can still produce outstanding fusion results from source images with noise interference. In addition, FusionDiff is a few-shot learning method, which makes it suitable for image fusion tasks where training samples are relatively scarce. Experiments show that FusionDiff outperforms representative state-of-the-art methods in both visual perception and quantitative metrics. The code is available at https://github.com/lmn-ning/ImageFusionhttps://github.com/lmn-ning/ImageFusion
科研通智能强力驱动
Strongly Powered by AbleSci AI