计算机科学
忠诚
人工智能
过程(计算)
管道(软件)
水准点(测量)
噪音(视频)
扩散
图像(数学)
降噪
计算机视觉
编码(集合论)
电信
物理
大地测量学
集合(抽象数据类型)
热力学
程序设计语言
地理
操作系统
作者
Xingguo Lv,Xingbo Dong,Zhe Jin,Hui Zhang,Siyi Song,Xuejun Li
标识
DOI:10.1007/978-981-99-8552-4_11
摘要
Low-light image enhancement is a challenging yet beneficial task in computer vision that aims to improve the quality of images captured under poor illumination conditions. It involves addressing difficulties such as color distortions and noise, which often degrade the visual fidelity of low-light images. Although tremendous CNN-based and ViT-based approaches have been proposed, the potential of diffusion models in this domain remains unexplored. This paper presents L $$^2$$ DM, a novel framework for low-light image enhancement using diffusion models. Since L $$^2$$ DM falls into the category of latent diffusion models, it can reduce computational requirements through denoising and the diffusion process in latent space. Conditioning inputs are essential for guiding the enhancement process, therefore, a new ViT-based network called ViTCondNet is introduced to efficiently incorporate conditioning low-light inputs into the image generation pipeline. Extensive experiments on benchmark LOL datasets demonstrate L $$^2$$ DM's state-of-the-art performance compared to diffusion-based counterparts. The L $$^2$$ DM source code is available on GitHub for reproducibility and further research.
科研通智能强力驱动
Strongly Powered by AbleSci AI