计算机科学
人工智能
比例(比率)
棱锥(几何)
相似性(几何)
核(代数)
光学(聚焦)
模式识别(心理学)
特征(语言学)
像素
编码(集合论)
图像(数学)
计算机视觉
数学
地理
地图学
哲学
物理
光学
组合数学
集合(抽象数据类型)
程序设计语言
语言学
几何学
作者
Zheyu Zhang,Yurui Zhu,Xueyang Fu,Zhiwei Xiong,Zheng-Jun Zha,Feng Wu
标识
DOI:10.1145/3474085.3475444
摘要
Albeit existing deep learning-based image de-raining methods have achieved promising results, most of them only extract single scale features, and neglect the fact that similar rain streaks appear repeatedly across different scales. Therefore, this paper aims to explore the cross-scale cues in a multi-scale fashion. Specifically, we first introduce an adaptive-kernel pyramid to provide effective multi-scale information. Then, we design two cross-scale similarity attention blocks (CSSABs) to search spatial and channel relationships between two scales, respectively. The spatial CSSAB explores the spatial similarity between pixels of cross-scale features, while the channel CSSAB emphasizes the interdependencies among cross-scale features. To further improve the diversity of features, we adopt the wavelet transformation and multi-head mechanism in CSSABs to generate multifocal features which focus on different areas. Finally, based on our CSSABs, we construct an effective multifocal attention-based cross-scale network, which exhaustively utilizes the cross-scale correlations of both rain streaks and background, to achieve image de-raining. Experiments show the superiority of our network over state-of-the-art image de-raining approaches both qualitatively and quantitatively. The source code and pre-trained models are available at https://github.com/zhangzheyu0/Multifocal_derain.
科研通智能强力驱动
Strongly Powered by AbleSci AI