去模糊
计算机科学
变压器
计算机视觉
人工智能
平滑的
运动模糊
图像复原
计算
像素
图像处理
计算机图形学(图像)
图像(数学)
算法
量子力学
物理
电压
作者
Fu-Jen Tsai,Yan-Tsung Peng,Yen‐Yu Lin,Chung-Chi Tsai,Chia‐Wen Lin
标识
DOI:10.1007/978-3-031-19800-7_9
摘要
Images taken in dynamic scenes may contain unwanted motion blur, which significantly degrades visual quality. Such blur causes short- and long-range region-specific smoothing artifacts that are often directional and non-uniform, which is difficult to be removed. Inspired by the current success of transformers on computer vision and image processing tasks, we develop, Stripformer, a transformer-based architecture that constructs intra- and inter-strip tokens to reweight image features in the horizontal and vertical directions to catch blurred patterns with different orientations. It stacks interlaced intra-strip and inter-strip attention layers to reveal blur magnitudes. In addition to detecting region-specific blurred patterns of various orientations and magnitudes, Stripformer is also a token-efficient and parameter-efficient transformer model, demanding much less memory usage and computation cost than the vanilla transformer but works better without relying on tremendous training data. Experimental results show that Stripformer performs favorably against state-of-the-art models in dynamic scene deblurring.
科研通智能强力驱动
Strongly Powered by AbleSci AI