SwinWave-SR: Multi-scale lightweight underwater image super-resolution

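Computer science, Underwater, Artificial intelligence, Convolutional neural network, Pooling, Computer vision, Computer engineering, Real-time computing, Oceanography, Geology
Authors
Fayaz Ali Dharejo, Iyyakutti Iyappan Ganapathi, Muhammad Zawish, Basit Alawode, Moath Alathbah, Naoufel Werghi, Sajid Javed
Source
Journal: Information Fusion [Elsevier]
Volume: 103: 102127-102127 Citations: 8
Identifier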
DOI:10.1016/j.inffus.2023.102127
Abstract

The resource-limited nature of underwater vision equipment yields poor, low-resolution imagery, which degrades downstream underwater robotics and ocean engineering tasks. Underwater Image Enhancement (UIE) methods, particularly Super-Resolution (SR), have emerged to tackle this challenge by restoring a low-resolution image to a high-quality counterpart. Vision Transformers (ViTs) have recently been employed for SR tasks thanks to their superior performance over mainstream convolutional neural networks. The success of ViTs is largely due to their self-attention mechanism; however, they may struggle with the severe and unpredictable degradation found in underwater imaging. In contrast, Multi-scale ViT (MViT) variants such as the Swin transformer have overcome that challenge by preserving long-range dependencies over multi-scale feature hierarchies through evolving channel capacity. However, MViTs tend to obtain spatial efficiency through classical down-sampling, such as average pooling over keys/values, which inevitably loses high-frequency components. To address this shortcoming, we propose a novel algorithm, SwinWave-SR, for efficient and accurate multi-scale SR of underwater images. The proposed algorithm is based on the Swin transformer and includes a wavelet block that limits information loss by downsampling in an invertible fashion. Consequently, the key components are preserved to assist self-attention learning while its computational cost is reduced. To complement this, we explore a prominent compression regime, the Lottery Ticket Hypothesis (LTH), to discover a lightweight sub-network whose performance is competitive with the original model while reducing computational costs by up to 70.44%.
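To illustrate the contrast the abstract draws between average pooling and invertible wavelet downsampling, here is a minimal sketch in plain Python. It assumes a single-level 2x2 Haar transform; the abstract does not state which wavelet or block design SwinWave-SR uses, so the functions `avg_pool`, `haar_down`, and `haar_up` are hypothetical illustrations, not the authors' implementation.

```python
def avg_pool(img):
    """2x2 average pooling: halves the resolution and discards detail."""
    rows, cols = len(img) // 2, len(img[0]) // 2
    return [[(img[2*i][2*j] + img[2*i][2*j+1]
              + img[2*i+1][2*j] + img[2*i+1][2*j+1]) / 4
             for j in range(cols)] for i in range(rows)]


def haar_down(img):
    """One-level 2x2 Haar split (assumed wavelet) into four half-size bands."""
    rows, cols = len(img) // 2, len(img[0]) // 2
    bands = [[[0.0] * cols for _ in range(rows)] for _ in range(4)]
    for i in range(rows):
        for j in range(cols):
            a, b = img[2*i][2*j], img[2*i][2*j+1]
            c, d = img[2*i+1][2*j], img[2*i+1][2*j+1]
            bands[0][i][j] = (a + b + c + d) / 2  # low-frequency band
            bands[1][i][j] = (a - b + c - d) / 2  # difference across columns
            bands[2][i][j] = (a + b - c - d) / 2  # difference across rows
            bands[3][i][j] = (a - b - c + d) / 2  # diagonal difference
    return bands


def haar_up(bands):
    """Exact inverse of haar_down: rebuilds the full-resolution grid."""
    low, col_diff, row_diff, diag = bands
    rows, cols = len(low), len(low[0])
    img = [[0.0] * (2 * cols) for _ in range(2 * rows)]
    for i in range(rows):
        for j in range(cols):
            s, h = low[i][j], col_diff[i][j]
            v, g = row_diff[i][j], diag[i][j]
            img[2*i][2*j] = (s + h + v + g) / 2
            img[2*i][2*j+1] = (s - h + v - g) / 2
            img[2*i+1][2*j] = (s + h - v - g) / 2
            img[2*i+1][2*j+1] = (s - h - v + g) / 2
    return img


if __name__ == "__main__":
    flat = [[5, 5], [5, 5]]
    checker = [[0, 10], [10, 0]]
    # Both patches pool to the same value, so pooling cannot tell them apart.
    print(avg_pool(flat), avg_pool(checker))
    # The Haar bands differ, and haar_up recovers the original patch.
    print(haar_down(checker), haar_up(haar_down(checker)))
```

In this toy case a flat patch and a checkerboard patch pool to the same average, while the Haar split keeps the checkerboard's detail in a separate band and can be inverted exactly. The four half-resolution bands together hold as many values as the input, which is why no information is lost.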
Overall, SwinWave-SR improves peak signal-to-noise ratio (PSNR) by 0.95 dB to 2.23 dB compared to the state-of-the-art SwinIR while reducing the number of parameters by 29.56% and the computational cost by 18.734%. Experimental results show that the proposed SwinWave-SR method outperforms state-of-the-art SR methods on four benchmark underwater datasets and significantly improves PSNR and structural similarity index (SSIM).
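For readers unfamiliar with the metric, the following sketch computes PSNR using the standard definition, 10 * log10(peak^2 / MSE). The abstract does not describe the evaluation protocol (color space, border cropping, peak value), so the default `peak=255.0` and the helper `mse_ratio_for_gain` are assumptions made for illustration only.

```python
import math


def psnr(reference, estimate, peak=255.0):
    """PSNR in dB between two equally sized 2D grids of pixel values.

    peak is the maximum possible pixel value (assumed 255 for 8-bit images).
    """
    ref = [p for row in reference for p in row]
    est = [p for row in estimate for p in row]
    if len(ref) != len(est):
        raise ValueError("images must have the same number of pixels")
    mse = sum((r - e) ** 2 for r, e in zip(ref, est)) / len(ref)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a given PSNR gain at a fixed peak."""
    return 10.0 ** (-gain_db / 10.0)


if __name__ == "__main__":
    reference = [[0, 0], [0, 0]]
    estimate = [[25.5, 25.5], [25.5, 25.5]]
    print(psnr(reference, estimate))
    print(mse_ratio_for_gain(0.95), mse_ratio_for_gain(2.23))
```

Under this definition, and for a single image pair at a fixed peak value, gains of 0.95 dB and 2.23 dB correspond to roughly 20% and 40% lower mean squared error, respectively.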
