计算机科学
分割
人工智能
编码器
卷积神经网络
模式识别(心理学)
图像分割
特征(语言学)
变压器
深度学习
掷骰子
工程类
电压
数学
电气工程
操作系统
哲学
语言学
几何学
作者
Jing Zhang,Qiuge Qin,Qi Ye,Tong Ruan
标识
DOI:10.1016/j.compbiomed.2022.106516
摘要
Medical image segmentation is an essential task in clinical diagnosis and case analysis. Most of the existing methods are based on U-shaped convolutional neural networks (CNNs), and one of disadvantages is that the long-term dependencies and global contextual connections cannot be effectively established, which results in inaccuracy segmentation. For fully using low-level features to enhance global features and reduce the semantic gap between encoding and decoding stages, we propose a novel Swin Transformer boosted U-Net (ST-Unet) for medical image processing in this paper, in which Swin Transformer and CNNs are used as encoder and decoder respectively. Then a novel Cross-Layer Feature Enhancement (CLFE) module is proposed to realize cross-layer feature learning, and a Spatial and Channel Squeeze & Excitation module is adopted to highlight the saliency of specific regions. Finally, we learn the features fused by the CLFE module through CNNs to recover low-level features and localize local features for realizing more accurate semantic segmentation. Experiments on widely used public datasets Synapse and ISIC 2018 prove that our proposed ST-Unet can achieve 78.86 of dice and 0.9243 of recall performance, outperforming most current medical image segmentation methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI