计算机科学
人工智能
分割
编码器
卷积(计算机科学)
特征学习
卷积神经网络
图像分割
模式识别(心理学)
变压器
特征(语言学)
深度学习
计算机视觉
人工神经网络
工程类
语言学
哲学
电压
电气工程
操作系统
作者
Ke Tang,Xiaofei Yang,Xiaofeng Zhang,Weijia Cao,Chunfeng Li,Sihuan Li
标识
DOI:10.1109/medai59581.2023.00064
摘要
Deep learning-based methods have achieved significant success in medical image segmentation. However, the intrinsic locality of convolutional operations limits the learning of global semantic information. In this paper, we propose a novel method, namely VAC-UNet that is a convolutional transformer U-Net architecture for enhanced global representation. Specially, we introduce a novel Visual Attention Convolution (VAC) block integrating transformers and CNNs for joint local-global feature learning. VAC blocks replace standard convolutions in the encoder-decoder backbone. Asymmetric application on encoders improves localization. Additionally, we design Visual Attention Gates using learned convolution filters to suppress irrelevant regions in skip connections. Finally, we evaluated VAC-UNet on two medical image segmentation tasks against state-of-the-art methods. The results demonstrate VAC blocks effectively capture global context while maintaining local precision. The visual attention improves feature propagation for better segmentation.
科研通智能强力驱动
Strongly Powered by AbleSci AI