计算机科学
人工智能
分割
增采样
模式识别(心理学)
图像分割
变压器
特征(语言学)
卷积(计算机科学)
特征提取
计算机视觉
图像(数学)
人工神经网络
工程类
语言学
哲学
电压
电气工程
作者
Hao Zeng,Xinxin Shan,Yu Feng,Ying Wen
标识
DOI:10.1109/icme55011.2023.00391
摘要
U-Net and its variants have achieved impressive results in medical image segmentation. However, the downsampling operation of such U-shaped networks causes the feature maps to lose a certain degree of spatial information, and most existing methods use convolution and transformer sequentially, it is hard to extract more comprehensive feature representation of the image. In this paper, we propose a novel U-shaped segmentation network named Multi-scale Axial Attention Network (MSAANet) to solve the above problems. Specifically, we propose a cross-scale interactive attention: multi-scale axial attention (MSAA), which achieves direction-perception attention of different scales interaction. So that the downsampling deep features and the shallow features can maintain context spatial consistency. Besides, we propose a Convolution-Transformer (CT) block, which makes transformer and convolution complement each other to enhance comprehensive feature representation. We evaluate the proposed method on the public datasets Synapse and ACDC. Experimental results demonstrate that MSAANet effectively improves segmentation accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI