计算机科学
分割
人工智能
图像分割
模式识别(心理学)
感受野
计算机视觉
尺度空间分割
变压器
物理
电压
量子力学
作者
Yanlin Wu,Guanglei Wang,Zhongyang Wang,Hongrui Wang,Yan Li
标识
DOI:10.1016/j.bspc.2022.103896
摘要
In recent years, Unet network based on convolution has become a general structure for medical image segmentation tasks. However, it cannot effectively model the long-distance dependence between features due to the limitation of the receptive field. The successful application of Transformer in computer vision solves the problem of the limited receptive field of neural networks. However, the computational complexity limits its further application in medical image segmentation. In addition, the self attention mechanism in Transformer only explores the spatial dimension relationship of the feature maps, and lacks the interaction with the channel dimension, which limits the performance improvement of the network. Here, we proposes DI-Unet, which develops Dimensional Interactive (DI) self-attention for effective feature extraction processing. When inputting high - resolution images, it can effectively reduce the amount of model calculations and capture cross-dimensional information before calculating attention weights. The overwhelming superiority of DI-Unet is demonstrated by extensive experiments in multiple databases. In large datasets, the proposed method outperforms other methods in segmentation tasks. The study provides a research foundation and important reference value for the research and application of Transformer structure in medical image segmentation tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI