计算机科学
人工智能
分割
卷积神经网络
图像分割
对角线的
卷积(计算机科学)
计算机视觉
变压器
模式识别(心理学)
残余物
人工神经网络
算法
数学
工程类
电压
电气工程
几何学
作者
Shoukun Xu,D. Xiao,Baozong Yuan,Yi Liu,Xueyuan Wang,Ning Li,Lin Shi,Jialu Chen,Ju-Xiao Zhang,Yanhao Wang,Jianfeng Cao,Yeqin Shao,Mingjie Jiang
标识
DOI:10.1016/j.compbiomed.2023.107567
摘要
Medical image segmentation is crucial for accurate diagnosis and treatment in the medical field. In recent years, convolutional neural networks (CNNs) and Transformers have been frequently adopted as network architectures in medical image segmentation. The convolution operation is limited in modeling long-range dependencies because it can only extract local information through the limited receptive field. In comparison, Transformers demonstrate excellent capability in modeling long-range dependencies but are less effective in capturing local information. Hence, effectively modeling long-range dependencies while preserving local information is essential for accurate medical image segmentation. In this paper, we propose a four-axis fusion framework called FAFuse, which can exploit the advantages of CNN and Transformer. As the core component of our FAFuse, a Four-Axis Fusion module (FAF) is proposed to efficiently fuse global and local information. FAF combines Four-Axis attention (height, width, main diagonal, and counter diagonal axial attention), a multi-scale convolution, and a residual structure with a depth-separable convolution and a Hadamard product. Furthermore, we also introduce deep supervision to enhance gradient flow and improve overall performance. Our approach achieves state-of-the-art segmentation accuracy on three publicly available medical image segmentation datasets. The code is available at https://github.com/cczu-xiao/FAFuse.
科研通智能强力驱动
Strongly Powered by AbleSci AI