计算机科学
人工智能
编码器
图形
图像分割
分割
图像(数学)
计算机视觉
理论计算机科学
操作系统
作者
Yuanhong Jiang,Qiaoqiao Ding,Yu Guang Wang,Píetro Lió,Xiaoqun Zhang
出处
期刊:Inverse Problems and Imaging
[American Institute of Mathematical Sciences]
日期:2024-01-01
卷期号:18 (3): 672-689
被引量:1
摘要
Convolutional neural networks (CNNs) are known for their powerful feature extraction ability, and have achieved great success in a variety of image processing tasks. However, convolution filters only extract local features and neglect long-range self-similarity information, which is the vital prior information commonly existing in image data. To this end, we put forward a new backbone neural network: vision graph U-Net (VGU-Net), which is the first model to construct multi-scale graph structures through the hierarchical down-sampling layers of the U-Net architecture. The graph structure is constructed by the self-attention mechanism. By replacing CNNs in the bottleneck layer and skip connection layers with the graph convolution networks (GCNs), the multi-scale graph structure visualization allows an interpretation of long-range interactions. We extend the VGU-Net backbone model for the widely considered compressed sensing MR image reconstruction task and propose a knowledge-driven deep unrolling scheme based on the half-quadratic splitting algorithm, which combines the interpretability of knowledge-driven model with the versatility of data-driven deep learning method to achieve remarkable reconstruction results. Moreover, we verify the segmentation ability of the VGU-Net backbone model on the multi-modality brain tumor segmentation dataset and white blood cell image segmentation dataset, and both achieve state-of-the-art performance. The code is publicly available at https://github.com/jyh6681/VGU-Net.
科研通智能强力驱动
Strongly Powered by AbleSci AI