计算机科学
人工智能
卷积神经网络
模式识别(心理学)
残余物
特征(语言学)
块(置换群论)
鉴别器
小波
算法
数学
探测器
几何学
语言学
电信
哲学
作者
Zeli Chen,Chuanpu Li,Kaiyi Zheng,Yiwen Zhang,Yuankui Wu,Qianjin Feng,Liming Zhong,Wei Yang
标识
DOI:10.1109/isbi53787.2023.10230486
摘要
Synthesis of Computed Tomography (CT) images from Magnetic Resonance (MR) images is clinical significance for MR-only treatment planning to eliminate the co-registration errors between MR and CT images. Existing convolutional neural network-based methods, suffering from the inherent local inductive biases, struggle to distinguish bone and air which show low signals in conventional MR images. ViT-based methods can learn long-range contextual information by a global self-attention mechanism but are limited by the quadratic complexity and generating local detailed structures. Combining the merits of these two architectures, we propose a hybrid network for MR-to-CT synthesis via global and local feature aggregation from Transformer and CNN, named GLFA-NET. Specifically, we add a global patch embedding branch to supplement the patch-based global representative features directly from the image and design a residual dilated swin transformer block aggregating the local detailed features and global features to improve the synthesis performance of bone and air and reduce computational overhead. Furthermore, we adopt a wavelet PatchGAN discriminator to enhance the high-frequency detailed information of the synthetic CT. Our GLFA-NET was implemented on a dataset with 154 pairs of 3D MR-CT head and neck images. Experiments show that our GLFA-NET achieves impressive performance with MAE of 71.12 ± 10.87, SSIM of 0.771 ± 0.028, and PSNR of 28.91 ± 1.33. The visual synthetic CT results also show that the proposed GLFA-NET method achieves better discrimination of bone and air and higher structural similarity than other state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI