分割
计算机科学
情态动词
编码器
人工智能
计算
参数统计
图像分割
尺度空间分割
计算机视觉
模式识别(心理学)
数学
算法
统计
化学
高分子化学
操作系统
作者
Gustavo Andrade-Miranda,Vincent Jaouen,Olena Tankyevych,Catherine Cheze Le Rest,Dimitris Visvikis,Pierre-Henri Conze
标识
DOI:10.1016/j.compmedimag.2023.102308
摘要
Multi-modal medical image segmentation is a crucial task in oncology that enables the precise localization and quantification of tumors. The aim of this work is to present a meta-analysis of the use of multi-modal medical Transformers for medical image segmentation in oncology, specifically focusing on multi-parametric MR brain tumor segmentation (BraTS2021), and head and neck tumor segmentation using PET-CT images (HECKTOR2021). The multi-modal medical Transformer architectures presented in this work exploit the idea of modality interaction schemes based on visio-linguistic representations: (i) single-stream, where modalities are jointly processed by one Transformer encoder, and (ii) multiple-stream, where the inputs are encoded separately before being jointly modeled. A total of fourteen multi-modal architectures are evaluated using different ranking strategies based on dice similarity coefficient (DSC) and average symmetric surface distance (ASSD) metrics. In addition, cost indicators such as the number of trainable parameters and the number of multiply-accumulate operations (MACs) are reported. The results demonstrate that multi-path hybrid CNN-Transformer-based models improve segmentation accuracy when compared to traditional methods, but come at the cost of increased computation time and potentially larger model size.
科研通智能强力驱动
Strongly Powered by AbleSci AI