计算机科学
情态动词
模态(人机交互)
人工智能
医学诊断
对比度(视觉)
光学相干层析成像
医学影像学
模式
水准点(测量)
模式识别(心理学)
机器学习
计算机视觉
医学
放射科
社会科学
化学
大地测量学
社会学
高分子化学
地理
标识
DOI:10.1016/j.bbe.2023.06.001
摘要
Automatic diagnosis of various ophthalmic diseases from ocular medical images is vital to support clinical decisions. Most current methods employ a single imaging modality, especially 2D fundus images. Considering that the diagnosis of ophthalmic diseases can greatly benefit from multiple imaging modalities, this paper further improves the accuracy of diagnosis by effectively utilizing cross-modal data. In this paper, we propose Transformer-based cross-modal multi-contrast network for efficiently fusing color fundus photograph (CFP) and optical coherence tomography (OCT) modality to diagnose ophthalmic diseases. We design multi-contrast learning strategy to extract discriminate features from cross-modal data for diagnosis. Then channel fusion head captures the semantically shared information across different modalities and the similarity features between patients of the same category. Meanwhile, we use a class-balanced training strategy to cope with the situation that medical datasets are usually class-imbalanced. Our method is evaluated on public benchmark datasets for cross-modal ophthalmic disease diagnosis. The experimental results demonstrate that our method outperforms other approaches. The codes and models are available at https://github.com/ecustyy/tcmn.
科研通智能强力驱动
Strongly Powered by AbleSci AI