判别式
人工智能
计算机科学
体积热力学
多标签分类
嵌入
光学相干层析成像
模式识别(心理学)
自编码
编码器
上下文图像分类
深度学习
机器学习
图像(数学)
放射科
医学
物理
量子力学
操作系统
作者
Yuhan Zhang,Ziqi Tang,Dawei Yang,An Ran Ran,Carol Y. Cheung,Pheng‐Ann Heng
标识
DOI:10.1109/isbi53787.2023.10230396
摘要
Optical coherence tomography (OCT) technique can produce volumetric data of the retina for disease diagnosis. Each OCT volume consists of 2D B-scans, and recent studies have shown remarkable success with deep learning for single-label B-scan classification tasks. However, B-scan annotation is quite difficult and single-label classification approaches cannot meet the growing clinical demands. It is attractive to develop a multi-label classification approach for disease diagnosis only using volume-level labels. In this paper, we propose a label-volume contrastive learning (LVCL) for the multi-label classification of OCT volumes. In LVCL, a robust volume encoder (RVE) is proposed to convert an OCT volume into its volume embedding by modeling the local and global dependencies among B-scans. Meanwhile, a label encoder outputs discriminative label embeddings for all pathological labels. Lastly, the correspondence between volume embedding and label embeddings is learned by contrastive learning and our proposed loss function. Experiments on 5121 OCT volumes of 1244 patients show superior multi-label volume classification performance than other approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI