计算机科学
联营
人工智能
提取器
新闻聚合器
模式识别(心理学)
变压器
数据挖掘
电压
工程类
物理
量子力学
工艺工程
操作系统
作者
Haijing Luan,Taiyuan Hu,Jifang Hu,Ruilin Li,Detao Ji,Jiayin He,Xiaohong Duan,Chunyan Yang,Ya‐Jun Gao,Fan Chen,Beifang Niu
标识
DOI:10.1007/978-981-99-7074-2_12
摘要
Whole slide images (WSIs) are high-resolution and lack localized annotations, whose classification can be treated as a multiple instance learning (MIL) problem while slide-level labels are available. We introduce a approach for WSI classification that leverages the MIL and Transformer, effectively eliminating the requirement for localized annotations. Our method consists of three key components. Firstly, we use ResNet50, which has been pre-trained on ImageNet, as an instance feature extractor. Secondly, we present a Transformer-based MIL aggregator that adeptly captures contextual information within individual regions and correlation information among diverse regions within the WSI. Thirdly, we introduce the global average pooling (GAP) layer to increase the mapping relationship between WSI features and category features. To evaluate our model, we conducted experiments on the The Cancer Imaging Archive (TCIA) Clinical Proteomic Tumor Analysis Consortium (CPTAC) dataset. Our proposed method achieves a top-1 accuracy of 94.8% and an area under the curve (AUC) exceeding 0.996, establishing state-of-the-art performance in WSI classification without reliance on localized annotations. The results demonstrate the superiority of our approach compared to previous MIL-based methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI