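Computer science
Discriminative
Artificial intelligence
Segmentation
Benchmark (surveying)
Encoder
Object (grammar)
Similarity (geometry)
Pattern recognition (psychology)
Coding (set theory)
Machine learning
Image (mathematics)
Geodesy
Geography
Operating system
Set (abstract data type)
Programming language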
Authors
Shaoteng Zhang, Jianpeng Zhang, Yutong Xie, Yong Xia
Identifier
DOI:10.1007/978-3-031-43907-0_11
Abstract
Most existing weakly-supervised segmentation methods rely on class activation maps (CAM) to generate pseudo-labels for training segmentation models. However, CAM has been criticized for highlighting only the most discriminative parts of the object, leading to poor-quality pseudo-labels. Although some recent methods have attempted to extend CAM to cover more areas, the fundamental problem still needs to be solved. We believe this problem is due to the huge gap between image-level labels and pixel-level predictions and that additional information must be introduced to address this issue. Thus, we propose a text-prompting-based weakly supervised segmentation method (TPRO), which uses text to introduce additional information. TPRO employs a vision encoder and a label encoder to generate a similarity map for each image, which serves as our localization map. Pathological knowledge is gathered from the internet and embedded as knowledge features, which are used to guide the image features through a knowledge attention module. Additionally, we employ a deep supervision strategy to fully utilize the network's shallow information. Our approach outperforms other weakly supervised segmentation methods on the LUAD-HistoSeg and BCSS-WSSS benchmark datasets, setting a new state of the art. Code is available at: https://github.com/zhangst431/TPRO.
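The abstract's idea of a similarity map serving as a localization map can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes, for illustration only, that the vision encoder yields a per-pixel feature grid of shape (H, W, D), that the label encoder yields one embedding per class of shape (C, D), and that similarity is measured by cosine similarity. The function names `similarity_map` and `pseudo_labels` are hypothetical.

```python
import numpy as np


def similarity_map(image_features, label_embeddings, eps=1e-8):
    """Cosine similarity between every pixel feature and every label embedding.

    image_features:   array of shape (H, W, D), assumed vision-encoder output.
    label_embeddings: array of shape (C, D), assumed label-encoder output.
    Returns an array of shape (C, H, W), one similarity map per class.
    """
    # L2-normalise along the feature dimension so the dot product is a cosine.
    f = image_features / (np.linalg.norm(image_features, axis=-1, keepdims=True) + eps)
    t = label_embeddings / (np.linalg.norm(label_embeddings, axis=-1, keepdims=True) + eps)
    # Contract the feature dimension D, leaving one (H, W) map per class.
    return np.einsum("hwd,cd->chw", f, t)


def pseudo_labels(sim_map):
    """Assign each pixel the class with the highest similarity (illustrative choice)."""
    return sim_map.argmax(axis=0)


if __name__ == "__main__":
    # Toy 2x2 feature grid with 3-dimensional features and two classes.
    feats = np.array([[[1.0, 0.0, 0.0], [0.0, 2.0, 0.0]],
                      [[3.0, 0.0, 0.0], [0.0, 1.0, 0.0]]])
    labels = np.eye(3)[:2]
    print(pseudo_labels(similarity_map(feats, labels)))
```

In this sketch the per-pixel argmax stands in for pseudo-label generation; the repository linked in the abstract is the reference for how TPRO actually builds and supervises its maps.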