计算机科学
分割
人工智能
推论
水准点(测量)
变压器
乳腺超声检查
计算机视觉
自回归模型
模式识别(心理学)
机器学习
地图学
医学
物理
量子力学
电压
癌症
乳腺癌
乳腺摄影术
内科学
经济
计量经济学
地理
作者
Jialu Li,Qingqing Zheng,Mingshuang Li,Ping Liu,Qiong Wang,Litao Sun,Lei Zhu
标识
DOI:10.1007/978-3-031-16440-8_38
摘要
Automatic breast lesion segmentation in ultrasound (US) videos is an essential prerequisite for early diagnosis and treatment. This challenging task remains under-explored due to the lack of availability of annotated US video dataset. Though recent works have achieved better performance in natural video object segmentation by introducing promising Transformer architectures, they still suffer from spatial inconsistency as well as huge computational costs. Therefore, in this paper, we first present a new benchmark dataset designed for US video segmentation. Then, we propose a dynamic parallel spatial-temporal Transformer (DPSTT) to improve the performance of lesion segmentation in US videos with higher computational efficiency. Specifically, the proposed DPSTT disentangles the non-local Transformer along the temporal and spatial dimensions, respectively. The temporal Transformer attends temporal lesion movement on different frames at the same regions, and the spatial Transformer focuses on similar context information between the previous and the current frames. Furthermore, we propose a dynamic selection scheme to effectively sample the most relevant frames from all the past frames, and thus prevent out of memory during inference. Finally, we conduct extensive experiments to evaluate the efficacy of the proposed DPSTT on the new US video benchmark dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI