Authors
Wenchao Zhang, Chong Fu, Yu Zheng, Fang-Yuan Zhang, Yanli Zhao, Chiu-Wing Sham
Abstract
Automatic polyp segmentation can help physicians efficiently locate polyps (i.e., regions of interest) in clinical practice by screening colonoscopy images with neural networks (NNs). However, two significant bottlenecks hinder its effectiveness and disappoint physicians' expectations. (1) Polyps vary in scale, orientation, and illumination, making accurate segmentation difficult. (2) Existing works built on the dominant encoder-decoder architecture tend to overlook appearance details (e.g., textures) of tiny polyps, which degrades the accuracy of differentiating polyps. To alleviate these bottlenecks, we investigate a hybrid semantic network (HSNet) that combines the advantages of Transformers and convolutional neural networks (CNNs) to improve polyp segmentation. Our HSNet contains a cross-semantic attention module (CSA), a hybrid semantic complementary module (HSC), and a multi-scale prediction module (MSP). Unlike previous works on polyp segmentation, we introduce the CSA module, which bridges the gap between low-level and high-level features via an interactive mechanism that exchanges two types of semantics from different NN attentions. With a dual-branch structure of Transformer and CNN, we design the HSC module to capture both long-range dependencies and local appearance details. In addition, the MSP module learns weights for fusing the stage-level prediction masks of the decoder. Experimentally, we compared our work with 10 state-of-the-art methods, both recent and classical, and observed improved accuracy (across 7 evaluation metrics) on 5 benchmark datasets: HSNet achieves 0.926/0.877 mDice/mIoU on Kvasir-SEG, 0.948/0.905 on ClinicDB, 0.810/0.735 on ColonDB, 0.808/0.740 on ETIS, and 0.903/0.839 on Endoscene. The proposed model is available at https://github.com/baiboat/HSNet.
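The abstract names three modules but only sketches their roles. Below is a minimal PyTorch sketch of the three ideas as we read them: a cross-attention exchange between low-level and high-level features (CSA-like), a dual Transformer/CNN branch (HSC-like), and learned-weight fusion of stage-level masks (MSP-like). All class names, shapes, and hyperparameters here are illustrative assumptions, not the authors' implementation; consult the linked repository for the actual HSNet.

```python
# Illustrative sketch only; these are NOT the authors' modules.
import torch
import torch.nn as nn


class CrossAttentionExchange(nn.Module):
    """CSA-like toy: low-level and high-level token streams query each
    other, exchanging semantics between encoder stages."""

    def __init__(self, channels, num_heads=4):
        super().__init__()
        self.low_queries_high = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        self.high_queries_low = nn.MultiheadAttention(channels, num_heads, batch_first=True)

    def forward(self, low, high):
        # low, high: (B, N, C) token sequences from different stages
        low_out, _ = self.low_queries_high(low, high, high)
        high_out, _ = self.high_queries_low(high, low, low)
        return low + low_out, high + high_out


class DualBranchBlock(nn.Module):
    """HSC-like toy: a self-attention branch for long-range dependencies
    fused with a convolutional branch for local appearance detail."""

    def __init__(self, channels, num_heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        self.conv = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )
        self.fuse = nn.Conv2d(2 * channels, channels, 1)

    def forward(self, x):
        b, c, h, w = x.shape
        tokens = x.flatten(2).transpose(1, 2)            # (B, H*W, C)
        global_feat, _ = self.attn(tokens, tokens, tokens)
        global_feat = global_feat.transpose(1, 2).reshape(b, c, h, w)
        local_feat = self.conv(x)
        return self.fuse(torch.cat([global_feat, local_feat], dim=1))


class WeightedMaskFusion(nn.Module):
    """MSP-like toy: softmax-normalized learned weights over stage-level
    prediction masks (assumed upsampled to a common resolution)."""

    def __init__(self, num_stages):
        super().__init__()
        self.logits = nn.Parameter(torch.zeros(num_stages))

    def forward(self, masks):
        weights = torch.softmax(self.logits, dim=0)
        return sum(w * m for w, m in zip(weights, masks))


if __name__ == "__main__":
    low = torch.randn(1, 484, 64)                        # 22x22 tokens, C=64
    high = torch.randn(1, 484, 64)
    low2, high2 = CrossAttentionExchange(64)(low, high)
    x = torch.randn(1, 64, 22, 22)
    y = DualBranchBlock(64)(x)                           # (1, 64, 22, 22)
    masks = [torch.sigmoid(torch.randn(1, 1, 88, 88)) for _ in range(3)]
    m = WeightedMaskFusion(3)(masks)                     # (1, 1, 88, 88)
    print(low2.shape, y.shape, m.shape)
```

One design note on the fusion sketch: passing the learned logits through a softmax keeps the stage weights positive and summing to one, so the fused mask stays on the same scale as the individual stage masks regardless of how many decoder stages contribute.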