计算机科学
人工智能
卷积神经网络
计算机视觉
一致性(知识库)
编码(集合论)
缩放
特征学习
特征提取
代表(政治)
可视化
模式识别(心理学)
特征(语言学)
石油工程
政治
镜头(地质)
工程类
哲学
语言学
集合(抽象数据类型)
程序设计语言
法学
政治学
作者
Xiaohan Xing,Yixuan Yuan,Max Q.‐H. Meng
出处
期刊:IEEE Transactions on Medical Imaging
[Institute of Electrical and Electronics Engineers]
日期:2020-12-01
卷期号:39 (12): 4047-4059
被引量:31
标识
DOI:10.1109/tmi.2020.3010102
摘要
Wireless capsule endoscopy (WCE) is a novel imaging tool that allows noninvasive visualization of the entire gastrointestinal (GI) tract without causing discomfort to patients. Convolutional neural networks (CNNs), though perform favorably against traditional machine learning methods, show limited capacity in WCE image classification due to the small lesions and background interference. To overcome these limits, we propose a two-branch Attention Guided Deformation Network (AGDN) for WCE image classification. Specifically, the attention maps of branch1 are utilized to guide the amplification of lesion regions on the input images of branch2, thus leading to better representation and inspection of the small lesions. What's more, we devise and insert Third-order Long-range Feature Aggregation (TLFA) modules into the network. By capturing long-range dependencies and aggregating contextual features, TLFAs endow the network with a global contextual view and stronger feature representation and discrimination capability. Furthermore, we propose a novel Deformation based Attention Consistency (DAC) loss to refine the attention maps and achieve the mutual promotion of the two branches. Finally, the global feature embeddings from the two branches are fused to make image label predictions. Extensive experiments show that the proposed AGDN outperforms state-of-the-art methods with an overall classification accuracy of 91.29% on two public WCE datasets. The source code is available at https://github.com/hathawayxxh/WCE-AGDN.
科研通智能强力驱动
Strongly Powered by AbleSci AI