Authors
Shuxian Liu,Wei Wang,Le Deng,Huan Xu
Identifier
DOI:10.1016/j.bspc.2024.106621
Abstract
The existence of fundus diseases not only endangers people's vision but also imposes a serious economic burden on society. Fundus images are an objective and standard basis for diagnosing fundus diseases. With the continuous advancement of computer science, deep learning methods dominated by convolutional neural networks (CNNs) have been widely used in fundus image classification. However, current CNN-based fundus image classification still has considerable room for improvement: a CNN cannot effectively avoid interference from repetitive background information, and its ability to model global context is limited. In response to these findings, this paper proposes the CNN-Trans model, a parallel dual-branch network consisting of a CNN-LSTM branch and a Vision Transformer (ViT) branch. The CNN-LSTM branch uses Xception, after transfer learning, as the primary feature extractor; an LSTM placed before the classification head addresses the vanishing-gradient problem in network iterations. A new lightweight attention mechanism, Coordinate Attention, is introduced between Xception and the LSTM to emphasize information relevant to classification and suppress less useful repetitive background information. Meanwhile, the self-attention mechanism in the ViT branch is not limited to local interactions; it can establish long-range dependencies on the target and extract global features. Finally, a concatenation (Concat) operation fuses the features of the two branches: the local features extracted by the CNN-LSTM branch and the global features extracted by the ViT branch are complementary, and after fusion the more comprehensive image feature information is passed to the classification layer.
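The Coordinate Attention step described above can be illustrated with a minimal numpy sketch. This is not the paper's implementation: it assumes a single feature map of shape (C, H, W), uses plain matrices `w_h` and `w_w` in place of the shared bottleneck convolutions of the published Coordinate Attention block, and omits batch normalization. It shows only the core idea of pooling along each spatial axis and gating the input with direction-aware sigmoid weights.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def coordinate_attention(x, w_h, w_w):
    """Simplified coordinate attention on a (C, H, W) feature map.

    Pools along each spatial axis separately, then re-weights the input
    with sigmoid gates that preserve positional information per axis.
    """
    pool_h = x.mean(axis=2)            # (C, H): average over width
    pool_w = x.mean(axis=1)            # (C, W): average over height
    g_h = sigmoid(w_h @ pool_h)        # (C, H) gate along the height axis
    g_w = sigmoid(w_w @ pool_w)        # (C, W) gate along the width axis
    return x * g_h[:, :, None] * g_w[:, None, :]

# Hypothetical sizes for illustration only
C, H, W = 4, 8, 8
x = rng.standard_normal((C, H, W))
w_h = rng.standard_normal((C, C)) * 0.1
w_w = rng.standard_normal((C, C)) * 0.1
y = coordinate_attention(x, w_h, w_w)
```

Because both gates lie in (0, 1), the output never amplifies a location; it only attenuates features the gates deem less relevant, which is how repetitive background responses get suppressed.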
Finally, extensive experiments and comparisons show that the CNN-Trans model achieves an accuracy of 80.68% on the fundus image classification task, with classification performance comparable to state-of-the-art methods.
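The Concat fusion of the two branches can likewise be sketched in a few lines. The feature dimensions (256 for the CNN-LSTM branch, 768 for the ViT branch) and the 8-way classification head are hypothetical placeholders, not values taken from the paper; the sketch only shows how concatenated branch features feed a single softmax classification layer.

```python
import numpy as np

rng = np.random.default_rng(0)

def fuse_and_classify(local_feat, global_feat, w_cls, b_cls):
    """Concat-fuse local (CNN-LSTM) and global (ViT) features, then classify."""
    fused = np.concatenate([local_feat, global_feat])   # (d_local + d_global,)
    logits = w_cls @ fused + b_cls                      # (num_classes,)
    e = np.exp(logits - logits.max())                   # stable softmax
    return e / e.sum()

local_feat = rng.standard_normal(256)    # hypothetical CNN-LSTM feature dim
global_feat = rng.standard_normal(768)   # hypothetical ViT feature dim
w_cls = rng.standard_normal((8, 1024)) * 0.01
b_cls = np.zeros(8)
probs = fuse_and_classify(local_feat, global_feat, w_cls, b_cls)
```

Concatenation keeps both feature sets intact rather than averaging them, so the classification layer can learn its own weighting of local versus global evidence.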