人工智能
计算机科学
神经模糊
深度学习
模糊逻辑
比例(比率)
学习迁移
领域(数学分析)
模式识别(心理学)
模糊控制系统
机器学习
数学
数学分析
物理
量子力学
作者
Qiankun Li,Yimou Wang,Yani Zhang,Zhihong Zuo,Junxin Chen,Wei Wang
出处
期刊:IEEE Transactions on Fuzzy Systems
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:: 1-12
标识
DOI:10.1109/tfuzz.2024.3400861
摘要
The surge in visual general big data has notably advanced data-driven deep learning-based computer vision technologies. Transformer-based methods shine in this era of big data because of their attention mechanism architecture and demand for massive data. However, the difficulty of obtaining medical images has caused the field to continue facing the limited-data challenge. In this paper, we propose a novel deep neuro-fuzzy system named Fuzzy-ViT, which synergistically integrates fuzzy logic with the Vision Transformer (ViT) for cross-domain transfer learning from large-scale general data to medical image domain. Specifically, Fuzzy-ViT utilizes a ViT backbone pre-trained on extensive general datasets such as ImageNet-21K, LAION-400M, and LAION-2B to extract rich general features. Then, a Fuzzy Attention Cross-Domain Module (FACM) is presented to transfer general features to medical features, thereby enhancing the medical image analysis. Thanks to the Fuzzy System Transitioner (FST) in FACM, fuzzy and uninterpretable general domain features can be effectively converted into those needed in the medical domain. In addition, the Attention Mechanism Smoother (AMS) in FACM smoothes the conversion outcomes, ensuring a harmonious integration of the fuzzy system with the neural network architecture. Experimental results demonstrate that the proposed Fuzzy-ViT achieves state-of-the-art and satisfactory performance on popular medical image benchmarks (BreakHis and HCRF) with 93.37% and 97.22% F1 scores. Detailed ablation analysis demonstrates that the effectiveness of our method for bridging large general visual and medical images.
科研通智能强力驱动
Strongly Powered by AbleSci AI