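Computer science
Artificial intelligence
Convolutional neural network
Pattern recognition (psychology)
Convolution (computer science)
Deep learning
Generalization
Feature extraction
Contextual image classification
Transformation (genetics)
Computer vision
Artificial neural network
Image (mathematics)
Mathematics
Mathematical analysis
Gene
Chemistry
Biochemistry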
Authors
Mengjian Zhang, Guihua Wen, Jiahui Zhong, Dongliang Chen, Changjun Wang, Xuhui Huang, Shijun Zhang
Source
Journal: IEEE Journal of Biomedical and Health Informatics
[Institute of Electrical and Electronics Engineers]
Date: 2023-07-19
Volume (issue): 27 (9): 4385-4396
Citations: 8
Identifier
DOI:10.1109/jbhi.2023.3292312
Abstract
Medical images such as facial and tongue images have been widely used for intelligence-assisted diagnosis, which can be regarded as a multi-label classification task over the disease location (DL) and disease nature (DN) of biomedical images. Compared with complicated convolutional neural networks and Transformers for this task, recent MLP-like architectures are not only simpler and less computationally expensive, but also have stronger generalization capabilities. However, MLP-like models require better input features from the image. Thus, this study proposes a novel convolution complex transformation MLP-like (CCT-MLP) model for the multi-label DL and DN recognition task on facial and tongue images. Notably, a convolutional Tokenizer and multiple convolutional layers are first used to extract better shallow features from the input biomedical images, making up for the loss of spatial information incurred by the simple MLP structure. Subsequently, the Channel-MLP architecture with complex transformations is used to extract deep-level contextual features. In this way, multi-channel features are extracted and mixed to perform multi-label classification of the input biomedical images. Experimental results on our constructed multi-label facial and tongue image datasets demonstrate that our method outperforms existing methods in terms of both accuracy (Acc) and mean average precision (mAP).
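The abstract reports results in terms of Acc and mAP but does not define them. The following is a minimal sketch of how these two metrics are commonly computed for multi-label classification, not the authors' evaluation code. It assumes macro-averaged mAP (average precision per label, then averaged over labels) and per-label accuracy at a 0.5 score threshold; the function names and the toy data are hypothetical.

```python
from typing import List, Sequence


def average_precision(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AP for one label: mean of precision@k taken at the rank of each positive."""
    # Rank samples by descending score (sorted() is stable, so ties keep input order).
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits = 0
    precision_sum = 0.0
    for rank, idx in enumerate(order, start=1):
        if y_true[idx] == 1:
            hits += 1
            precision_sum += hits / rank
    # Assumption: a label with no positive samples contributes AP = 0.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(Y_true: List[List[int]], Y_score: List[List[float]]) -> float:
    """Macro mAP: average the per-label AP over all label columns."""
    n_labels = len(Y_true[0])
    aps = []
    for j in range(n_labels):
        col_true = [row[j] for row in Y_true]
        col_score = [row[j] for row in Y_score]
        aps.append(average_precision(col_true, col_score))
    return sum(aps) / n_labels


def label_accuracy(Y_true: List[List[int]], Y_score: List[List[float]],
                   threshold: float = 0.5) -> float:
    """Fraction of individual (sample, label) decisions that are correct."""
    correct = 0
    total = 0
    for t_row, s_row in zip(Y_true, Y_score):
        for t, s in zip(t_row, s_row):
            correct += int((s >= threshold) == bool(t))
            total += 1
    return correct / total


# Toy data (hypothetical): 4 images, 2 labels; scores stand in for sigmoid outputs.
Y_TRUE = [[1, 0], [0, 1], [1, 1], [0, 0]]
Y_SCORE = [[0.9, 0.2], [0.8, 0.7], [0.6, 0.4], [0.1, 0.3]]

if __name__ == "__main__":
    print(f"mAP = {mean_average_precision(Y_TRUE, Y_SCORE):.4f}")
    print(f"Acc = {label_accuracy(Y_TRUE, Y_SCORE):.4f}")
```

mAP is computed from the ranking of scores and needs no threshold, whereas this form of accuracy depends on the chosen cut-off. The two numbers can therefore move independently when comparing models.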