计算机科学
增采样
人工智能
分割
特征(语言学)
图像分割
模式识别(心理学)
深度学习
卷积神经网络
卷积(计算机科学)
基于分割的对象分类
尺度空间分割
舌头
像素
编码器
图像(数学)
计算机视觉
算法
人工神经网络
哲学
语言学
操作系统
作者
Chang Tian,Yanjung Liu,Meng Li,Chaofan Fen
出处
期刊:Journal of Intelligent and Fuzzy Systems
[IOS Press]
日期:2023-07-02
卷期号:45 (1): 1473-1480
摘要
The key step in the intelligence of tongue diagnosis is the segmentation of the tongue image, and the accuracy of the segmented edges has a significant impact on the subsequent medical judgment. Deep learning can predict the class of pixel points to achieve pixel-level segmentation of images, so it can be used to handle tongue segmentation tasks. However, different models have different segmentation effects, and they did not learn the connection between space and channels, resulting in inaccurate tongue segmentation. This paper first discussed the choice of model and loss function and then compared the results of different options to find the better model. Associating the red feature of the tongue is very conducive to segmentation as a feature, this paper tested many methods to try to get the color features of the original image to be paid attention to. Finally, this paper proposed an improved Encoder-Decoder network model to solve the problem based on the results. Start with Resnet as the backbone network, then introduce the U-Net model, and then we fused the attention layer, obtained from the source image through convolution and CBAM attention mechanism, and the feature layer obtained from the last upsampling in U-Net. Experimental results show that: The new, improved algorithm results are 2-3 percentage points higher than the popular algorithm, making it more suitable for tongue segmentation tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI