分割
计算机科学
残余物
人工智能
模式识别(心理学)
特征(语言学)
融合
编码器
舌头
图像分割
算法
语言学
操作系统
哲学
作者
Haibei Song,Zonghai Huang,Li Feng,Yanmei Zhong,Chuanbiao Wen,Jinhong Guo
出处
期刊:Digital health
[SAGE]
日期:2022-01-01
卷期号:8: 205520762211363-205520762211363
被引量:5
标识
DOI:10.1177/20552076221136362
摘要
Due to the complexity of face images, tongue segmentation is susceptible to interference from uneven tongue texture, lips and face, resulting in traditional methods failing to segment the tongue accurately. To address this problem, RAFF-Net, an automatic tongue region segmentation network based on residual attention network and multiscale feature fusion, was proposed. It aims to improve tongue segmentation accuracy and achieve end-to-end automated segmentation.Based on the UNet backbone network, different numbers of ResBlocks combined with the Squeeze-and-Excitation (SE) block was used as an encoder to extract image layered features. The decoder structure of UNet was simplified and the number of parameters of the network model was reduced. Meanwhile, the multiscale feature fusion module was designed to optimize the network parameters by combining a custom loss function instead of the common cross-entropy loss function to further improve the detection accuracy.The RAFF-Net network structure achieved Mean Intersection over Union (MIoU) and F1-score of 97.85% and 97.73%, respectively, which improved 0.56% and 0.46%, respectively, compared with the original UNet; ablation experiments demonstrated that the improved algorithm could contribute to the enhancement of tongue segmentation effect.This study combined the residual attention network with multiscale feature fusion to effectively improve the segmentation accuracy of the tongue region, and optimized the input and output of the UNet network using different numbers of ResBlocks, SE block, multiscale feature fusion and weighted loss function, increased the stability of the network and improved the overall effect of the network.
科研通智能强力驱动
Strongly Powered by AbleSci AI