人工智能
计算机科学
分割
图像分割
模糊逻辑
模式识别(心理学)
计算机视觉
像素
基于分割的对象分类
尺度空间分割
特征(语言学)
语言学
哲学
作者
Lei Yang,Shilin Wang,Alan Wee‐Chung Liew
出处
期刊:IEEE Transactions on Fuzzy Systems
[Institute of Electrical and Electronics Engineers]
日期:2023-07-24
卷期号:32 (2): 349-359
被引量:4
标识
DOI:10.1109/tfuzz.2023.3298323
摘要
Fine-grained lip image segmentation plays a critical role in downstream tasks, such as automatic lipreading, as it enables the accurate identification of inner mouth components, such as teeth and tongue, which are essential for comprehending spoken utterances. However, achieving accurate and robust lip image segmentation in natural scenes is still challenging due to significant variations in lighting condition, head pose, and background. This article proposes a novel deep neural network-based method for fine-grained lip image segmentation that exploits fuzzy and graph theories to handle these variations. A fuzzy learning module is designed to deal with the uncertainties in color and edge information and enhance feature maps at various scales. The fuzzy graph reasoning module with fuzzy projection models the relationship among semantics components and achieves a global receptive field. In our experiments, a fine-grained lip region segmentation dataset, i.e., FLRSeg, is built for evaluation, and experimental results have shown that the proposed method can achieve superior segmentation performance (94.36% in pixel accuracy and 74.89% in mIoU) compared with several state-of-the-art (SOTA) lip image segmentation methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI