面部表情
计算机科学
地标
人工智能
稳健性(进化)
图形
模式识别(心理学)
计算机视觉
理论计算机科学
生物化学
基因
化学
作者
Rui Zhao,Tianshan Liu,Zixun Huang,Daniel Pak-Kong Lun,Kin‐Man Lam
标识
DOI:10.1109/taffc.2022.3181736
摘要
Facial expression recognition (FER) is of great interest to the current studies of human-computer interaction. In this paper, we propose a novel geometry-guided facial expression recognition framework, based on graph convolutional networks and transformers, to perform effective emotion recognition from videos. Specifically, we detect and utilize facial landmarks to construct a spatial-temporal graph, based on both the landmark coordinates and local appearance, for representing a facial expression sequence. The graph convolutional blocks and transformer modules are employed to produce high-semantic emotion-related representations from the structured facial graphs, which facilitate the framework to establish both the local and non-local dependency between the vertices. Moreover, spatial and temporal attention mechanisms are introduced into graph-based learning to promote FER reasoning, via the emphasis on the most informative facial components and frames. Extensive experiments demonstrate that the proposed framework achieves promising performance for geometry-based FER and shows great generalization and robustness in real-world applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI