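Computer science
Landmark
Artificial intelligence
Graph
Convolution (computer science)
Pattern recognition (psychology)
Computation
Embedding
Feature (linguistics)
Graph embedding
Computer vision
Theoretical computer science
Artificial neural network
Algorithm
Linguistics
Philosophy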
Authors
Chujie Xu, Wenjie Zheng, Yong Du, Tiejun Li, Zhan-Sheng Yuan
Abstract
Video-based facial expression recognition (FER) models have achieved higher accuracy at the cost of more computation, which makes them unsuitable for online deployment on mobile intelligent terminals. Facial landmarks can model changes in facial expression through their spatial location information rather than texture features, but the classical convolution operation cannot make full use of landmark information. To this end, we propose GELSTM, a novel long short-term memory (LSTM) network with embedded graph convolution, for online video-based FER on mobile intelligent terminals. Specifically, landmark-based face graph data are constructed on the client side. On the server side, we introduce graph convolution, which can effectively mine spatial dependency information in the landmark-based facial graph. The extracted landmark features are then fed to the LSTM for temporal feature aggregation. Experiments on a facial expression dataset show that the proposed method outperforms other deep models.
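
The pipeline described in the abstract (per-frame graph convolution over facial landmarks, followed by LSTM aggregation over time) can be illustrated with a minimal NumPy sketch. This is not the authors' GELSTM implementation: the symmetric adjacency normalization, the single graph-convolution layer, the gate ordering, the layer sizes, and all names (`normalize_adjacency`, `graph_conv`, `lstm_step`, `init_params`, `gelstm_forward`) are assumptions made for illustration only.

```python
import numpy as np


def normalize_adjacency(adj):
    """Assumed normalization: D^-1/2 (A + I) D^-1/2 with self-loops added."""
    a_hat = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def graph_conv(x, a_norm, weight):
    """One graph-convolution layer: ReLU(A_norm @ X @ W).

    x: (num_landmarks, in_features), e.g. (x, y) coordinates per landmark.
    """
    return np.maximum(a_norm @ x @ weight, 0.0)


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def lstm_step(x_t, h, c, w, u, b):
    """Standard LSTM cell; gates stacked as [input, forget, candidate, output]."""
    z = w @ x_t + u @ h + b
    i, f, g, o = np.split(z, 4)
    c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
    h_new = sigmoid(o) * np.tanh(c_new)
    return h_new, c_new


def init_params(num_landmarks, in_feat, gc_feat, hidden, num_classes, seed=0):
    """Hypothetical random initialization, for demonstration only."""
    rng = np.random.default_rng(seed)
    flat = num_landmarks * gc_feat
    return {
        "w_gc": rng.normal(0.0, 0.1, (in_feat, gc_feat)),
        "w": rng.normal(0.0, 0.1, (4 * hidden, flat)),
        "u": rng.normal(0.0, 0.1, (4 * hidden, hidden)),
        "b": np.zeros(4 * hidden),
        "w_out": rng.normal(0.0, 0.1, (num_classes, hidden)),
    }


def gelstm_forward(frames, adj, params):
    """Graph convolution on each frame, then LSTM aggregation over time.

    frames: (num_frames, num_landmarks, in_features) landmark coordinates.
    Returns unnormalized class scores from the last hidden state.
    """
    a_norm = normalize_adjacency(adj)
    hidden = params["u"].shape[1]
    h = np.zeros(hidden)
    c = np.zeros(hidden)
    for x in frames:
        spatial = graph_conv(x, a_norm, params["w_gc"]).reshape(-1)
        h, c = lstm_step(spatial, h, c, params["w"], params["u"], params["b"])
    return params["w_out"] @ h


if __name__ == "__main__":
    # Toy input: 5 landmarks connected in a chain, 4 frames of (x, y) points.
    n = 5
    adj = np.zeros((n, n))
    for k in range(n - 1):
        adj[k, k + 1] = adj[k + 1, k] = 1.0
    frames = np.random.default_rng(1).random((4, n, 2))
    params = init_params(n, in_feat=2, gc_feat=3, hidden=8, num_classes=7)
    print(gelstm_forward(frames, adj, params).shape)
```

In a client/server split like the one the abstract outlines, only the landmark coordinates (the `frames` array here) would need to leave the device, which is far smaller than raw video; how the authors actually build the face graph and partition the work is not specified in the abstract.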