面部表情
计算机科学
人工智能
卷积神经网络
模式识别(心理学)
表达式(计算机科学)
任务(项目管理)
特征(语言学)
频道(广播)
机器学习
多任务学习
观点
面部表情识别
语音识别
面部识别系统
工程类
视觉艺术
艺术
哲学
程序设计语言
系统工程
语言学
计算机网络
作者
Jingying Chen,Yang Lei,Lei Tan,Ruyi Xu
标识
DOI:10.1016/j.patcog.2022.108753
摘要
Multi-view facial expression recognition (FER) is a challenging computer vision task due to the large intra-class difference caused by viewpoint variations. This paper presents a novel orthogonal channel attention-based multi-task learning (OCA-MTL) approach for FER. The proposed OCA-MTL approach adopts a Siamese convolutional neural network (CNN) to force the multi-view expression recognition model to learn the same features as the frontal expression recognition model. To further enhance the recognition accuracy of non-frontal expression, the multi-view expression model adopts a multi-task learning framework that regards head pose estimation (HPE) as an auxiliary task. A separated channel attention (SCA) module is embedded in the multi-task learning framework to generate individual attention for FER and HPE. Furthermore, orthogonal channel attention loss is presented to force the model to employ different feature channels to represent the facial expression and head pose, thereby decoupling them. The proposed approach is performed on two public facial expression datasets to evaluate its effectiveness and achieves an average recognition accuracy rate of 88.41% under 13 viewpoints on Multi-PIE and 89.04% under 5 viewpoints on KDEF, outperforming state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI